Vincenzo Ventriglia

I'm a

About Me

Vincenzo Ventriglia

Data Scientist & Machine Learning Engineer

A results-driven data professional – focused on hype-free solutions tailored to business needs.

I am currently creating value at the National Institute of Geophysics and Volcanology (INGV), where I develop machine learning models in the Space Weather domain. My job is complemented by finding the hidden stories in data and make them accessible to stakeholders. I studied Physics in Italy and Germany, previously worked on Analytics in the strategic division of the world's largest professional services network, and in the Data Science department of the leading Italian publisher.

When not at work, I enjoy theatre (maybe you spotted San Carlo in the sidebar), diving into personal finance, learning a new language, or organising PyData Roma Capitale

  • Latest Role: Data Scientist & MLE @ INGV
  • Based in: Rome & Naples, Italy

Short CV

This is a (very) short CV; if you are interested in the full one, please contact me.

Experience

Data Scientist & MLE

12/2023 - to date

Istituto Nazionale di Geofisica e Vulcanologia (INGV), Rome (IT)

Organiser & Community Manager

11/2024 - to date

PyData Roma Capitale, Rome (IT)

Guest Lecturer

12/2025 - 12/2025

Sapienza Università di Roma, Rome (IT)

Data Scientist

09/2022 - 11/2023

Zanichelli Editore, Bologna (IT)

Data Engineer

08/2021 - 08/2022

Deloitte Consulting, Bologna (IT)

Education

MSc in Theoretical Physics

09/2017 - 07/2020

Università degli Studi di Napoli Federico II, Naples (IT)

Erasmus+

03/2019 - 08/2019

Goethe-Universität Frankfurt am Main, Frankfurt am Main (DE)

BSc in Physics

09/2013 - 05/2017

Università degli Studi della Campania L. Vanvitelli, Caserta (IT)

Research Intern

02/2017 - 05/2017

Istituto Nazionale di Fisica Nucleare (INFN), Naples (IT)

Skills

Programming

Python
  • Core language since 2019 for Data Science, Machine Learning, and backend development
Rust
  • INGV: Developed a high-performance GNSS package, slashing processing times from minutes to seconds
SQL
  • Zanichelli: ETL for ML processing, data analysis
  • Deloitte: ETL, data warehousing, BI
C
  • My first exposure to programming; used for numerical analysis and linear algebra

AI Frameworks & ML

PyTorch, Hugging Face, LangChain
  • Deep Learning, NLP, LLM integrations
YOLO
  • INGV: Automated pattern recognition and anomaly detection for GNSS/HF signal integrity
CatBoost, scikit-learn
  • Probabilistic & time-series forecasting, explainable clustering, advanced feature engineering
SHAP
  • XAI solution for model validation and explanation

Data Engineering

Pandas, Polars
  • High-performance data manipulation and advanced feature engineering
ODI, Dagster
  • Definition, implementation, and orchestration of modern ETL data pipelines
Databases
  • Relational & Cloud: Oracle, Postgres, AWS Athena
  • NoSQL: MongoDB
Pydantic
  • Strict data validation for robust Python applications

Data Viz & BI

Plotly, Seaborn, Quarto
  • Reporting, static rendering, and interactive data visualisations
Streamlit
  • Developed actionable web apps from scratch to democratise data access for C-suite
Power BI, Tableau
  • Near real-time sales dashboards and BI reporting for leading retail and corporate clients
Excel, VBA
  • Data analysis and corporate finance support

CI/CD & DevOps

Git, Docker
  • Version control, containerisation, reproducible environments
Jenkins
  • Automated CI/CD pipelines for seamless integration and deployment
MLflow, Optuna
  • Model lifecycle management, experiment tracking, and hyperparameter optimisation
FastAPI
  • Building highly performant and robust APIs for machine learning model inference

Leadership & Engagement

Public speaking
  • Presented at 20+ international conferences (industry & academic)
  • Guest Lecturer on Python & AI at Sapienza University
Community building
  • Organiser @ PyData Roma Capitale
  • Planning events, managing logistics, and fostering an inclusive tech community
Mentorship
  • Mentoring junior data scientists, promoting modelling and architectural best practices
Agile leadership
  • Tech Lead in R&D environments, balancing priorities, deadlines, and cross-functional communication

Languages

Italian
  • Native
English
  • Fluent (C2)
  • University, research, and daily work in English
German
  • Pre-intermediate (A2)
  • Erasmus+ Exchange in Frankfurt am Main

Expertise

Machine Learning


  • Time Series:
    • Scalable forecasting with statistical, econometric and machine learning models
    • Probabilistic forecasting and post-hoc uncertainty quantification with Conformal Prediction
    • Advanced feature engineering

  • Real-time inference: Designed, developed and deployed real-time models, integrating data from heliophysics satellites, ionosondes, and magnetometers

  • XAI: Explainability with SHAP, a game theoretic approach to explain the output of any model

  • Other: Executed a variety of classification, regression, and clustering tasks, including user segmentation, churn prediction, and demand/sales forecasting for strategic planning

  • Business Intelligence & Analytics


  • Web Applications:
    • Full-stack development of an actionable web app to democratise data access for C-suite, which has become the de facto standard in the company
    • Developed a web app to produce near real-time statistics on any investment portfolio

  • Dashboard: Near real-time sales dashboard for a leading hearing aid retailer

  • ETL & Data Integration:
    • Data integration for corporate finance
    • Creation of a central data warehouse for CRM
    • Contributed to the definition and implementation of modern data pipelines
  • Space


  • Space Weather: Tech lead in the development of explainable forecasting models, supporting cost-sensitive decision-making for GNSS/HF disruption mitigation in aerospace & defence applications

  • GNSS:
    • Developed a high-performance, lightweight (Python + Rust) package for modelling ionospheric quantities from multi-constellation GNSS observables, slashing processing times from minutes to seconds
    • Probabilistic forecast of low-latitude ionospheric scintillation

  • Remote Sensing: Satellite imagery to perform Earth observations to study the effect of wildfires, assessing vegetation health and its changes over time

  • Relativistic Astrophysics: Numerical integration of light-like geodesics of a rotating black-hole spacetime, simulating photons orbiting the black hole

  • Data-driven Marketing


  • A/B testing: Assessing the impact of marketing campaigns

  • CRM: Creation of a central data warehouse to ensure uniform CRM KPIs and consistent marketing funnel analysis for a leading hearing aid retailer

  • Digital:
    • Designed a Next-Best-Action recommender for sales reps, clustering high-volume digital sessions for a leading Italian publisher
    • Customer sentiment and feedback analysis with NLP techniques for a major European bank
  • Finance


  • Corporate Finance: Data integration for corporate finance, support for financial controllers interacting with mission-critical analytics applications for a leading automotive manufacturer

  • Portfolio Management: Starting from buy/sell transactions, Personal Finance for Newbies (or PFN) produces easy-to-use, near real-time statistics on any investment portfolio, providing insights from higher-level metrics (P&L, asset class weights) to those pertaining to risk and returns over time

  • M&A: Market analyses to support CEO strategic decision-making for multi-million euro potential M&As

  • Conferences

    Here are some conferences and initiatives related to AI, data or broader research.

    PyData Roma Capitale logo

    PyData Roma Capitale | Organiser

    Part of the organising team of the Roman chapter of PyData, a community for everyone who loves Python, data and meeting tech fellows.

    Our goal is to foster an inclusive environment for connecting, sharing work, and exchanging ideas on evolving challenges in AI, data science & engineering, research, and industry. We are passionate about open-source tools and bringing together enthusiasts and professionals from diverse backgrounds.

    Website Meetup Linkedin

    Some conferences I have attended or will attend


    Where we'll meet
    • PyCon Lithuania 2026
       |  Vilnius, Lithuania  |  Speaker
    • PyCon DE & PyData 2026
       |  Darmstadt, Germany  |  Speaker
    • PyCon Italia 2026
       |  Bologna, Italy  |  Speaker
    • PyData London 2026
       |  London, UK  |  Speaker
    • EuroPython 2026
       |  Kraków, Poland  |  Speaker
    • Committee on Space Research (COSPAR) 2026 – 46th Scientific Assembly
       |  Florence, Italy  |  Speaker

    Where we've met
    2025
    • New Space Economy 2025 – European Expoforum
       |  Rome, Italy  |  Exhibitor  |  YouTube  |  Spotify
    • PyData Eindhoven 2025
       |  Eindhoven, Netherlands  |  Speaker
    • Beacon Satellite Symposium 2025
       |  Rome, Italy  |  Speaker
    • AI in Physical Sciences
       |  Rome, Italy  |  Keynote Speaker
    • IAGA / IASPEI Joint Scientific Meeting 2025
       |  Lisbon, Portugal  |  Speaker
    • Mathematics for Signal Processing and Applications in Geophysics and Other Fields
       |  L'Aquila, Italy  |  Speaker
    • PyCon DE & PyData 2025
       |  Darmstadt, Germany  |  Speaker  |  YouTube
    • Machine Learning and Computer Vision in Heliophysics
       |  Sofia, Bulgaria  |  Speaker
    2024
    • New Space Economy 2024 – European Expoforum
       |  Rome, Italy  |  Exhibitor
    • Space Weather Italian Community Congress
       |  Rome, Italy  |  Poster
    • PyData Amsterdam 2024
       |  Amsterdam, Netherlands  |  Attendee
    • 4th URSI Atlantic Radio Science Meeting (AT-RASC)
       |  Gran Canaria, Spain  |  Poster
    2023
    • PyCon Italia 2023
       |  Florence, Italy  |  Attendee

    Projects

    Here is a selection of projects I have worked on as a researcher, as a student or in my leisure time.

    • All
    • Time Series
    • Remote Sensing
    • Space
    • Finance
    • Other
    PyTECGg logo

    Total Electron Content (TEC) reconstruction with GNSS data – a Python 🐍 package with a Rust 🦀 core

    GitHub PyPI
    Scintill-AI project

    Research project for GNSS ionospheric scintillation forecasting at low latitudes

    GitHub
    Personal Finance for Newbies

    Web app to produce near real-time statistics on your investment portfolio

    GitHub Web App
    Iono-LuGRE project

    Sensing of the Earth's Plasmasphere and Ionosphere from the Lunar surface with GNSS signals

    GitHub Web App
    T-FORS project

    Traveling Ionospheric Disturbances Forecasting System (T-FORS), funded by the European Community, Horizon Europe

    GitHub Paper
    Vesuvius Sentinel (Earth Observation project)

    Satellite imagery project (Sentinel-2 mission) to study Mount Vesuvius pre- and post-July 2017 wildfires

    GitHub
    k-means for time series analysis

    Machine Learning algorithm to identify periods of growth, decline and stationarity in stock data

    GitHub
    Black hole ray tracing image

    Image of a black hole, produced by ray tracing photons in a rotating spacetime in General Relativity

    GitHub
    Quantum Neural Network

    Does adding quantum features improve the overall performance of a neural network?

    GitHub
    Secret santa mailer project

    Draw a recipient for each secret Santa and send an email to each Santa's inbox of who their gift recipient is

    GitHub