VM
At LIST · ENVISION Unit

VaibhavMangroliya

Turning data into decisions.

Data Science • Machine Learning • AI • M.Sc. Mathematics

I write Python that goes to production. Right now that means a data-validation pipeline at LIST that gates Luxembourg's national environmental database. Before that, almost two years at India's National Stock Exchange: NAV calculation and XBRL parsing for 2,700+ listed companies.

Luxembourg 🇱🇺

M.Sc.
Mathematics, Uni.lu
5+
Applied ML / data projects
DataQA
Prod. data pipeline · LIST
~2 yrs
Production engineering · NSE India
About

The thread?Turning data into decisions.

From a stock-exchange trading floor to a maths degree in Luxembourg to building intelligent systems: one throughline.

My journey

I started close to the markets, nearly two years at India’s National Stock Exchange, building production financial systems: a Python NAV-calculation tool across the Fair Value Hierarchy and the Oracle-SQL XBRL parser behind 2,700+ listed companies. Working that close to data is what pulled me toward the quantitative, model-driven side.

That pull brought me to the University of Luxembourg, where I’m finishing an M.Sc. in Mathematics with my focus firmly on machine learning, AI, and data. Alongside it I’ve built a real ML project portfolio (physics-informed neural networks, LSTM/XGBoost forecasters, and NLP with transformers), with the DeepLearning.AI foundations to back the theory.

Today at LIST I build DataQA, a production Python pipeline that ingests Luxembourg’s national environmental time-series, validates it against everything that can quietly go wrong, and loads the clean results back. Real data engineering: ETL, validation, scale, and the edge cases you only find when they break something, the foundation every good model is actually built on.

Looking for

Data ScientistMachine Learning EngineerData EngineerApplied ML / AIQuant / ML (finance)

Availability

Available from autumn 2026, once my current LIST internship concludes. Open to next steps in Luxembourg and across the EU.

01 / 03

Machine Learning & Modelling

Hands-on ML across a real project portfolio: physics-informed neural networks (PINNs) for option pricing at 99.8% accuracy, LSTM/XGBoost forecasting ensembles, and NLP with Hugging Face transformers. PyTorch, TensorFlow, scikit-learn, grounded in an applied-maths MSc and DeepLearning.AI's Supervised ML & Neural Networks courses.

02 / 03

Production Python & Data Engineering

Production Python at LIST: pandas, NumPy, pytest, ruff, REST APIs. Drove order-of-magnitude speedups by vectorising every check module, designed a Seasonal Check on per-(month, hour) statistical bands, and ship via GitLab merge requests. ETL, data validation, time-series at scale.

03 / 03

Quantitative Finance: a domain edge

Two years applying engineering to markets at NSE India: NAV computation, Fair Value Hierarchy (L1/2/3), VaR & Expected Shortfall, options pricing & the Greeks, GARCH. A domain I can speak fluently, and now apply machine learning and data to.

How I work

My standard is simple: code I won’t be embarrassed by in six months. At LIST that has meant order-of-magnitude speedups across the validation pipeline, eliminating CI hangs by hardening every external API call, and designing and shipping a new Seasonal Check module built on per-(month, hour) statistical bands. At NSE it meant a normalised 23-table SQL schema that cut data errors by 40%, instead of another fragile Excel workflow.

I also teach the things I learn. My YouTube channel has 290K+ views: Assembly, Engineering Physics, admissions guides. Explaining work to a non-expert audience is the same skill you need to write good client-facing reports.

Skills

Stack &expertise.

Machine learning and data engineering first, with the maths and quant-finance ground I have behind it.

PyTorch/TensorFlow/scikit-learn/Hugging Face/XGBoost/LSTM/Transformers/Python/pandas/NumPy/SciPy/pytest/SQL/Power BI/ETL/Airflow/Time-Series/Git/Linux/Black-Scholes/Monte Carlo/NSE/PyTorch/TensorFlow/scikit-learn/Hugging Face/XGBoost/LSTM/Transformers/Python/pandas/NumPy/SciPy/pytest/SQL/Power BI/ETL/Airflow/Time-Series/Git/Linux/Black-Scholes/Monte Carlo/NSE/

Machine Learning & Deep Learning

PyTorchTensorFlowscikit-learnHugging FaceTransformers / NLPLSTMXGBoostPINNsPCAMonte Carlo

Python & Data Engineering

pandasNumPySciPypytestruffREST APIs (requests)Vectorised processingETL pipelinesTime-seriesData quality & validation

Data & Databases

SQLOracleSchema designPower BIData modellingPandas / Notebooks

Quantitative Finance

NAV CalculationFair Value HierarchyBlack-ScholesGreeks (Δ, Γ, Θ, V, ρ)VaR (Param/Hist/MC)Expected ShortfallGARCH/EGARCHModern Portfolio Theory

Tooling & Languages

GitGitLab MRsGitHubLinuxJava / Spring BootMATLABLaTeXBloomberg

Currently Exploring

Apache AirflowApache KafkaETL orchestrationMLOps basicsStreaming pipelines
Experience

Where I’veshipped.

Two roles, one habit: write Python that other engineers (and regulators) can read.

Research Intern, ENVISION Unit (LEO Observatory)

InternshipCurrent

Luxembourg Institute of Science and Technology (LIST)

Apr 2026 – PresentEsch-Belval, Luxembourg
  • Maintainer of DataQA, a production Python pipeline (pandas, pytest, ruff) that validates Luxembourg’s national environmental time-series (air temperature, humidity, precipitation, global irradiance) against the KISTERS WISKI database via the KiWIS REST API. 65+ merge requests authored and ~200 merges reviewed in just my first 2 months.
  • Built Global Irradiance validation from scratch: three new checks (physical-limit 0–1100 W/m² with night-time flagging via solar zenith angle, constant-value, and 20 km spatial-consistency), including the solar_zenith.py utility that derives sun position from station coordinates.
  • Designed and shipped a new Seasonal Check module for temperature and humidity, using per-(month, hour) statistical bands from historically validated data with a 90-day on-disk threshold cache.
  • Vectorised every iterrows() loop across all check modules for order-of-magnitude speedups, dropped NumPy as a direct dependency, and brought the codebase into pandas 3 / copy-on-write compliance. Hardened all KiWIS API calls with timeouts, eliminating CI hangs.
  • Grew the test suite to 373 functions at ~98% coverage with a skip-invariant harness and Ruff type-annotation enforcement. Built rank-correlating-stations, a companion pipeline producing per-parameter Pearson-correlation rankings (AT 0.94–0.99) across the 12-station network to feed DataQA’s spatial-consistency check, catching a glob-collision bug silently mixing 81 precipitation files into temperature analysis.
  • Shipped a standalone River Discharge Hydrograph Generator CLI that renders print-resolution hydrographs from KiWIS data over per-calendar-day percentile baselines, built only from validated rows, with 100% test coverage and CI on Python 3.13.

Student Research Assistant, Department of Mathematics

Part-time

University of Luxembourg

May 2025 – PresentEsch-sur-Alzette, Luxembourg
  • Supported the Mathematics Department in preparing and typesetting research papers and academic documents in LaTeX (Overleaf), ensuring consistent formatting of equations, citations, and document structure.
  • Assisted with the organisation of departmental events and academic activities, coordinating logistics and on-site support.
  • Provided general academic and administrative support to faculty across the department.

Associate Systems Analyst

Full-time

National Stock Exchange of India (NSE)

Dec 2022 – Jun 2024Mumbai, India
  • Developed a Python-based NAV calculation tool automating Fair Value hierarchy classification (Level 1/2/3 assets), asset-liability aggregation, and Net Asset Value computation from Oracle database, directly applicable to investment-fund valuation and reporting.
  • Built an XBRL parsing system transforming unstructured financial-statement data (Balance Sheet, P&L, Cash Flow) into a normalised SQL schema (23 tables). Reduced data errors by 40% and enabled automated validation across 2,700+ listed companies.
  • Built Java / Spring Boot regulatory-compliance web applications enabling NSE’s compliance team to process SEBI filings, replacing manual Excel-based workflows with automated, audit-ready pipelines.
Education

Academicbackground.

University of Luxembourg

M.Sc. in Mathematics

Mathematical Modelling & Computational Sciences

09/2024 – Present

Vidyalankar Institute of Technology

B.E. in Electronics & Telecommunication

Grade: 1.4, Top 4% in department

08/2018 – 05/2022

Credentials

Certifications.

Click any card to expand and see the full topic coverage.

Supervised Machine Learning: Regression and Classification

DeepLearning.AI (Coursera)

Linear RegressionLogistic RegressionGradient DescentCost FunctionsRegularizationClassificationModel Evaluationscikit-learn / NumPy

Neural Networks and Deep Learning

DeepLearning.AI (Coursera)

Neural Network FundamentalsForward & Backward PropagationGradient DescentVectorisation (NumPy)Activation FunctionsLogistic Regression as a NNDeep L-layer Networks

ETL and Data Pipelines with Shell, Airflow and Kafka

IBM (Coursera)

ETL vs ELTShell ScriptingApache Airflow DAGsApache KafkaBatch & StreamingData PipelinesBash Automation

Getting Started with Power BI

LinkedIn Learning

Power BI DesktopData ModelingDAX BasicsDashboards & ReportsPower QueryData Visualization

Complete Python Developer

Zero to Mastery (Udemy)

OOP in PythonDecorators & GeneratorsFile I/OWeb ScrapingTesting & Debugging

Portfolio and Risk Management

University of Geneva (Coursera)

Modern Portfolio TheoryCAPMEfficient FrontierStrategic Asset AllocationTactical Asset AllocationValue-at-Risk (VaR)Expected ShortfallCurrency Risk HedgingForwards & OptionsPortfolio OptimizationCorrelation Analysis

Bloomberg Finance Fundamentals

Bloomberg LP

Financial System & Money FlowInvestment Types & InstrumentsStock ExchangesRisk & Return AnalysisPortfolio ManagementESG & Responsible Investing

Corporate Finance Fundamentals

Coursera

Financial StatementsTime Value of MoneyCapital BudgetingDCF AnalysisCost of Capital

Data Structures in JAVA

Coding Ninjas

Arrays & Linked ListsStacks & QueuesTrees & GraphsRecursionSorting & Searching
Projects

Selectedwork.

Things I built because I wanted to understand them properly. Machine learning, math, and data: usually all three.

FeaturedPyTorchPINNsDeep LearningBlack-Scholes

Finance-Informed Neural Networks for Option Pricing

PINN embedding Black-Scholes PDE constraints directly into the neural-network loss function.

Achieved 99.8% accuracy against analytical solutions with 40× faster inference than finite-difference methods.

Custom automatic differentiation for Greeks computation (Delta, Gamma, Theta).

View on GitHub
Interactive

Explore the Quant Lab

Live pricing tools, simulation notebooks, and quantitative finance experiments, running in your browser.

Visit Quant Lab
Testimonials

What people say.

“Vaibhav consistently stood out as a sharp and dependable professional. He showed a high level of ownership in his work, often handling critical modules with minimal guidance. Beyond his technical skills, Vaibhav is a collaborative team player with a professional attitude. I confidently recommend him for roles that require strong analytical thinking and problem-solving ability.”

Rahil Kamani

National Stock Exchange of India • 7.1 yrs exp.

View on LinkedIn
Available for opportunities

Let’s talk.

Best for data science, machine learning, and data-engineering roles, with a Risk Management edge in finance. I read every message.

Luxembourg 🇱🇺CEST (UTC+2)Open to relocation across the EU