Budha Sree M
Home About Skills Projects Experience Contact
profile pic

Budha Sree M


Data Engineer
Data Quality Analyst
Visualisation

India

Resume

About

I care about making data accessible, ethical, and powerful across public policy, development data and product teams.

I'm known for my Quality Assurance mindset - I catch what others miss. I care deeply about how data is structured, tested, and told. That's also why I'm drawn to data visualisation. The lines between data engineer and visualiser, backend and frontend, are getting blurry and that makes it an exciting space to work in.

I have a solid background of 7+ years in Business Intelligence (T-SQL, Oracle PL/SQL, RDBMS, MyBI, SSIS) and a deep interest in the intersection of data, development, and decision-making. In last one year, I have also trained myself on Tableau, Power BI, Python, D3.js & AWS Cloud in last one year and worked on a few independent projects. 

I'm currently exploring: data engineering, data quality assurance, data journalism, development data, and story-driven analytics. 

Let's connect if you're working with:
Open government data & social sector analytics
Data storytelling, Business Intelligence, and Data Visualisation
Ethical AI, transparency, or policy research
Health, development, or civic tech projects



Skills

Programming Languages:
Visualization & Reporting:
Developer Tools:
Version Control:
Project Management:
Cloud:

Connect With Me

Projects

Here is a preview of my recent projects.

lng_pttrn
Language Patterns in India

An interactive India Map showing Monolinguals, Bilinguals and Trilinguals across India.

ASER
ASER - Reading Assessment

Can children in Std III, V, and VIII read a Std II-level text or a story?

 Check out more projects 

Professional Experience

Data Engineer | Data Analyst | Quality Assurance

Mar 2024 - Present

  • Published a Tableau data story visualizing ASER (2018–2024) reading assessment to highlight India’s silent learning crisis, the impact of COVID and foundational literacy gaps.
  • Created a Power BI dashboard that projects LFPR across age-groups, gender, regions & religions to identify inconsistencies and reinforce quality across multiple datasets.
  • Built a local SQL Express database that integrates socio-economic datasets on education (UDISE+ and ASER), labour (LFPR) and linguistic census in India.
  • Conducted data profiling, validation, and cleansing across public datasets (UDISE+, ASER, LFPR), focusing on accuracy and completeness for reporting and dashboarding.
  • Diagnosed gaps in UDISE+ School Infrastructure reliability (e.g., clean water, sanitation) across 1.47 million schools using Python and simulated uncertainty scenarios using Monte Carlo model.
  • Authored One Nation, Two Maps research analyzing the mismatch in District vs. PC boundaries in India through Lancet & Harvard SDG studies through Power BI dashboards, demonstrating the consequences of inconsistent geographic mappings.
  • Built a Sankey chart using D3.js to reveal how school dropout reasons have narrowed and fragmented over time in India, surfacing invisible gaps and incomplete classifications in India’s education data.
  • Designed a D3.js-powered Interactive India Map visual for Language Patterns across 36 Indian states/UTs, maintaining regional consistency and avoiding redundancy in state-wise linguistic categorization.

Civil Services Aspirant

Aug 2019 - Feb 2022

  • Quit IT to pursue Indian Civil Services (UPSC) with focus on Public Administration, Ethics and Governance.
  • Studied Indian institutional data structures, their gaps, and reporting inconsistencies, which now inform my approach to ETL validation and public-sector data quality & governance./li>

Infosys

Technology Analyst & Lead

Jan 2019 - Oct 2019

  • Insta Award 2019 for initiatives to design & publish newsletters, creating user guides & leading projects.
  • Provided leadership to a team of 5 members and trained them on Oracle SQL & Azure, in addition reduced on-boarding time by 85% via custom technical guides.
  • Developed cross-validation scripts and business logic test cases, helping reduce data mismatches and improving QA turnaround time.

Business Developer & Data Quality Analyst

Jul 2017 - Dec 2018

  • Infosys Recognition Award 2018 by AMP for designing & building a new Data Store by integrating both structured policy data & unstructured claims narratives, improving reporting readiness for APRA compliance.
  • Automated ETL workflows using SSIS and embedded C# to process and clean unstructured insurance claims with inconsistent formatting.
  • Built MS Access dashboards and user-facing interfaces to help business teams clean and amend erroneous data directly, improving data accuracy and transparency.
  • Participated in defect triage & coordinated resolution of root cause issues across ETL modules.

Developer Programmer & Data Quality Analyst

Mar 2015 - Jun 2017

  • Infosys Recognition Award 2017 by AMP for developing Value of Lost Business metric and resolving conflicts between Actuaries & Finance using QA test scenarios which reduced discrepancies from 12% to 0.01%.
  • PLAY Winner for developing a completely new data model with highly scalable rule engine for Register Valuation of commissions for easier analysis & analytics using MyBI reporting.
  • Identified fraudulent fee payments to advisors, which was highly recognised by CEO.
  • Improved FATCA compliance accuracy from 60% to 99% using SQL REGEXP based parsing.
  • Highly configurable, Metadata-driven, scalable ETL design for Net Promotor Score.

Senior Systems Engineer

Oct 2012 - Feb 2015

  • GEM Award 2014 for delivering a complex project with intense business analysis, which minimized the projection of business loss by 30% and in addition, minimized rework by 40% ensuring compliance and dataquality.
  • Power Programmer Award 2014 for my excellent programming skills as a Junior developer.
  • Knowledge Transfer sessions on various Subject Areas of Client Business.

Get in touch

Feel free to reach out to me for any queries or collaborations.