Yadu Sarathchandran, PhD

Yadu Sarathchandran, PhD

Data Scientist | Machine Learning Engineer | Physicist

About Me

Experienced researcher specializing in machine learning, statistical analysis, and data engineering. With a PhD in Physics (2022), I bring a unique perspective to solving impactful problems in R&D and business. My expertise lies in synthesizing results quickly and effectively, driving impactful decisions through data-driven insights.

Professional Experience

Tula Health

Senior Data Scientist

Sep 2023 - Present · 2 years

  • Developed and implemented machine learning pipelines using Neural Network models (CNN, RNN, LSTM, Transformer) and tree-based models (Random Forest, XGBoost, Catboost) to process and analyze noisy biometric data (photoplethysmography, bioimpedance, other healthcare data), achieving high predictive performance.
  • Led the advancement of data modeling and preprocessing pipeline, implementing data validation protocols and utilizing Git for version control, enhancing pipeline maturity.
  • Ran large-scale experiments to benchmark ML features and models and collaborated with cross-functional teams to develop advanced mathematical models for signal processing and predictive analytics in biomarker trends.
  • Utilized AWS services for model deployment data storage and processing, ensuring scalable and efficient ML operations.

Data Scientist

Aug 2022 - Sep 2023 · 1 yr 2 mos

  • Advanced analytics, signal processing, and machine learning from wearable data to predict biomarkers.
  • Algorithmic development and data infrastructure for the R&D team.
  • Computational Modeling of physiological systems.

Booster Fuels, Inc.

Data Scientist

April 2022 – Aug. 2022

  • Designed and executed large-scale experiments using Python and SQL to simulate lifts in KPIs, resulting in improved customer retention.
  • Implemented A/B testing framework using API calls to proprietary algorithms, driving increase in the North Star metric.
  • Developed and maintained data pipelines and reporting dashboards using dbt, Git, and Looker to support product improvements and track user behavior.
  • Conducted in-depth analyses on user engagement and behavior patterns to improve operational efficiency and inform strategic decision-making.

Oak Ridge National Laboratory (ORNL)

Research Assistant - PhD

Aug. 2016 – April 2022

  • Applied scientific techniques to analyze complex experimental data, solving a decades-old problem in physics, resulting in 3 technical talks and a publication.
  • Implemented and optimized simulation techniques in High Performance Computing (HPC) environments utilizing GPU and CUDA, enhancing model performance and accuracy.
  • Developed custom data analysis and visualization frameworks using Python to investigate material properties, effectively communicating findings through 2 technical talks at conferences.

Skills & Expertise

Machine Learning

  • Supervised Learning: Neural Networks, Tree-based methods, Bayesian analysis, SVM
  • Unsupervised Learning: Clustering, PCA, Autoencoders
  • Deep Learning: CNNs, RNNs, LSTM, Transformer (TensorFlow, PyTorch)
  • NLP: HuggingFace, LangChain, NLTK

Programming & Tools

  • Languages: SQL, Python, Bash
  • Databases: PostgreSQL, MongoDB
  • Tech Stack: PyTorch, Scikit-learn, Pandas, TensorFlow, dbt, Looker, Tableau
  • Cloud: AWS (S3, Lambda, SageMaker, Redshift), GCP (Vertex AI)

Transferable Skills

  • Data Analysis & Experimentation
  • Simulations & A/B Testing
  • Statistical Analysis
  • Quantitative Modelling
  • Project Management
  • Product Development

Projects

Bloodraven - Ask your personal Greenseer all your questions about ASOIAF!

  • Developed an interactive chatbot for multi-turn question-answering using Retrieval-Augmented Generation (RAG) with large language models (LLMs) like Gemma/LLama, integrated through HuggingFace, LangChain, and FAISS. The model was optimized on an NVIDIA T4 GPU.
  • Built a responsive web interface using Gradio, providing real-time interactions and context-aware answers through seamless integration with the RAG system.
  • Collected relevant knowledge using web-scraped data and the Wikipedia API, enriching the retrieval process to ensure accurate responses. Deployed the app on Hugging Face Spaces for easy public access: Bloodraven - your personal Greenseer

AdTech Product Experimentation Analysis

  • Analyzed an A/B testing experiment to determine the success of a new ad product introduced to reduce the overspending on the advertising platform to improve the efficiency of allocation of advertising resources.
  • Defined metrics and performed statistical analysis on the overspend, revenue, and budget in the control and treatment groups, across segments, and determined that the new product increases the platform revenue by 50%.
  • Check out my article in Medium: Product Experimentation Analysis

AI-Powered Book Recommendation System

  • Developed a personalized book recommendation system using GPT4All (an open-source LLM) and FastAPI, providing users with contextual book suggestions based on their genre preferences, themes, and reading level requirements.
  • Implemented a responsive frontend using vanilla JavaScript and modern CSS, integrated with Google Sites for seamless user experience, featuring real-time recommendations and detailed explanations for each suggested book.
  • Built a scalable backend architecture using FastAPI and CORS middleware, enabling asynchronous API endpoints and cross-platform compatibility, with the GPT4All orca-mini-3b model handling contextual understanding and recommendation generation.
  • Check out the website (ongoing development!) - The Restricted Section

Customer Churn Analysis at Robinhood

  • Administered exploratory data analysis (data cleaning, visualization, feature engineering) on the investment portfolio data of 5500 users to determine the customer churn rate by using statistical and time-series principles.
  • Implemented machine learning algorithms (Random Forest, XGBoost) to predict customer churn. Deployed the XGBoost model in the AWS Cloud using SageMaker with API Gateway to predict user churn (F1 score - 0.91).
  • Check out the code repository in Github: Customer Churn Analysis and Prediction

RAGFeynman; LLM Question Answering Assistant

  • Developed a question-answering assistant about the life and teachings of Feynman by leveraging Retrieval-Augmented Generation (RAG) with large language models (LLMs) such as Gemma or TinyLlama by utilizing HuggingFace, Langchain, and FAISS.
  • Implemented a web interface using Streamlit for user-friendly interaction with the RAG system designed to retrieve relevant documents and augment the information with the capabilities of open-sourced LLM models.
  • Ensured accurate and detailed answers by combining retrieval-based and generation-based methods. Check out the source code repository in Github: RAGFeynman

Loan Application Prediction

  • Created a machine learning model to identify which new applicants should be given a loan in the future. Wrangled two large datasets (one contained application data for every customer that has been given a loan in a 6 month period. The other contained every loan that has been given in this time and whether it has been a good loan or a bad loan).
  • Implemented a binary classification model to accurately predict the default rate or the defined success of given loans using machine learning algorithms (Tree-based, XGBoost) with an F1 score of 0.78, and deployed using Flask.
  • The model out-performed traditional lending models based on credit-scores. Check out the code repository in Github: Loan Application Prediction

Fraud Detection and Analysis of Financial Transactions

  • Wrangled a large dataset of financial transactions from credit cards by EU cardholders in September 2013, and performed an exploratory data analysis procedure (visualization, class-balancing, feature engineering).
  • Implemented binary classification models to predict fraudulent transactions based on machine learning algorithms (Tree-based, XGBoost) with a high F1 score of 0.94, and deployed in AWS Cloud (SageMaker, Lambda, S3).

Education

University of Tennessee, Knoxville, TN

Ph.D. in Physics, Minor in Computational Sciences

M.S. in Physics

IISER Thiruvananthapuram

BS-MS Dual-Degree in Physics, Minor in Chemistry

Testimonials

Sunishchal Dev

AI Safety Researcher at RAND | Research Mentor at Algoverse

"Yadu was a solid asset to our routing team at Booster Fuels, consistently delivering on our data and reporting needs. He showed real initiative, diving deep to understand how our data flowed and using that knowledge to simulate business scenarios that informed strategic decisions around customer retention. What stood out most was his positive attitude and genuine eagerness to learn—I always looked forward to our 1:1s. I truly enjoyed working with Yadu and hope we get to collaborate again."

Magnus Skonberg

Data Science Manager at Fortegra

"Yadu has an incredibly powerful, robust mindset and approaches all obstacles with an open mind and a can-do attitude. Despite joining Booster Fuels recently, his contributions as an Analyst were felt immediately. Through his strong technical and written communication skills, Yadu set the bar for analysis reporting and contributed directly toward one of our markets being set on pace for a 33% efficiency improvement. Yadu has an incredibly bright future ahead of him, and I offer the strongest of recommendations based on his technical and emotional intelligence, attention to detail, and infectious positive energy.
If there are any questions, please feel free to contact me directly."

Dima Bolmatov, Ph.D.

Incoming Assistant Professor at Texas Tech University

"I have known Yadu for the last three years as a colleague at the Shull Wollan Center at Oak Ridge National Laboratory, and most recently, as a collaborator. During this time, I had the privilege of observing how he led multiple research projects—from implementing state-of-the-art data analysis and developing experimental techniques to troubleshooting and generating new scientific insights from complex data. Though highly self-motivated and capable of working independently, Yadu is a generous team member and collaborator, always willing to support his peers. I truly enjoyed our interactions and discovered in Yadu a hardworking, driven individual who will no doubt deliver impactful breakthroughs in any field he chooses.
If you're seeking a motivated, versatile professional with strong scientific skills, a team-oriented mindset, and the drive to deliver results, I would wholeheartedly recommend bringing Yadu in to your team!"

Contact Me