ABJASREE.
Vol. 04 — Fourth year of serviceOpen to new opportunities

ABJASREE.

Lead data scientist building intelligent clinical AI — 4+ years shipping GenAI, clinical NLP, and computer-vision pipelines that turn unstructured medical data into decisions.

  • 04Years
  • 03Companies
  • Curiosity
Abjasree — Lead Data Scientist, headshot
Fig. 1 — The scientist. India · remote / hybrid / on-site.
Sec. 01

A profile of the scientist

Data, treated as decisions

I'm a Lead Data Scientist with 4+ years of experience turning unstructured healthcare data into business-impactful solutions. Specializing in generative AI, LLMs, and clinical AI, I build scalable NLP and CV pipelines that enhance accuracy, reduce manual review time, and drive operational efficiency.

My work spans from radiomics-based diagnostics to AI-powered document processing and therapy-outcome analytics, delivering measurable improvements in healthcare systems. I'm passionate about transforming unstructured medical data into actionable insights that accelerate healthcare decision-making.

Role
Lead Data Scientist · DATYCS
Stack
Python · PyTorch · AWS · Azure
Based
India · open to remote
Superpower
turning unstructured data into clinical insight
Sec. 02

Record of work

2021 → present
  1. Entry 03Apr 2024 — PresentLead Data Scientist · Remote

    DATYCS

    • Led a 6-member GenAI & Research squad; built AWS/Azure CI/CD reducing model roll-out time by −80% (3 weeks → 5 days).
    • Architected & deployed 3 transformer-based NLP/CV pipelines processing 10K+ AML/CLL charts into Snowflake, boosting throughput +35%.
    • Fine-tuned TrOCR for handwritten prescriptions, improving accuracy +25% and enabling real-time medication safety alerts.
    • Deployed Vision LLM on Azure to parse multi-handwriting prior-authorization forms into structured JSON (+28% accuracy, −45% manual review).
    • Leveraged autonomous AI agents for patient-specific search validation and clinician response generation (+30% accuracy, −40% reply time).
    • Partnered with Eversana to analyze AML & CLL therapy-switch drivers and adverse effects, shaping pharmaceutical market-access strategies.
  2. Entry 02Aug 2022 — Mar 2024Data Scientist · Chennai

    IIT Madras

    • Automated analysis of 1,000+ CT/MRI images, reducing manual preprocessing time by −80%.
    • Increased renal calculi & pulmonary nodule diagnostic accuracy by +18% using CNN + radiomics + Bayesian-optimized XGBoost.
    • Deployed QA models at Apollo Proton Cancer Center, cutting turnaround −10% and flagging 12 high-risk plans per quarter.
  3. Entry 01Dec 2021 — Aug 2022Teaching Assistant · Part-time

    Univ.AI

    • Assisted in teaching machine learning and data science courses to students from diverse backgrounds.
    • Mentored students in practical ML projects and helped them understand complex AI/ML concepts.
    • Contributed to curriculum development and course material preparation.
Sec. 03

Published findings

Research · publications
  1. F.01Oct 2023 — Jan 2024Publication · Nature Sci. Reports

    Radiomics for NSCLC Classification ↗

    Built radiomics model with 83% accuracy for differentiating adenocarcinoma and squamous cell carcinoma in non-small cell lung cancer beyond nodule morphology in chest CT. Decision-curve analysis showed net clinical benefit across thresholds.

    • Radiomics
    • Machine learning
    • Medical imaging
  2. F.02Sept 2022 — Oct 2023Med-NeurIPS 2023

    Zero-Shot Image Registration (ZSIR-FE) ↗

    Designed DNN feature-based registration improving Dice score by +0.30 and running faster than SIFT on BRaTs dataset through feature extraction approach.

    • Deep learning
    • Computer vision
    • Image registration
  3. F.03Sept 2022 — Oct 2023Publication · Nature Sci. Reports

    Pulmonary Nodules Classification ↗

    Engineered radiomic features + XGBoost on limited Indian dataset, achieving 89% accuracy (+10% vs prior studies) for pulmonary nodules classification from chest CT scans.

    • XGBoost
    • Feature engineering
    • Medical AI
  4. F.04Sept 2022 — Jul 2023Clinical deployment

    Automatic Detection of Renal Calculi

    Processed 163 CT scans and built deep-learning pipeline reaching 85% accuracy (+25 pp improvement); slated for clinical deployment.

    • Deep learning
    • Medical imaging
    • Clinical AI
  5. F.05May 2019 — June 2020MS Thesis

    Elliptic Flow of Light Nuclei ↗

    Simulated heavy-ion collisions with AMPT; computed elliptic flow using STAR detector data for light nuclei. Developed C++ macros to compare Coalescence models with experimental measurements, achieving thesis distinction. Also worked on simple ML classification models to distinguish signals and background.

    • HPC
    • C++
    • Machine learning
Sec. 04

Bill of materials

Tools in active service

Languages & Frameworks

  • 01Python
  • 02SQL
  • 03C++
  • 04R
  • 05MATLAB
  • 06PyTorch
  • 07TensorFlow
  • 08Hugging Face
  • 09Scikit-Learn
  • 10LangChain

Cloud & MLOps

  • 11AWS SageMaker
  • 12AWS S3
  • 13Azure ML
  • 14Azure Functions
  • 15Snowflake
  • 16Docker
  • 17CI/CD

AI/ML Domains

  • 18GenAI & LLMs
  • 19Clinical NLP
  • 20Computer Vision
  • 21RAG
  • 22AI Agents
  • 23Reinforcement Learning
Sec. 05

Verified records

Certifications · education

Certifications

Education

Sec. 06

Open a line

All transmissions read

Got a role, a problem, or just curious about my work? Drop a line — I read everything that lands in my inbox.

Enter your full name
Enter a valid email address
Describe the opportunity or inquiry