Portfolio — Vol. 01 / 2026
India · Open to Collaborations

Antara
Shaw.

Data Scientist · Data Engineer · AI Enthusiast · Bioinformatics Graduate

Transforming data into insights and building intelligent systems that solve real-world problems — with the rigour of a scientist and the craft of a designer.

Data Science Machine Learning GenAI Bioinformatics Cloud Pipelines Research Data Science Machine Learning GenAI Bioinformatics Cloud Pipelines Research Data Science Machine Learning GenAI Bioinformatics Cloud Pipelines Research
01 — About
Portrait of Antara Shaw

Curious by nature,
rigorous by training.

I work at the intersection of data, biology, and intelligence — designing systems that learn, scale and explain themselves. My practice moves fluidly between research notebooks, production pipelines and generative AI applications.

A Biotechnology graduate from NIT Durgapur, I've led campus chapters, shipped cloud-native analytics, and authored research on protein structure prediction. I care about clarity — in code, in charts and in conversation.

01
Data Science

Statistical modelling, experimentation, and storytelling with data.

02
Data Engineering

Robust pipelines, warehousing, and real-time event systems.

03
Artificial Intelligence

Applied ML, deep learning, and generative AI architectures.

04
Bioinformatics

Sequence analysis, structural biology, and molecular research.

A short timeline
  1. 2020
    NIT Durgapur
    Began undergraduate studies in Biotechnology.
  2. 2022
    Leadership
    General Secretary, SPIC MACAY — curating cultural programming.
  3. 2023
    Research
    Bioinformatic analysis of GPCR-like proteins using AlphaFold.
  4. 2024
    Data & AI
    Built production data pipelines and multi-agent AI systems.
02 — Capabilities

A toolkit refined
over many seasons.

Selected technologies and methods I reach for when shaping a new problem.

012

Programming

  • Python
  • SQL
025

Data

  • Pandas
  • NumPy
  • Power BI
  • Tableau
  • Excel
033

Machine Learning

  • Scikit-learn
  • TensorFlow
  • PyTorch
043

Cloud & DevOps

  • AWS
  • GCP
  • Docker
054

AI & GenAI

  • LangChain
  • Hugging Face
  • RAG
  • LLMs
064

Bioinformatics

  • BLAST
  • HMMER
  • AlphaFold
  • InterProScan
03 — Selected Work

Featured
projects.

A curated edit of six projects spanning data engineering, applied AI and computational biology.

YouTube Trending Video Analytics
01

YouTube Trending Video Analytics

Cloud-native analytics platform ingesting trending video data and surfacing audience patterns through interactive dashboards.

  • AWS
  • Athena
  • QuickSight
  • SQL
Real-Time Traffic Prediction System
02

Real-Time Traffic Prediction System

Streaming pipeline forecasting congestion using deep learning over live geospatial signals.

  • TensorFlow
  • Kafka
  • Docker
  • Google Maps API
Smart Disaster Response Coordination
03

Smart Disaster Response Coordination

Multi-agent system orchestrating resources and communications across emergency response teams.

  • GCP
  • Pub/Sub
  • Firestore
  • Multi-Agent AI
Emotion-Based Music Recommendation
04

Emotion-Based Music Recommendation

Recommender that interprets user affect to curate personalised music journeys in real time.

  • Python
  • Machine Learning
  • Cloud
House Price Prediction Model
05

House Price Prediction Model

Regression model with feature engineering pipeline and uncertainty calibration for market estimates.

  • Python
  • Scikit-learn
Bioinformatic Analysis of GPCR-like Proteins
06

Bioinformatic Analysis of GPCR-like Proteins

Structural and evolutionary study of GPCR-like receptors using multiple sequence alignment and AlphaFold.

  • BLAST
  • MAFFT
  • AlphaFold
04 — Experience & Leadership

A practice built
across disciplines.

  1. 2024 — Present
    Data Engineering (Entry-Level)

    Building data platforms, ETL workflows and analytics infrastructure for production teams.

  2. 2022 — 2024
    General Secretary, SPIC MACAY

    Led the campus chapter — curated heritage programming and managed cross-functional volunteer teams.

  3. 2021 — 2023
    Video Editor & Volunteer

    Produced narrative video content and supported community outreach initiatives.

  4. 2022 — 2024
    Research & Academic Projects

    Authored independent research on bioinformatics, ML and AI systems.

05 — Achievements
01
LeetCode
SQL Badge
02
LeetCode
Pandas Badge
03
Coursework
ML & GenAI Certifications
04
Lab
Bioinformatics Research
05
Campus
Community Leadership
06 — Contact

Let's build
something rare.

Whether it's a research collaboration, an AI product, or a data platform — I'd love to hear what you're working on.