Skip to content
View ChaitanyaSaik's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ChaitanyaSaik

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ChaitanyaSaik/README.md

Typing SVG


LinkedIn GitHub Email


🧠 About Me

class ChaitanyaSaiKurapati:
    def __init__(self):
        self.role         = "AI/ML Engineer | Full-Stack Developer"
        self.education    = "B.Tech CSE @ Amrita Vishwa Vidyapeetham (2022–2026)"
        self.cgpa         = 8.47
        self.focus        = ["LLMs", "RAG Architectures", "Distributed Systems", "Production ML"]
        self.publications = ["ICCATEET 2026", "IEEE INDISCON 2025"]
        self.leetcode     = "250+ problems solved | Top 15%"
        self.motto        = "Ship to production, not just notebooks."

    def current_work(self):
        return "AI-powered Resume Intelligence @ Alpxin Technologies"

    def ask_me_about(self):
        return ["LangChain", "FAISS", "FastAPI", "PyTorch", "Apache Spark"]

πŸ“Š Impact at a Glance

🎯 Metric πŸš€ Result
Resume–Job Matching Accuracy 80% with Sentence Transformers + FAISS
Medical Imaging Classifier 94.2% accuracy / 0.96 AUC (Published)
RAG Hallucination Rate Reduced from 34% β†’ 8%
Distributed Pipeline Throughput 4Γ— improvement on 1M+ records/day
URL Shortener Load 50K concurrent requests handled
Malware False Positives βˆ’20% via NLP-based approach
AutoML Experiment Time < 3 minutes end-to-end

πŸ”₯ Featured Projects

Project Description Stack Highlight
🧠 AI Resume Intelligence NLP-based resume–job matching system FastAPI Β· FAISS Β· Sentence Transformers Β· Streamlit 80% match accuracy
πŸ“„ Smart Document Assistant (RAG) Production RAG pipeline with minimal hallucination LangChain Β· FAISS Β· OpenAI Β· Python 34% β†’ 8% hallucination
πŸ”— Scalable URL Shortener High-throughput distributed shortener Flask Β· Redis Β· MySQL 50K concurrent requests
⚑ AutoML Platform Automated model selection & hyperparameter tuning Scikit-learn · Python <3 min experiments
πŸ“Š Distributed Data Pipeline Large-scale Spark ETL pipeline PySpark Β· Apache Spark Β· AWS S3 1M+ records/day, 4Γ— throughput
πŸ₯ Chest X-Ray Disease Detection Multi-label classification using Vision Transformers PyTorch Β· Vision Transformer Β· DenseNet 94.2% accuracy Β· 0.96 AUC
πŸ›‘ Malware Detection via LLMs LLM-enhanced static analysis for malware NLP Β· Transformers Β· Python βˆ’20% false positives

πŸ› οΈ Tech Stack

🐍 Languages

Python Java SQL

πŸ€– AI / ML / LLM

PyTorch Scikit-Learn HuggingFace LangChain OpenAI XGBoost

βš™οΈ Backend & APIs

FastAPI Flask Streamlit

πŸ—„οΈ Databases

MySQL PostgreSQL

☁️ Cloud & Big Data

AWS Apache Spark


πŸ“š Publications

πŸ“„ Multi-Label Chest X-Ray Diagnosis

ICCATEET 2026

Proposed a hybrid Vision Transformer + DenseNet architecture for multi-label disease classification on chest X-rays. Achieved 94.2% accuracy and 0.96 AUC across 14 pathological categories.

πŸ“‘ TCP vs UDP in Multi-Robot Systems

IEEE INDISCON 2025

Comparative analysis of TCP and UDP communication protocols in multi-robot coordination environments, with latency benchmarking and reliability tradeoff analysis under real-world noise conditions.


πŸ† Achievements

πŸ₯‡ Β Hackathon Finalist β€” Top 8 out of 120 teams Β |Β  πŸ’» Β 250+ LeetCode problems β€” Top 15% globally

πŸ‘¨β€πŸ« Β Mentored 15+ students in Data Structures & Algorithms Β |Β  ✍️ Β Published technical articles on AI & ML

πŸ“– Β 2Γ— Peer-Reviewed Publications (IEEE + ICCATEET) Β |Β  ⚑ Β AI/ML Intern @ Alpxin Technologies


πŸ’Ό Experience

🏒 AI/ML Intern β€” Alpxin Technologies (Apr 2025 – Jul 2025)

  • 🎯 Built an AI-powered Resume Intelligence System achieving 80% accuracy in job-to-resume matching
  • πŸ” Improved resume parsing and semantic similarity using Sentence Transformers + FAISS
  • 🧩 Designed a skill-gap detection pipeline leveraging LLMs for targeted candidate feedback
  • πŸš€ Deployed end-to-end scalable system using FastAPI + Streamlit with production-ready architecture

🎯 Currently Focused On

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  πŸ”­ Scalable ML systems for production workloads         β”‚
β”‚  πŸ§ͺ Advanced LLM + RAG architectures                     β”‚
β”‚  ☁️  Cloud-native AI deployment (AWS + Docker)           β”‚
β”‚  πŸ“Š Distributed data engineering with Apache Spark       β”‚
β”‚  πŸŽ“ Graduating May 2026 β€” Open to MLE / SDE Roles        β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

🀝 Let's Connect

I'm always open to collaborating on AI/ML projects, discussing research ideas, or exploring full-time opportunities.

LinkedIn Email GitHub


"The best model is the one that's running in production."

⭐ If you find my work valuable, consider starring a repo β€” it means a lot!

Pinned Loading

  1. Evaluating-TCP-vs.-UDP-in-Centralized-Multi-Robot-Communication-using-Star-Topology Evaluating-TCP-vs.-UDP-in-Centralized-Multi-Robot-Communication-using-Star-Topology Public

    This project presents a MATLAB-based simulation framework that evaluates the performance of TCP (Transmission Control Protocol) and UDP (User Datagram Protocol) in a centralized multi-robot communi…

    MATLAB 1

  2. SIPA-Real-Time-AI-Interview-Preparation-Assistant SIPA-Real-Time-AI-Interview-Preparation-Assistant Public

    A comprehensive AI-powered platform designed to help job seekers excel in technical interviews through intelligent resume analysis, mock interview simulations, and personalized learning paths.

    Python 1

  3. Distributed-Data-Processing-using-PySpark Distributed-Data-Processing-using-PySpark Public

    This project demonstrates a scalable data processing pipeline using Apache Spark (PySpark) to handle large datasets efficiently. The focus is on comparing distributed computing with traditional seq…

    Jupyter Notebook

  4. Intelligent-AutoML-Engine-for-End-to-End-Data-Analysis-Predictive-Modeling Intelligent-AutoML-Engine-for-End-to-End-Data-Analysis-Predictive-Modeling Public

    An end-to-end ML & DL workflow automation platform built with Streamlit that empowers users to train, evaluate, visualize, and deploy machine learning models seamlessly β€” without writing code. It s…

    Python

  5. Multi-Label-Chest-X-Ray-Diagnosis-using-Vision-Transformers-and-CNNs Multi-Label-Chest-X-Ray-Diagnosis-using-Vision-Transformers-and-CNNs Public

    A deep learning research project comparing Vision Transformers (ViT) and DenseNet121 CNN for multi-label chest disease classification using the NIH ChestX-ray14 dataset.

    Jupyter Notebook