Skip to content

Pr1ncePS2002/Backend_System_Design_For-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

84 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Backend System Design for AI

A comprehensive collection of handwritten notes on backend system design concepts for AI applications.

For downloading notes : https://drive.google.com/drive/folders/1HeiTfb70as7mTTbolUF_78RXx68L5zGT?usp=drive_link

Table of Contents

Networking, Web & Security

Databases

Vector DB & Search

Caching

System Design & Architecture


1.1. Networking and Web Fundamentals

Networking and Web Fundamentals

Networking and Web Fundamentals

What Happens When You Enter google.com

What Happens When You Enter google.com

DNS

DNS

DNS Records

DNS Records


1.2. Web Sockets + Authentication & Security

WebSockets

WebSockets

Authentication and Security

Authentication and Security

Sessions

Sessions

Hashing and Salting

Hashing and Salting


1.3. Token Based Auth & Rate Limiting

Token Based Auth

Token Based Auth

Access Control List and Rule Engine

Access Control List and Rule Engine

Rate Limiting

Rate Limiting

Distributed Rate Limiting

Distributed Rate Limiting


1.4. Prompt Injection & PII Masking

API Keys

API Keys

Prompt Injection

Prompt Injection

PII Masking Implementation

PII Masking Implementation


1.5. MCP Tool Authorisation

MCP Tool Authorisation

MCP Tool Authorisation

Checklist Before Executing a Tool

Checklist Before Executing a Tool


2.1. Database Fundamentals

Database and Storage Fundamentals

Database and Storage Fundamentals

SQL vs NoSQL

SQL vs NoSQL

NoSQL Types

NoSQL Types

Transactions

Transactions


2.2. Indexes, Query Optimisation, Normalisation & Denormalisation

Database Optimisation

Database Optimisation

Indexing Strategies

Indexing Strategies

Query Optimisation

Query Optimisation

Normalisation vs Denormalisation

Normalisation vs Denormalisation


2.3. Bloom Filter & Location Based Databases

Bloom Filters

Bloom Filters

Location Based Databases

Location Based Databases


2.4. Sharding & Replication

Distributed Databases

Distributed Databases

Types of Sharding

Types of Sharding

Replication

Replication


2.5. DB Migration, Connection Pooling & NoSQL Optimisation

NoSQL Optimisation

NoSQL Optimisation

DB Migration

DB Migration

Connection Pooling

Connection Pooling


3. Vector DB, Metadata Filtering & Hybrid Search

AI Specific Storage

AI Specific Storage

Why Indexing

Why Indexing

Embedding Storage

Embedding Storage

Metadata Filtering

Metadata Filtering

Hybrid Search

Hybrid Search

RAG and Document Retrieval

RAG and Document Retrieval


4.1. Caching, Types of Caching, Redis and CDN

Caching

Caching

Caching Strategies

Caching Strategies

Redis

Redis

Content Delivery Network

Content Delivery Network


4.2. Distributed Caching, Cache Replacement Policies, Thrashing, AI Based Caching

Distributed Caching

Distributed Caching

Cache Replacement Policies

Cache Replacement Policies

Cache Thrashing

Cache Thrashing

AI Response Caching

AI Response Caching


5.1. System Design

System Design

System Design

Trade-offs and Limitations

Trade-offs and Limitations


5.2. Monolith vs Microservices

Monolith vs Microservices

Monolith vs Microservices

Microservices Architecture

Microservices Architecture

Monolith to Microservice Migration

Monolith to Microservice Migration


5.3. Scaling, Load Balancing, Capacity Estimation, Consistent Hashing

Scaling

Scaling

Real System Example

Real System Example

Capacity Estimation

Capacity Estimation

Load Balancing

Load Balancing

Consistent Hashing

Consistent Hashing

Virtual Nodes

Virtual Nodes

Scaling LLM Workloads

Scaling LLM Workloads

Cost Based Scaling

Cost Based Scaling


5.4. API Design, Patterns, GraphQL, gRPC, Streaming APIs for LLMs

REST API Design

REST API Design

Best Practices in REST API Design

Best Practices in REST API Design

GraphQL

GraphQL

gRPC

gRPC

Asynchronous APIs

Asynchronous APIs

API Gateway Pattern

API Gateway Pattern

Backend for Frontend

Backend for Frontend

Streaming APIs for LLMs

Streaming APIs for LLMs

5.5. Message Queues, Kafka, RabbitMQ, Pub/Sub, Event-Driven Architecture, DB as Queues

Message Queues

Message Queues

Kafka

Kafka

RabbitMQ

RabbitMQ

Publish Subscribe Model

Publish Subscribe Model

Event-Driven Architecture

Event-Driven Architecture

Database as Queue

Database as Queue

AI Task Queues

AI Task Queues


5.6. Eventual Consistency, AI System Reliability, Model Fallback Strategy

Eventual Consistency

Eventual Consistency

AI System Reliability

AI System Reliability

Model Fallback Strategies

Model Fallback Strategies

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors