Speaking at PyCon India 2025: “RAGs to Riches – Efficient Web Chatbot with Async Scraping & Hybrid Re-Ranking”

I am excited to share that I will be speaking at PyCon India 2025 in Bangalore on a topic that is very close to my current work in applied Generative AI:
RAGs to Riches: Efficient Web Chatbot with Async Scraping & Hybrid Re-Ranking
This session focuses on building production-grade Retrieval-Augmented Generation (RAG) systems that move beyond demos and actually scale.
What the talk is about
The talk walks through how to:
build a web chatbot powered by RAG architecture
perform asynchronous web scraping to keep knowledge fresh
use hybrid re-ranking (semantic + keyword + metadata signals)
optimize latency, recall, and cost
avoid common pitfalls like hallucinations and noisy retrieval
Attendees will see how asynchronous pipelines dramatically improve ingestion throughput and how retrieval quality depends far more on chunking, indexing, and ranking strategies than just “which LLM is used.”
Who this talk is for
Python developers
data and ML engineers
architects deploying LLM systems
anyone moving from prototype to real-world AI workloads
You will gain practical patterns for deploying web-scale chatbots backed by modern Python async capabilities and hybrid search techniques.
Why this matters
As LLM adoption accelerates, the real differentiator is not model selection but how intelligently you retrieve and rank information. This talk emphasizes:
high-quality data pipelines
efficient Python async design
evaluation techniques for RAG quality
reliability, observability, and safety
Closing note
PyCon India has always been a community I admire, and it is an honor to contribute to it in 2025. I am looking forward to connecting with developers, researchers, and practitioners who are building the next generation of AI systems.
If you are attending PyCon India 2025 in Bangalore, I would be happy to meet you there.



