I will develop production rag systems engineered for latency cost and trust

A
anwarullahkhan1
A
anwarullahkhan1
Anwar K

About this gig

Most RAG chatbots are demos in production cosplay they screenshot well and fall apart by the third user question. I build the version that doesn't.


For teams whose demo (yours, ChatGPT, or a freelancer's) needs to become something users trust.


𝗘𝗡𝗚𝗜𝗡𝗘𝗘𝗥𝗘𝗗 𝗔𝗚𝗔𝗜𝗡𝗦𝗧 𝗙𝗢𝗨𝗥 𝗕𝗨𝗗𝗚𝗘𝗧𝗦:

Retrieval BM25 + dense + reranker, RAGAS context precision >0.75

Latency sub-800ms time-to-first-token, p95 under 2.5s

Cost typical $0.0008/query on gpt-4o-mini, modeled up front

Trust faithfulness >0.85, source citations, per-query observability


𝗣𝗥𝗢𝗢𝗙, 𝗡𝗢𝗧 𝗣𝗥𝗢𝗠𝗜𝗦𝗘𝗦

Every build ships with an eval report against YOUR docs and YOUR Q&A pairs. Miss the agreed thresholds and you don't pay the final 30%. In writing.


𝗖𝗔𝗣𝗔𝗖𝗜𝗧𝗬

Two production builds per month. If my reply badge shows >24h, I'm full that week.


𝗡𝗢𝗧 𝗙𝗢𝗥 𝗬𝗢𝗨 𝗜𝗙

You're shopping ChatGPT wrappers under $200. Plenty of those book one.


𝗡𝗘𝗫𝗧 𝗦𝗧𝗘𝗣

Send a 1-paragraph problem statement, a sample doc, and three example user questions. I reply within 24h with a fixed quote or a referral.

Get to know Anwar K

Anwar K

AI Software Engineer

  • FromPakistan
  • Member sinceFeb 2026
  • Avg. response time1 hour
  • Languages

    English
I’m a full-stack engineer with deep exposure to modern AI—ranging from ML to agentic systems—and hands-on experience delivering production-ready applications for clients and startups. I build scalable, cloud-ready solutions across backend, frontend, and mobile, combining technical depth with a product-driven mindset to deliver reliable, high-impact results.

My Portfolio

Other AI Development Services I Offer