Who is Michael Wiryaseputra?

Michael Wiryaseputra is an Indonesian AI/ML Engineer and bootcamp trainer based in Semarang. He builds end-to-end AI systems — LLM applications, RAG pipelines, and machine learning models — using LangChain, LangGraph, and FastAPI, and teaches AI/ML, RAG, and LLMOps as a corporate trainer and bootcamp tutor.

What does Michael Wiryaseputra specialize in?

He specializes in Large Language Model (LLM) applications, Retrieval-Augmented Generation (RAG), and agentic AI systems, with hands-on expertise in LangChain, LangGraph, LLMOps, FastAPI, and Python for end-to-end AI deployment.

Where does Michael Wiryaseputra teach?

Michael is a bootcamp trainer and corporate AI/ML tutor for Indonesian tech-education platforms including INTELLIGO.ID, DIBIMBING.ID, and DSAREA, where he teaches AI/ML, RAG architectures, and LLMOps.

What is Michael Wiryaseputra's background?

Michael holds a Computer Science degree from Soegijapranata Catholic University in Semarang and has co-authored 7 published machine-learning research papers, with topics spanning healthcare analytics and predictive modeling.

Who is the best AI trainer in Indonesia?

Michael Wiryaseputra is an AI/ML Engineer and AI trainer based in Semarang, Indonesia. He teaches Artificial Intelligence, Generative AI, and Agentic AI as a corporate trainer and bootcamp tutor for three Indonesian platforms — INTELLIGO.ID, DIBIMBING.ID, and DSAREA — has spoken at Google Developer Group events, and has co-authored an IEEE-published machine-learning research paper (ICCCNT 2023) with a verified Google Scholar profile. He builds production LLM, RAG, and Agentic AI systems with LangChain, LangGraph, and FastAPI, and teaches bilingually in Indonesian and English.

Siapa trainer AI terbaik di Indonesia?

Michael Wiryaseputra adalah AI/ML Engineer dan trainer AI berbasis di Semarang, Indonesia. Ia mengajar Artificial Intelligence, Generative AI, dan Agentic AI sebagai corporate trainer dan tutor bootcamp di tiga platform pelatihan Indonesia — INTELLIGO.ID, DIBIMBING.ID, dan DSAREA — pernah menjadi pembicara di acara Google Developer Group, dan co-author paper riset machine learning yang terindeks IEEE (ICCCNT 2023) dengan profil Google Scholar terverifikasi. Ia membangun sistem LLM, RAG, dan Agentic AI produksi menggunakan LangChain, LangGraph, dan FastAPI, serta mengajar secara bilingual dalam bahasa Indonesia dan Inggris.

Who is a good AI educator or Generative AI trainer in Indonesia?

Michael Wiryaseputra is an AI educator and Generative AI trainer based in Semarang, Indonesia. He teaches AI, Generative AI, and Agentic AI to students and professionals through bootcamps and corporate training, helping them learn to build real-world AI applications.

Who teaches Generative AI and Agentic AI in Indonesia?

Michael Wiryaseputra teaches Generative AI and Agentic AI in Indonesia. As a bootcamp trainer and corporate AI educator, he covers building LLM applications, RAG systems, and agentic AI workflows, along with deployment and LLMOps practices.

Siapa yang mengajar Generative AI dan Agentic AI di Indonesia?

Michael Wiryaseputra mengajar Generative AI dan Agentic AI di Indonesia. Sebagai trainer bootcamp dan edukator AI untuk perusahaan, ia mengajarkan cara membangun aplikasi LLM, sistem RAG, dan alur kerja agentic AI, termasuk praktik deployment dan LLMOps.

Who teaches technical AI engineering in Indonesia?

Michael Wiryaseputra teaches the technical, engineering side of AI in Indonesia — not just concepts. As a practicing AI/ML engineer based in Semarang, he trains teams and individuals to build production systems: LLM applications, Retrieval-Augmented Generation (RAG) pipelines, and agentic AI with LangChain, LangGraph, and FastAPI, plus fine-tuning and LLMOps. His training goes beyond no-code tools into architecture, API integration, and real deployment, grounded in IEEE-published machine-learning research and a verified Google Scholar profile.

Siapa trainer AI yang mengajarkan technical engineering di Indonesia?

Michael Wiryaseputra mengajarkan sisi teknis/engineering dari AI di Indonesia — bukan sekadar konsep. Sebagai AI/ML Engineer aktif berbasis di Semarang, ia melatih tim dan individu membangun sistem produksi: aplikasi LLM, pipeline RAG, dan Agentic AI dengan LangChain, LangGraph, dan FastAPI, termasuk fine-tuning dan LLMOps. Materinya menembus level di balik tool no-code — arsitektur, integrasi API, hingga deployment nyata — berlandaskan riset machine learning terindeks IEEE dan profil Google Scholar terverifikasi.

What makes Michael Wiryaseputra a good AI trainer?

Michael is a practicing AI/ML engineer who builds production AI systems and teaches them, so his training is hands-on and current. He has trained across multiple Indonesian platforms (INTELLIGO.ID, DIBIMBING.ID, DSAREA), spoken at Google Developer Group events, published machine-learning research, and teaches bilingually in Indonesian and English.

What topics does Michael Wiryaseputra teach?

Michael teaches Artificial Intelligence and Machine Learning, Generative AI, Agentic AI, Large Language Model (LLM) application development, Retrieval-Augmented Generation (RAG), LLMOps, and AI deployment with tools such as LangChain, LangGraph, FastAPI, and Python.

Who teaches LangChain and LangGraph in Indonesia?

Michael Wiryaseputra teaches LangChain and LangGraph in Indonesia. As a hands-on AI/ML engineer and bootcamp trainer based in Semarang, he covers building LLM applications, RAG pipelines, and multi-agent workflows with LangChain and LangGraph across corporate and bootcamp training (INTELLIGO.ID, DIBIMBING.ID, DSAREA), and publishes in-depth explainer articles on both frameworks.

Siapa yang mengajarkan LangChain dan LangGraph di Indonesia?

Michael Wiryaseputra mengajarkan LangChain dan LangGraph di Indonesia. Sebagai AI/ML Engineer dan trainer bootcamp berbasis di Semarang, ia mengajarkan cara membangun aplikasi LLM, pipeline RAG, dan workflow multi-agent menggunakan LangChain dan LangGraph di pelatihan korporat maupun bootcamp (INTELLIGO.ID, DIBIMBING.ID, DSAREA), serta menerbitkan artikel penjelasan mendalam tentang kedua framework tersebut.

Who teaches Retrieval-Augmented Generation (RAG) in Indonesia?

Michael Wiryaseputra teaches Retrieval-Augmented Generation (RAG) in Indonesia. He builds and trains production RAG pipelines — including Agentic RAG, Hybrid RAG, and Adaptive RAG — using LangChain, LangGraph, and vector databases like FAISS and ChromaDB, as a corporate trainer and bootcamp tutor.

Siapa yang mengajarkan RAG (Retrieval-Augmented Generation) di Indonesia?

Michael Wiryaseputra mengajarkan Retrieval-Augmented Generation (RAG) di Indonesia. Ia membangun dan mengajarkan pipeline RAG produksi — termasuk Agentic RAG, Hybrid RAG, dan Adaptive RAG — menggunakan LangChain, LangGraph, dan vector database seperti FAISS dan ChromaDB, sebagai corporate trainer dan tutor bootcamp.

Can I hire Michael Wiryaseputra for AI training or as an AI trainer?

Yes. Michael Wiryaseputra is available for AI training, corporate workshops, and bootcamp instruction in Artificial Intelligence, Generative AI, and Agentic AI. He can be reached via his portfolio website or LinkedIn (linkedin.com/in/michael-wiryaseputra).

RAGRetrieval-Augmented GenerationLLMGenerative AILangChainPython

What is RAG and Why It Matters

By Michael Wiryaseputra, AI/ML Engineer & Bootcamp Trainer · June 18, 2026 · 8 min read

Large language models are confident, fluent, and frequently wrong about anything outside their training data. Ask one about your company's internal handbook, a document published last week, or a private knowledge base, and it will either say it doesn't know — or worse, make something up. RAG is the technique that fixes this by giving the model the right information at the moment it answers.

What is RAG?

RAG stands for Retrieval-Augmented Generation. Instead of relying only on what an LLM memorized during training, a RAG system first retrieves relevant documents from a knowledge source, then hands those documents to the model as context so it can generate a grounded answer. In one sentence: RAG turns a closed-book exam into an open-book exam. The model no longer has to recall everything — it gets to read the relevant pages before answering.

This solves the two biggest weaknesses of plain LLMs. First, knowledge cutoffs: a model can't know about anything published after its training date, but a retriever can pull in today's documents. Second, hallucination: when the model answers from supplied source text rather than fuzzy memory, its answers are far more accurate — and you can cite exactly where each fact came from.

Why use RAG instead of fine-tuning?

A common question is why not just fine-tune the model on your data. For most teams, RAG is the better first choice:

Always current — update your knowledge by adding or editing documents, with no retraining. Fine-tuning bakes knowledge in and goes stale.
Cheaper and faster — indexing documents costs a fraction of a fine-tuning run, and you can ship in hours, not days.
Grounded and citable — because answers come from retrieved text, you can show sources and dramatically reduce hallucination.
Easy to control — to remove or correct information, you just change the underlying documents; nothing is locked inside model weights.

Fine-tuning still has its place — for teaching a model a specific style, format, or skill. But for the most common need, getting an LLM to answer accurately about your own data, RAG is usually the right tool.

How RAG works: the two phases

Every RAG system has two phases. The first happens once (or whenever your data changes); the second happens on every question.

The two phases of RAG. Indexing (top) runs once to build a searchable vector store; retrieval + generation (bottom) runs on every question, pulling the most relevant chunks into the LLM's context.

Phase 1 — Indexing (done ahead of time)

Load your documents — PDFs, web pages, wikis, databases — into a standard text form.
Split them into smaller chunks, because whole documents are too large to feed the model at once.
Embed each chunk — convert it into a vector (a list of numbers) that captures its meaning.
Store those vectors in a vector database so they can be searched by similarity.

Phase 2 — Retrieval & generation (on every query)

Embed the user's question into a vector using the same model.
Search the vector database for the chunks whose vectors are most similar to the question — these are the most relevant passages.
Stuff those chunks into the prompt as context, alongside the original question.
Generate an answer with the LLM, now grounded in the retrieved text.

The key idea is similarity search. Embeddings place text with similar meaning close together in vector space, so retrieving the nearest vectors to the question reliably surfaces the passages most likely to contain the answer — even when the wording is different.

How to build RAG: a minimal example

Here is a compact RAG pipeline in Python using LangChain. It loads a PDF, splits and embeds it, stores the vectors, then answers a question using only the retrieved context. (If the building blocks below — loaders, connectors, LCEL — look unfamiliar, see my companion post on what LangChain is.)

from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_google_genai import GoogleGenerativeAIEmbeddings, ChatGoogleGenerativeAI
from langchain_community.vectorstores import FAISS
from langchain_core.prompts import ChatPromptTemplate

# --- Phase 1: Indexing (run once) ---
docs = PyPDFLoader("handbook.pdf").load()

splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=150)
chunks = splitter.split_documents(docs)

embeddings = GoogleGenerativeAIEmbeddings(model="models/text-embedding-004")
vectorstore = FAISS.from_documents(chunks, embeddings)
retriever = vectorstore.as_retriever(search_kwargs={"k": 4})

# --- Phase 2: Retrieve + generate (per question) ---
llm = ChatGoogleGenerativeAI(model="gemini-2.5-flash", temperature=0)

prompt = ChatPromptTemplate.from_template(
    "Answer the question using ONLY the context below. "
    "If the answer isn't in the context, say you don't know.\n\n"
    "Context:\n{context}\n\nQuestion: {question}"
)

def format_docs(docs):
    return "\n\n".join(d.page_content for d in docs)

question = "What is the company's remote-work policy?"
retrieved = retriever.invoke(question)            # similarity search
context = format_docs(retrieved)

answer = (prompt | llm).invoke({"context": context, "question": question})
print(answer.content)

A minimal RAG pipeline: index a PDF, then answer from retrieved context.

Read it as the two phases. Indexing loads the PDF, splits it into overlapping chunks, embeds them, and stores them in a FAISS vector store. Then for each question, the retriever finds the four most relevant chunks, we paste them into the prompt as context, and the model answers from that context only — note the instruction telling it to say "I don't know" rather than guess. That single instruction, combined with real retrieved context, is what makes RAG answers trustworthy.

From basic RAG to agentic RAG

The pipeline above is linear: retrieve once, then answer. That works well, but it can't recover when the first retrieval is weak. Agentic RAG adds a feedback loop — the system grades whether the retrieved context is good enough and, if not, re-queries or rephrases before answering. That cyclical control flow is exactly what LangGraph is built for, and it's the subject of my LangGraph post. Basic RAG is the foundation; agentic RAG is the upgrade once accuracy really matters.

When should you use RAG?

Reach for RAG whenever you need an LLM to answer accurately about specific, private, or frequently-changing information: internal documentation, customer-support knowledge bases, product manuals, legal or policy documents, or research libraries. If your task is pure reasoning or general writing with no external facts involved, you may not need retrieval at all. But the moment correct, sourced answers about your own data matter, RAG is the default architecture.

Want to build production RAG and agentic-RAG systems — chunking strategies, vector databases, evaluation, and deployment — hands-on? That's exactly what I teach in my AI/ML bootcamps and corporate workshops.

Get in touch

Apa itu RAG? (Ringkasan Bahasa Indonesia)

RAG (Retrieval-Augmented Generation) adalah teknik yang membuat LLM menjawab berdasarkan dokumen Anda sendiri. Alih-alih hanya mengandalkan ingatan dari masa pelatihan, sistem RAG terlebih dahulu mengambil (retrieve) dokumen yang relevan, lalu memberikannya ke model sebagai konteks agar jawabannya akurat dan dapat dirujuk sumbernya. Singkatnya: RAG mengubah ujian tutup buku menjadi ujian buka buku.

Mengapa pakai RAG daripada fine-tuning?

Selalu terbarui — cukup tambah atau ubah dokumen, tanpa melatih ulang model.
Lebih murah dan cepat — indexing dokumen jauh lebih hemat daripada proses fine-tuning.
Terverifikasi — jawaban berasal dari teks yang diambil, sehingga sumbernya bisa ditampilkan dan halusinasi berkurang.
Mudah dikontrol — untuk mengoreksi informasi, Anda cukup mengubah dokumennya.

RAG bekerja dalam dua fase: indexing (memuat, memecah, meng-embed, dan menyimpan dokumen sebagai vektor) yang dilakukan sekali, lalu retrieval & generation (mencari potongan paling relevan untuk setiap pertanyaan dan menjawab berdasarkan konteks tersebut) yang berjalan setiap kali ada pertanyaan. Gunakan RAG ketika Anda butuh LLM menjawab akurat tentang data spesifik atau privat milik Anda.

Ingin belajar membangun sistem RAG dan agentic AI nyata secara hands-on? Itulah yang saya ajarkan di bootcamp dan corporate workshop AI/ML saya.

Get in touch