Skip to content
DevDisciplineNIGHT CITY NEVER SLEEPS
Blog+About
Loading
DevDiscipline
instagramlinkedin

© 2026 ALPARSLAN SELÇUK DEVELİOĞLU

Privacy Policy·Terms of Service

Leave your email. I will send new posts.

Blog+

Built for my own writing practice. Editor, SEO, multilingual — all in one.

See Details
Archive
SearchBlog+About
A Document-Based Legal AI That Can Say "I Don't Know" | DevDiscipline
Software12 min2 views

A Document-Based Legal AI That Can Say "I Don't Know"

Logged: May 28, 2026
  • The dream, and the real problem
  • Quick fix: this is not "training"
  • How the pieces fit together
  • 1) Turning the document into text, and a PDF bug
  • 2) Chunking: why and how?
  • 3) Embedding: text into numbers, without losing the meaning
  • 4) Vector search: boring but enough
  • 5) Generation: pushing the model to the source
  • Does it really stop hallucination? A test
  • When it finds more than one document
  • Two engines: cloud Haiku or a local model
  • Lessons from the road
  • Where it is useful, and how it scales?
  • 10 million documents: what big data has waiting for us
  • Putting the three models side by side
  • For the people who say "talk is cheap, show the code"

Contents

  • The dream, and the real problem
  • Quick fix: this is not "training"
  • How the pieces fit together
  • 1) Turning the document into text, and a PDF bug
  • 2) Chunking: why and how?
  • 3) Embedding: text into numbers, without losing the meaning
  • 4) Vector search: boring but enough
  • 5) Generation: pushing the model to the source
  • Does it really stop hallucination? A test
  • When it finds more than one document
  • Two engines: cloud Haiku or a local model
  • Lessons from the road
  • Where it is useful, and how it scales?
  • 10 million documents: what big data has waiting for us
  • Putting the three models side by side
  • For the people who say "talk is cheap, show the code"

Do not miss new posts

I can send notes like this by email. No noise, just new posts.

Software Engineering Series

#1A Document-Based Legal AI That Can Say "I Don't Know"// reading#1Opening 11 Million Character HTML in a Mobile WebView: Virtual Chunking#1Vibe Coding for 10 Years Experienced Software Developer
Series Index: 3 entries recordeddev_discipline://series_meta
#yapay zeka#hukuk#hukuk yapay zeka#hukuk rag#hukuki belge arama#halüsinasyon önleme#belgeye dayalı ai#legal ai#rag system#document-based ai

You May Also Like

Opening 11 Million Character HTML in a Mobile WebView: Virtual Chunking
Software

Opening 11 Million Character HTML in a Mobile WebView: Virtual Chunking

9 minApril 18, 2026
Vibe Coding for 10 Years Experienced Software Developer
Software

Vibe Coding for 10 Years Experienced Software Developer

11 minMay 24, 2026
Is Discipline Necessary for Success? Or Have They Been Lying to You All Along?
Growth

Is Discipline Necessary for Success? Or Have They Been Lying to You All Along?

10 minMay 6, 2026