RAG-Based Legal Research Assistant for Finding Similar Past Cases

Paper Details
Manuscript ID: 2126-0423-4510
Vol.: 2 Issue: 4 Pages: 125-134 Apr - 2026 Subject: Data Science And Big Data Language: English
ISSN: 3068-1995 Online ISSN: 3068-109X DOI: https://doi.org/10.64823/ijter.2604015
Abstract

This paper presents the design, architecture, and evaluation of a Retrieval-Augmented Generation system that assists new legal assistants in locating connected and similar past cases for new filings. The solution addresses Job 1, Legal Assistant, and leverages a curated Knowledge Base of 10 structured research session logs spanning five practice areas at a fictional law firm. Generative AI is assessed as capable of handling approximately 80 percent of the task, with the remaining 20 percent requiring human legal judgment, case validity verification, and jurisdiction-specific reasoning. The RAG architecture pairs a HuggingFace sentence transformer embedding model with FAISS vector search and GPT-4o-mini for grounded generation. Three query enhancement techniques improve retrieval precision beyond the baseline. Evaluation across eight metrics covering retrieval quality, generation quality through the RAG Triad, and operational performance demonstrates that the solution meets or exceeds the 0.80 target threshold on seven of eight dimensions. The paper documents limitations in cost, latency, case law currency, and the irreplaceable need for attorney oversight.

Keywords
Retrieval-Augmented Generation RAG legal research pre-trained models FAISS groundedness faithfulness knowledge base GenAI evaluation legal assistant
Share
Paper Metrics
  • Views 44
  • Downloads 12
Cite this Article

Shivanand R Koppalkar (2026). RAG-Based Legal Research Assistant for Finding Similar Past Cases. International Journal of Technology & Emerging Research (IJTER), 2(4), 125-134. https://doi.org/10.64823/ijter.2604015

BibTeX
@article{ijter2026212604234510,
  author = {Shivanand R Koppalkar},
  title = {RAG-Based Legal Research Assistant for Finding Similar Past Cases},
  journal = {International Journal of Technology &  Emerging Research },
  year = {2026},
  volume = {2},
  number = {4},
  pages = {125-134},
  doi =  {10.64823/ijter.2604015},
  issn = {3068-109X},
  url = {https://www.ijter.org/article/212604234510/rag-based-legal-research-assistant-for-finding-similar-past-cases},
  abstract = {This paper presents the design, architecture, and evaluation of a Retrieval-Augmented Generation system that assists new legal assistants in locating connected and similar past cases for new filings. The solution addresses Job 1, Legal Assistant, and leverages a curated Knowledge Base of 10 structured research session logs spanning five practice areas at a fictional law firm. Generative AI is assessed as capable of handling approximately 80 percent of the task, with the remaining 20 percent requiring human legal judgment, case validity verification, and jurisdiction-specific reasoning. The RAG architecture pairs a HuggingFace sentence transformer embedding model with FAISS vector search and GPT-4o-mini for grounded generation. Three query enhancement techniques improve retrieval precision beyond the baseline. Evaluation across eight metrics covering retrieval quality, generation quality through the RAG Triad, and operational performance demonstrates that the solution meets or exceeds the 0.80 target threshold on seven of eight dimensions. The paper documents limitations in cost, latency, case law currency, and the irreplaceable need for attorney oversight.},
  keywords = {Retrieval-Augmented Generation, RAG, legal research, pre-trained models, FAISS, groundedness, faithfulness, knowledge base, GenAI evaluation, legal assistant},
  month = {Apr},
}
Copyright & License

Copyright © 2025 Authors retain the copyright of this article. This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.