Integração de LLM e RAG para análise de documentos jurídicos

Pereira, Dante Flesch

Please use this identifier to cite or link to this item: https://hdl.handle.net/10923/27083

Type:	Monografia
Title:	Integração de LLM e RAG para análise de documentos jurídicos
Author(s):	Pereira, Dante Flesch
Advisor:	Bandeira, Denise
Issue Date:	2024
Keywords:	ANÁLISE JURÍDICA PROCESSAMENTO DE LINGUAGEM NATURAL LARGE LANGUAGE MODEL RETRIEVAL AUGMENTED GENERATION LEGAL ANALYSIS NATURAL LANGUAGE PROCESSING
Abstract:	O uso de Large Language Model (LLM) e de técnicas de Retrieval Augmented Generation (RAG) possibilita automatizar a análise de documentos jurídicos, uma tarefa complexa que tradicionalmente exige uma extensa revisão manual. Esse trabalho apresenta uma prova de conceito que visa demonstrar a viabilidade dessa integração no contexto de trabalho do Ministério Público do Estado do Rio Grande do Sul (MPRS), focando na análise de documentos do Tribunal de Contas do Estado do Rio Grande do Sul (TCE/RS) para identificar casos de irregularidade administrativa. A proposta inclui o desenvolvimento de um sistema capaz de realizar consultas em linguagem natural, recuperando e gerando respostas com base em informações relevantes dos documentos processados. A arquitetura combina aprendizado de máquina, processamento de linguagem natural e busca semântica, com o objetivo de otimizar o trabalho jurídico, garantindo precisão e eficiência. Resultados preliminares destacam a importância dessa abordagem para lidar com grandes volumes de texto assim como a necessidade de validação contínua das respostas geradas. The use of Large Language Models (LLM) and Retrieval Augmented Generation (RAG) techniques offers the potential to automate the analysis of legal documents, a complex task that traditionally requires extensive manual review. This work presents a proof of concept aimed at demonstrating the feasibility of this integration in the context of Ministério Público do Estado do Rio Grande do Sul (MPRS), focusing on the analysis of documents from the Tribunal de Contas do Estado do Rio Grande do Sul (TCE/RS) to identify cases of administrative irregularities. The proposal includes the development of a system capable of performing natural language queries, retrieving and generating answers based on relevant information from processed documents. Architecture combines machine learning techniques, natural language processing and semantic search, with the goal of optimizing legal work, ensuring accuracy and efficiency. Preliminary results highlight the importance of this approach in handling large volumes of text as well as the need for continuous validation of generated responses.
URI:	https://hdl.handle.net/10923/27083
Appears in Collections:	TCC Sistemas de Informação

Files in This Item:

File	Description	Size	Format
2024_2_DANTE_FLESCH_PEREIRA_TCC.pdf	Texto completo	1,39 MB	Adobe PDF	Open View

Show full item record See statistics Search for other versions of fulltext via SFX