CFP last date
15 May 2024
Reseach Article

Legal Documents Clustering using Latent Dirichlet Allocation

by Ravi Kumar V, K. Raghuveer
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 2 - Number 6
Year of Publication: 2012
Authors: Ravi Kumar V, K. Raghuveer
10.5120/ijais12-450384

Ravi Kumar V, K. Raghuveer . Legal Documents Clustering using Latent Dirichlet Allocation. International Journal of Applied Information Systems. 2, 6 ( May 2012), 27-33. DOI=10.5120/ijais12-450384

@article{ 10.5120/ijais12-450384,
author = { Ravi Kumar V, K. Raghuveer },
title = { Legal Documents Clustering using Latent Dirichlet Allocation },
journal = { International Journal of Applied Information Systems },
issue_date = { May 2012 },
volume = { 2 },
number = { 6 },
month = { May },
year = { 2012 },
issn = { 2249-0868 },
pages = { 27-33 },
numpages = {9},
url = { https://www.ijais.org/archives/volume2/number6/170-0384/ },
doi = { 10.5120/ijais12-450384 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T10:43:43.874721+05:30
%A Ravi Kumar V
%A K. Raghuveer
%T Legal Documents Clustering using Latent Dirichlet Allocation
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 2
%N 6
%P 27-33
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

At present due to the availability of large amount of legal judgments in the digital form creates opportunities and challenges for both the legal community and for information technology researchers. This development needs assistance in organizing, analyzing, retrieving and presenting this content in a helpful and distributed manner. We propose an approach to cluster legal judgments based on the topics obtained from Latent Dirichlet Allocation (LDA) using similarity measure between topics and documents. The developed topic based clustering model is capable of grouping the legal judgments into different clusters in effective manner. As per as our knowledge is concerned this is the first approach to cluster Indian legal judgments using LDA topic model

References
  1. J. Allen, et al. "Topic detection and tracking pilot study final report". In Proc. of the DARPA Broadcast News Transcription and understanding Workshop, 1998.
  2. Marti Hearst. "Texttiling: Segmenting text into multi-paragraph subtopic passages". Computational Linguistics, 1997, Vol. 23. Pages 33–64.
  3. M. Utiyama and H. Isahara. "A statistical model for domain-independent text segmentation". In Proc. of the ACL 2001, pages 499–506.
  4. M. Shafiei and E. Milios. "A statistical model for topic segmentation and clustering". In Proc. of Canadian AI'08.
  5. D. Beeferman, A. Berger, and J. Lafferty. "A model of lexical attraction and repulsion". In Proc. of the ACL, pages 1997, pages 373–380.
  6. F. Choi, P. Wiemer-Hastings, and J. Moore. "Latent semantic analysis for text segmentation". In Proc. of EMNLP, 2001, pages 109–117.
  7. H. Kozima. Text segmentation based on similarity between words full text. In Proc. of the ACL, pages 286–288, 1993.
  8. H. Kozima and T. Furugori. "Similarity between words computed by spreading activation on an English dictionary". In Proceedings of the ACL, 1993, pages 232–239.
  9. Wei Xu, Xin Liu and Yihong Gong. "Document Clustering Based On Non-negative Matrix Factorization". In Proc. of SIGIR'03 July 28–August 1, 2003, Toronto, Canada. Pages267-273
  10. Qiang Lu, William Keenan, Jack G. Conrad and Khalid Al-Kofahi. "Legal Document Clustering with Built-in Topic Segmentation". In Proc. of CIKM'11, October 24–28, 2011, Glasgow, Scotland, UK. Pages 383-392
  11. Anna Huang. "Similarity Measures for Text Document". In Proc. of NZCSRSC 2008, April 2008, Christchurch, New Zealand.
  12. M. Saravanan. , B. Ravindran and S. Raman. "Using Legal Ontology for Query Enhancement in Generating a Document Summary". In Proc. of JURIX 2007, 20th International Annual Conference on Legal Knowledge and Information Systems, Leiden, Netherlands, 13-15th Dec 2007. Pages 171-172.
  13. P. Berkhin. "A survey of clustering data mining techniques". Grouping Multidimensional Data 2006, pages 25–71.
  14. D. M. Blei, A. Y. Ng, and M. I. Jordan. "Latent Dirichlet allocation". Journal of Machine Learning Research Vol. 3 (2003) 993-1022.
  15. http://www. keralawyer. com/asp/sub. asp?pageVal=judgements
Index Terms

Computer Science
Information Sciences

Keywords

Latent Dirichlet Allocation (lda) Legal Judgments Documents Clustering Cosine Similarity