Unsupervised-Based Information Extraction from Unstructured Arabic Legal Documents
Resumen
In order to make the unstructured or semi-structured traditional legal texts that are meet the requirements of high-level application such as A.I appli- cations in legal, must overcoming on challenge how to extract and analyze structured information from the legal documents automatically. This paper proposes architecture that using a combined approach that utilizes features, lexical and rules based approaches to extract the needed information from traditional legal documents. This research uses a dataset that is collected from Iraq federal court of cassation decisions documents to extract two sets of in- formation, the first is a set of general information, including reference law category, date of decision, court of jurisdiction name, and document no., deci- sion type that are called valuables attributes information, and the document essence is a focused legal information that include principle, arguments, opin- ions legal, and facts of the case which can used in any analysis phase. This research is a part of big project entitled “The Arabic documents opinion ex- traction using argumentation mining”, and the preliminary results were quite promising.Citas
M.Dragoni, S.Villata, W.Rizzi, and G.Governatori, 2016, Combining NLP Approaches for Rule Extraction from Legal Documents,1st Work- shop on MIning and Reasoning with Legal texts (MIREL), France.
M.Francine,E.Boiy, R.Mochales, and C.Reed,2007,Automatic Detec- tion of Arguments in Legal Texts,11th International Conference on Artifi- cial Intelligence and Law (ICAIL), CA USA.
M.García,K.Atkinson, D.Bollegala, K.Chapman, F.Coenen, C.Roberts, and K.Robson,2017,Context-Based Information Extraction from Com- mercial Law Documents ,17th International Conference on Artificial Intel- ligence and Law (ICAIL), London, United Kingdom.
Cortez.E, Da Silva.A, 2013,Unsupervised Information Extraction by Text Segmentation, Springer, New York Dordrecht London.
M.Fasha, N.Obeid, and B.Hammo, 2017, A Proposed Model for Ex- tracting Information from Arabic-Based Controlled Text Domains, New Trends in Information Technology (NTIT), Jordan.
G. Buey, C.Roman, A.Garrido, C.Bobed, and E.Mena, 2017, Automat- ic Legal Document Analysis Results of Information Extraction Processes using an Ontology, 23rd International Symposium on Methodologies for Intelligent Systems (ISMIS),Warsaw, Poland.
I.Bouaziz, F.Gargouri, 2016, A Framework for the Annotation of Ar- abic Legal Documents, 27th International Business Information Manage- ment Association Conference (IBIMA), Milan, Italy.
N.Lagos, F.Segond, S.Castellani, and J.O’Neill,2010, Event Extraction for Legal Case Building and Reasoning, Springer, FIP Advances in Infor- mation and Communication Technology.
M.AL ACHKAR, R.Mahmoudb, 2012, Arabic Legal Information Sys- tem for Case-Law Indexing and Retrieval, Egyptian Computer Science Journal(ECS), Egyptian.
T.Tin, J.Leonard, M.Davies, K.Gerard, and R.Edita, 2009, Informa- tion Extraction from Legal Documents, IEEE, Eighth International Sym- posium on Natural Language Processing, Philippines.
I.Bouaziz, F.Gargouri, 2015, Towards an Arabic legal ontology based on documents properties extraction, IEEE.