CaseLink: Inductive Graph Learning for Legal Case Retrieval (2403.17780v3)
Abstract: In case law, the precedents are the relevant cases that are used to support the decisions made by the judges and the opinions of lawyers towards a given case. This relevance is referred to as the case-to-case reference relation. To efficiently find relevant cases from a large case pool, retrieval tools are widely used by legal practitioners. Existing legal case retrieval models mainly work by comparing the text representations of individual cases. Although they obtain a decent retrieval accuracy, the intrinsic case connectivity relationships among cases have not been well exploited for case encoding, therefore limiting the further improvement of retrieval performance. In a case pool, there are three types of case connectivity relationships: the case reference relationship, the case semantic relationship, and the case legal charge relationship. Due to the inductive manner in the task of legal case retrieval, using case reference as input is not applicable for testing. Thus, in this paper, a CaseLink model based on inductive graph learning is proposed to utilise the intrinsic case connectivity for legal case retrieval, a novel Global Case Graph is incorporated to represent both the case semantic relationship and the case legal charge relationship. A novel contrastive objective with a regularisation on the degree of case nodes is proposed to leverage the information carried by the case reference relationship to optimise the model. Extensive experiments have been conducted on two benchmark datasets, which demonstrate the state-of-the-art performance of CaseLink. The code has been released on https://github.com/yanran-tang/CaseLink.
- Improving BERT-based Query-by-Document Retrieval with Multi-task Optimization. In ECIR.
- DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization-based re-ranking for case law retrieval. CoRR abs/2108.03937 (2021).
- Injecting the BM25 Score as Text Improves BERT-Based Re-rankers. In ECIR.
- LeiBi@COLIEE 2022: Aggregating Tuned Lexical Models with a Cluster-driven BERT-based Model for Case Law Retrieval. CoRR abs/2205.13351 (2022).
- Arian Askari and Suzan Verberne. 2021. Combining Lexical and Neural Retrieval with Longformer-based Summarization for Effective Case Law Retrieval. In DESIRES (CEUR).
- Retrieval for Extremely Long Queries and Documents with RPRS: a Highly Efficient and Effective Transformer-based Re-Ranker. CoRR abs/2303.01200 (2023).
- LEGAL-BERT: The Muppets straight out of Law School. CoRR abs/2010.02559 (2020).
- Ilias Chalkidis and Dimitrios Kampas. 2019. Deep learning in law: early adaptation and legal word embeddings trained on large corpora. Artif. Intell. Law 27, 2 (2019), 171–198.
- How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation. In ECIR.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT.
- Graph Condensation for Inductive Node Representation Learning. In ICDE.
- Competition on Legal Information Extraction/Entailment (COLIEE).
- Inductive Representation Learning on Large Graphs. In NeurIPS.
- B. HARRIS. 2002. Final appellate courts overruling their own “wrong” precedents: the ongoing search for principle. LAW QUARTERLY REVIEW 118, 7 (2002), 408–427.
- Argument discovery via crowdsourcing. VLDB J. (2017).
- Karen Spärck Jones. 2004. A statistical interpretation of term specificity and its application in retrieval. J. Documentation 60, 5 (2004), 493–502.
- Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In ICLR.
- Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR.
- SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval. CoRR abs/2304.11370 (2023).
- Learning Better Representations for Neural Information Retrieval with Graph Information. In CIKM.
- Investigating Conversational Agent Action in Legal Case Retrieval. In ECIR.
- Query Generation and Buffer Mechanism: Towards a better conversational agent for legal case retrieval. Inf. Process. Manag. (2022).
- Incorporating Retrieval Information into the Truncation of Ranking Lists for Better Legal Search. In SIGIR.
- LeCaRD: A Legal Case Retrieval Dataset for Chinese Law System. In SIGIR.
- Incorporating Structural Information into Legal Case Retrieval. ACM Trans. Inf. Syst. (2023).
- CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding. In EMNLP.
- Document Ranking with a Pretrained Sequence-to-Sequence Model. In EMNLP.
- Jay M. Ponte and W. Bruce Croft. 2017. A Language Modeling Approach to Information Retrieval. SIGIR (2017).
- Incorporating Judgment Prediction into Legal Case Retrieval via Law-aware Generative Retrieval. CoRR abs/2312.09591 (2023).
- Exploiting Positional Information for Session-Based Recommendation. ACM Trans. Inf. Syst. 40, 2 (2022), 35:1–35:24.
- Exploiting Cross-session Information for Session-based Recommendation with Graph Neural Networks. ACM Trans. Inf. Syst. (2020).
- Rethinking the Item Order in Session-based Recommendation with Graph Neural Networks. In CIKM.
- GAG: Global Attributed Graph Neural Network for Streaming Session-based Recommendation. In SIGIR.
- ImGAGN: Imbalanced Network Embedding via Generative Adversarial Graph Networks. In SIGKDD.
- Semantic-Based Classification of Relevant Case Law. In JURISIN.
- HeteGCN: Heterogeneous Graph Convolutional Networks for Text Classification. In WSDM.
- Stephen E. Robertson and Steve Walker. 1994. Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval. In SIGIR.
- BERT-PLI: Modeling Paragraph-Level Interactions for Legal Case Retrieval. In IJCAI.
- Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. (2014).
- Law Article-Enhanced Legal Case Matching: a Model-Agnostic Causal Learning Approach. CoRR abs/2210.11012 (2022).
- Disease Prediction via Graph Neural Networks. IEEE J. Biomed. Health Informatics (2021).
- Prompt-based Effective Input Reformulation for Legal Case Retrieval. CoRR abs/2309.02962 (2023).
- CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs. In ECIR.
- Building Legal Case Retrieval Systems with Lexical Matching and Summarization using A Pre-Trained Phrase Scoring Model. In ICAIL.
- Representation Learning with Contrastive Predictive Coding. CoRR abs/1807.03748 (2018).
- Graph Attention Networks. In ICLR.
- NOWJ at COLIEE 2023 - Multi-Task and Ensemble Approaches in Legal Information Processing. CoRR abs/2306.04903 (2023).
- InducT-GCN: Inductive Graph Convolutional Networks for Text Classification. In ICPR.
- Neural Graph Collaborative Filtering. In SIGIR.
- Zhaowei Wang. 2022. Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval. In IJCNN.
- Lawformer: A pre-trained language model for Chinese legal long documents. AI Open 2 (2021), 79–84.
- LegalGNN: Legal Information Enhanced Graph Neural Network for Recommendation. ACM Trans. Inf. Syst. (2022).
- LEVEN: A Large-Scale Chinese Legal Event Detection Dataset. In ACL.
- Graph Convolutional Networks for Text Classification. In AAAI.
- Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale Extraction. In SIGIR.
- Contrastive Learning for Legal Judgment Prediction. ACM Trans. Inf. Syst. 41, 4 (2023), 25.
- Double-Scale Self-Supervised Hypergraph Learning for Group Recommendation. In CIKM.
- CFGL-LCR: A Counterfactual Graph Learning Framework for Legal Case Retrieval. In SIGKDD.
- Graph-less Neural Networks: Teaching Old MLPs New Tricks Via Distillation. In ICLR.
- Iteratively Questioning and Answering for Interpretable Legal Judgment Prediction. In AAAI.
- Boosting legal case retrieval by query content selection with large language models. In SIGIR-AP.
- Yanran Tang (6 papers)
- Ruihong Qiu (26 papers)
- Hongzhi Yin (211 papers)
- Xue Li (124 papers)
- Zi Huang (126 papers)