Large Language Models in Law: A Survey (2312.03718v1)
Abstract: The advent of AI has significantly impacted the traditional judicial industry. Moreover, recently, with the development of AI-generated content (AIGC), AI and law have found applications in various domains, including image recognition, automatic text generation, and interactive chat. With the rapid emergence and growing popularity of large models, it is evident that AI will drive transformation in the traditional judicial industry. However, the application of legal LLMs is still in its nascent stage. Several challenges need to be addressed. In this paper, we aim to provide a comprehensive survey of legal LLMs. We not only conduct an extensive survey of LLMs, but also expose their applications in the judicial system. We first provide an overview of AI technologies in the legal field and showcase the recent research in LLMs. Then, we discuss the practical implementation presented by legal LLMs, such as providing legal advice to users and assisting judges during trials. In addition, we explore the limitations of legal LLMs, including data, algorithms, and judicial practice. Finally, we summarize practical recommendations and propose future development directions to address these challenges.
- TensorFlow: a system for large-scale machine learning, in: The 12th USENIX Symposium on Operating Systems Design and Implementation, pp. 265β283.
- An analytical study of information extraction from unstructured and multidimensional big data. Journal of Big Data 6, 1β38.
- A summary of the research on the judicial application of artificial intelligence. Chinese Studies 9, 14.
- Using multi shares for ensuring privacy in database-as-a-service, in: The 44th Hawaii International Conference on System Sciences, IEEE. pp. 1β9.
- Explanation in AI and law: Past, present and future. Artificial Intelligence 289, 103387.
- Is ChatGPT leading generative AI? what is beyond expectations? Academic Platform Journal of Engineering and Smart Systems 11, 118β134.
- Precedent and discretion. The Supreme Court Review 2019, 313β334.
- A neural probabilistic language model. Journal of Machine Learning Research 3, 1137β1155.
- Does the use of risk assessments in sentences respect the right to due process? a critical analysis of the wisconsin v. loomis ruling. Law, Probability and Risk 17, 45β53.
- Containers and cloud: From LXC to docker to kubernetes. IEEE Cloud Computing 1, 81β84.
- On the opportunities and risks of foundation models. arXiv preprint, arXiv:2108.07258 .
- Large language models in machine translation. EMNLP-CoNLL , 858.
- Graphics processing unit (GPU) programming strategies and trends in gpu computing. Journal of Parallel and Distributed Computing 73, 4β13.
- Class-based n-gram models of natural language. Computational Linguistics 18, 467β480.
- Language models are few-shot learners. Advances in Neural Information Processing Systems 33, 1877β1901.
- Artificial intelligence, for real. Harvard Business Review 1, 1β31.
- A search engine for natural language applications, in: The 14th International Conference on World Wide Web, pp. 442β452.
- Genie: A generator of natural language semantic parsers for virtual assistant commands, in: The 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 394β410.
- AI in finance: challenges, techniques, and opportunities. ACM Computing Surveys 55, 1β38.
- A comprehensive survey of AI-generated content (AIGC): A history of generative AI from GAN to ChatGPT. arXiv preprint, arXiv:2303.04226 .
- Extracting training data from large language models, in: 30th USENIX Security Symposium, pp. 2633β2650.
- Deep learning in law: early adaptation and legal word embeddings trained on large corpora. Artificial Intelligence and Law 27, 171β198.
- A deep learning method for judicial decision support, in: IEEE 19th International Conference on Software Quality, Reliability and Security Companion, IEEE. pp. 145β149.
- Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing, in: IEEE 17th International Symposium on High Performance Computer Architecture, IEEE. pp. 266β277.
- Artificial intelligence in education: A review. IEEE Access 8, 75264β75278.
- Natural language processing. Fundamentals of Artificial Intelligence , 603β649.
- Word2vec. Natural Language Engineering 23, 155β162.
- Artificial intelligence and the transformation of humans, law and technology interactions in judicial proceedings. Law, Technology and Humans 2, 4β18.
- Diffusion models in vision: A survey. IEEE Transactions on Pattern Analysis & Machine Intelligence , 1β20.
- ChatLaw: Open-source legal large language model with integrated external knowledge bases. arXiv preprint, arXiv:2306.16092 .
- Artificial intelligence and judicial modernization. Springer.
- Opportunities and challenges in explainable artificial intelligence (XAI): A survey. arXiv preprint, arXiv:2006.11371 .
- Data privacy: Definitions and techniques. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 20, 793β817.
- The judicial demand for explainable artificial intelligence. Columbia Law Review 119, 1829β1850.
- The matrix in context: Taking stock of police gang databases in london and beyond. Youth Justice 20, 11β30.
- Explainable artificial intelligence: A survey, in: The 41st International Convention on Information and Communication Technology, Electronics and Microelectronics, IEEE. pp. 0210β0215.
- Shortcut learning of large language models in natural language understanding: A survey. arXiv preprint, arXiv:2208.11857 .
- GLaM: Efficient scaling of language models with mixture-of-experts, in: International Conference on Machine Learning, PMLR. pp. 5547β5569.
- Stability and reliability in judicial decisions. Cornell Law Review 73, 422.
- Predictive policing: not yet, but soon preemptive? Policing and Society 30, 905β919.
- Applications of artificial intelligence in agriculture: A review. Engineering, Technology & Applied Science Research 9.
- Investigating the listening and transcription performance in court: experiences from stenographers in philippine courtrooms. Journal of Language and Pragmatics Studies 2, 100β111.
- The impact of artificial intelligence on rules, standards, and judicial discretion. Southern California Law Review 93, 1.
- Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity. The Journal of Machine Learning Research 23, 5232β5270.
- Minds, bodies, and machines. Artificial Intelligence: Its Scope and Limits , 269β303.
- False positives, false negatives, and false analyses: A rejoinder to machine bias: Thereβs software used across the country to predict future criminals. and itβs biased against blacks. Federal Probation 80, 38.
- Large language models in education: Vision and opportunities, in: IEEE International Conference on Big Data, IEEE. pp. 1β10.
- Model-as-a-service (MaaS): A survey, in: IEEE International Conference on Big Data, IEEE. pp. 1β10.
- Deep learning. MIT Press.
- Building sustainable free legal advisory systems: Experiences from the history of AI & law. Computer Law & Security Review 34, 314β326.
- Preserving the rule of law in the era of artificial intelligence (AI). Artificial Intelligence and Law 30, 291β323.
- An introduction to neural networks. CRC Press.
- Concepts in law. volumeΒ 88. Springer Science & Business Media.
- Artificial intelligence in medicine. Metabolism 69, S36βS40.
- Transformer in transformer. Advances in Neural Information Processing Systems 34, 15908β15919.
- A court of specialists: Judicial behavior on the UK Supreme Court. Oxford University Press, USA.
- Predictive policing as a new tool for law enforcement? recent developments and challenges. European Journal on Criminal Policy and Research 24, 201β218.
- Learning distributed representations of concepts, in: The Eighth Annual Conference of the Cognitive Science Society, Amherst, MA. p.Β 12.
- Training compute-optimal large language models. arXiv preprint, arXiv:2203.15556 .
- Alexa, siri, cortana, and more: an introduction to voice assistants. Medical Reference Services Quarterly 37, 81β88.
- Lawyer LLaMA technical report. arXiv preprint, arXiv:2305.15062 .
- Caffe: Convolutional architecture for fast feature embedding, in: The 22nd ACM International Conference on Multimedia, pp. 675β678.
- Machine learning: Trends, perspectives, and prospects. Science 349, 255β260.
- A review on explainability in multimodal deep neural nets. IEEE Access 9, 59800β59821.
- In-datacenter performance analysis of a tensor processing unit, in: The 44th Annual International Symposium on Computer Architecture, pp. 1β12.
- Generalized optimal matching methods for causal inference. The Journal of Machine Learning Research 21, 2300β2353.
- Text summarization from legal documents: a survey. Artificial Intelligence Review 51, 371β402.
- ChatGPT for good? on opportunities and challenges of large language models for education. Learning and Individual Differences 103, 102274.
- Deep reinforcement learning for sequence-to-sequence models. IEEE Transactions on Neural Networks and Learning Systems 31, 2469β2489.
- BERT: Pre-training of deep bidirectional transformers for language understanding, in: NAACL-HLT, pp. 4171β4186.
- Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home), in: IEEE 8th Annual Computing and Communication Workshop and Conference, IEEE. pp. 99β103.
- A performance comparison of container-based technologies for the cloud. Future Generation Computer Systems 68, 175β182.
- Legal remedies for a forgiving society: Childrenβs rights, data protection rights and the value of forgiveness in AI-mediated risk profiling of children by dutch authorities. Computer Law & Security Review 38, 105430.
- A review and state of art of internet of things (IoT). Archives of Computational Methods in Engineering , 1β19.
- Deep learning. Nature 521, 436β444.
- PyTorch distributed: Experiences on accelerating data parallel training. The VLDB Endowment 13, 3005β3018.
- Towards understanding and mitigating social biases in language models, in: International Conference on Machine Learning, PMLR. pp. 6565β6576.
- When machine learning meets privacy: A survey and outlook. ACM Computing Surveys 54, 1β36.
- Deep learning for procedural content generation. Neural Computing and Applications 33, 19β37.
- Summary of ChatGPT-related research and perspective towards the future of large language models. Meta-Radiology , 100017.
- RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint, arXiv:1907.11692 .
- BaGuaLu: targeting brain scale pretrained models with over 37 million cores, in: The 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 192β204.
- Predicting risk in criminal procedure: actuarial tools, algorithms, AI and judicial decision-making. Current Issues in Criminal Justice 32, 22β39.
- Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys 56, 1β40.
- Natural language processing: an introduction. Journal of the American Medical Informatics Association 18, 544β551.
- A brief report on LawGPT 1.0: A virtual legal assistant based on GPT-3. arXiv preprint, arXiv:2302.05729 .
- Sentence-T5: Scalable sentence encoders from pre-trained text-to-text models, in: Findings of the Association for Computational Linguistics, pp. 1864β1874.
- A review on the attention mechanism of deep learning. Neurocomputing 452, 48β62.
- AI in judicial application of law and the right to a court. Procedia Computer Science 192, 2220β2228.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems 35, 27730β27744.
- Recent progress on generative adversarial networks (gans): A survey. IEEE Access 7, 36322β36333.
- Sequence-to-sequence prediction of vehicle trajectory via LSTM encoder-decoder architecture, in: IEEE Intelligent Vehicles Symposium, IEEE. pp. 1672β1678.
- PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems 32.
- Meaningful explanations of black box AI decision systems, in: The AAAI Conference on Artificial Intelligence, pp. 9780β9784.
- A comparison of sequence-to-sequence models for speech recognition., in: Interspeech, pp. 939β943.
- Improving access to justice in state courts with platform technology. Vanderbilt Law Review 70, 1993.
- Improving language understanding by generative pre-training .
- Language models are unsupervised multitask learners. OpenAI Blog 1, 9.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research 21, 5485β5551.
- Explainable AI: From black box to glass box. Journal of the Academy of Marketing Science 48, 137β141.
- Artificial intelligence & human rights: Opportunities & risks. Berkman Klein Center Research Publication .
- Developing artificially intelligent justice. Stanford Technology Law Review 22, 242.
- Speech to text conversion using android platform. International Journal of Engineering Research and Application 3, 253β258.
- An automated conversation system using natural language processing (NLP) chatbot in python. Central Asian Journal of Medical and Natural Science 3, 314β336.
- AI and law: A fruitful synergy. Artificial Intelligence 150, 1β15.
- βthatβs (not) the output i expected!β on the role of end user expectations in creating explanations of AI systems. Artificial Intelligence 298, 103507.
- Legal and human rights issues of AI: Gaps, challenges and vulnerabilities. Journal of Responsible Technology 4, 100005.
- Predictive crime mapping: Arbitrary grids or street networks? Journal of quantitative criminology 33, 569β594.
- Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 22500β22510.
- Artificial intelligence a modern approach. Pearson Education, Inc.
- The IBM 2015 english conversational telephone speech recognition system, in: Annual Conference of the International Speech Communication Association, pp. 3140β3144.
- Basic principles of term formation in the multilingual and multicultural context of EU law, in: Language and Culture in EU Law. Routledge, pp. 183β206.
- What language model to train if you have one million GPU hours? arXiv preprint, arXiv:2210.15424 .
- Towards a standard for identifying and managing bias in artificial intelligence. NIST Special Publication 1270.
- Intern: A new learning paradigm towards general vision. arXiv preprint arXiv:2111.08687 .
- Self-attention with relative position representations, in: Proceedings of NAACL-HLT, pp. 464β468.
- The smart court-a new pathway to justice in china?, in: International Journal for Court Administration, HeinOnline. p.Β 1.
- Compact graph architecture for speech emotion recognition, in: IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE. pp. 6284β6288.
- Megatron-LM: Training multi-billion parameter language models using model parallelism. arXiv preprint, arXiv:1909.08053 .
- A survey on image data augmentation for deep learning. Journal of big data 6, 1β48.
- Mastering the game of go without human knowledge. Nature 550, 354β359.
- Automated extraction of semantic legal metadata using natural language processing, in: IEEE 26th International Requirements Engineering Conference, IEEE. pp. 124β135.
- Using DeepSpeed and Megatron to train megatron-turing NLG 530b, a large-scale generative language model. arXiv preprint, arXiv:2201.11990 .
- Process for adapting language models to society (PALMS) with values-targeted datasets. Advances in Neural Information Processing Systems 34, 5861β5873.
- Judge v robot?: Artificial intelligence and judicial decision-making. University of New South Wales Law Journal, The 41, 1114β1133.
- Artificial intelligence and speedy trial in the judiciary: Myth, reality or need? a case study in the brazilian supreme court (STF). Government Information Quarterly 39, 101660.
- Can online courts promote access to justice? a case study of the internet courts in china. Computer Law & Security Review 39, 105461.
- Artificial intelligence and law: An overview. Georgia State University Law Review 35, 19β22.
- Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems 27.
- Data sharing by scientists: practices and perceptions. PloS One 6, e21101.
- LLaMA: Open and efficient foundation language models. arXiv preprint, arXiv:2302.13971 .
- Speech to text and text to speech recognition systems-areview. IOSR Journal of Computer Engineering 20, 36β43.
- UGC-VQA: Benchmarking blind video quality assessment for user generated content. IEEE Transactions on Image Processing 30, 4449β4464.
- Attention is all you need. Advances in Neural Information Processing Systems 30.
- Sequence to sequence-video to text, in: IEEE International Conference on Computer Vision, pp. 4534β4542.
- The bottom-up evolution of representations in the transformer: A study with machine translation and language modeling objectives, in: EMNLP-IJCNLP, pp. 4396β4406.
- Why fairness cannot be automated: Bridging the gap between EU non-discrimination law and AI. Computer Law & Security Review 41, 105567.
- Emergent abilities of large language models. arXiv preprint, arXiv:2206.07682 .
- Federated learning with differential privacy: Algorithms and performance analysis. IEEE Transactions on Information Forensics and Security 15, 3454β3469.
- Innovative research on legal talents training model in the era of artificial intelligence, in: 16th International Conference on Computer Science & Education, IEEE. pp. 257β262.
- Privacy asymmetries: Access to data in criminal defense investigations. UCLA Law Review 68, 212.
- AI-generated content (AIGC): A survey. arXiv preprint, arXiv:2304.06632 .
- Googleβs neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint, arXiv:1609.08144 .
- Lawformer: A pre-trained language model for chinese legal long documents. AI Open 2, 79β84.
- Human judges in the era of artificial intelligence: challenges and opportunities. Applied Artificial Intelligence 36, 2013652.
- LegalGNN: Legal information enhanced graph neural network for recommendation. ACM Transactions on Information Systems 40, 1β29.
- Transformers from an optimization perspective. Advances in Neural Information Processing Systems 35, 36958β36971.
- Whatβs inside the black box? AI challenges for lawyers and researchers. Legal Information Management 19, 2β13.
- Criminal justice, artificial intelligence systems, and human rights, in: ERA Forum, Springer. pp. 567β583.
- Large language models for robotics: A survey. arXiv preprint, arXiv:2311.07226 .
- Distributed training of large language models, in: The 29th IEEE International Conference on Parallel and Distributed Systems, IEEE. pp. 1β8.
- Panguβ--Ξ±πΌ\alphaitalic_Ξ±: Large-scale autoregressive pretrained chinese language models with auto-parallel computation. preprint arXiv:2104.12369 .
- Study on artificial intelligence: The state of the art and future prospects. Journal of Industrial Information Integration 23, 100224.
- Graph convolutional networks: a comprehensive review. Computational Social Networks 6, 1β23.
- Intelligent analysis and application of judicial big data sharing based on blockchain, in: 6th International Conference on Artificial Intelligence and Big Data, IEEE. pp. 592β596.
- Understanding bag-of-words model: a statistical framework. International Journal of Machine Learning and Cybernetics 1, 43β52.
- DIALOGPT: Large-scale generative pre-training for conversational response generation, in: The 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 270β278.
- A survey on multi-task learning. IEEE Transactions on Knowledge and Data Engineering 34, 5586β5609.
- A survey of large language models. arXiv preprint, arXiv:2303.18223 .
- Iteratively questioning and answering for interpretable legal judgment prediction, in: The AAAI Conference on Artificial Intelligence, pp. 1250β1257.
- How does NLP benefit legal system: A summary of legal artificial intelligence. arXiv preprint, arXiv:2004.12158 .
- JEC-QA: a legal-domain question answering dataset, in: The AAAI Conference on Artificial Intelligence, pp. 9701β9708.
- Strengthening legal protection against discrimination by algorithms and artificial intelligence. The International Journal of Human Rights 24, 1572β1593.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.