Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health (2304.10447v1)

Published 20 Apr 2023 in cs.CL

Abstract: Pretrained LLMs have been used in various natural language processing applications. In the mental health domain, domain-specific LLMs are pretrained and released, which facilitates the early detection of mental health conditions. Social posts, e.g., on Reddit, are usually long documents. However, there are no domain-specific pretrained models for long-sequence modeling in the mental health domain. This paper conducts domain-specific continued pretraining to capture the long context for mental health. Specifically, we train and release MentalXLNet and MentalLongformer based on XLNet and Longformer. We evaluate the mental health classification performance and the long-range ability of these two domain-specific pretrained models. Our models are released in HuggingFace.

Citations (26)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health (2304.10447v1)

Summary

Related Papers