Computer Science > Computation and Language

arXiv:2406.07393 (cs)

[Submitted on 11 Jun 2024 (v1), last revised 24 Jun 2024 (this version, v2)]

Title:Limited Out-of-Context Knowledge Reasoning in Large Language Models

Authors:Peng Hu, Changjiang Gao, Ruiqi Gao, Jiajun Chen, Shujian Huang

Abstract:Large Language Models (LLMs) have demonstrated strong capabilities as knowledge bases and significant in-context reasoning capabilities. However, previous work challenges their out-of-context reasoning ability, i.e., the ability to infer information from their training data, instead of from the context or prompt. This paper focuses on a significant facet of out-of-context reasoning: Out-of-Context Knowledge Reasoning (OCKR), which is to combine multiple knowledge to infer new knowledge. We designed a synthetic dataset with seven representative OCKR tasks to systematically assess the OCKR capabilities of LLMs. Using this dataset, we evaluated the LLaMA2-13B-chat model and discovered that its proficiency in this aspect is limited, regardless of whether the knowledge is trained in a separate or adjacent training settings. Moreover, training the model to reason with complete reasoning data did not result in significant improvement. Training the model to perform explicit knowledge retrieval helps in only one of the tasks, indicating that the model's limited OCKR capabilities are due to difficulties in retrieving relevant knowledge. Furthermore, we treat cross-lingual knowledge transfer as a distinct form of OCKR, and evaluate this ability. Our results show that the evaluated model also exhibits limited ability in transferring knowledge across languages. The dataset used in this study is available at this https URL.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2406.07393 [cs.CL]
	(or arXiv:2406.07393v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.07393

Submission history

From: Peng Hu [view email]
[v1] Tue, 11 Jun 2024 15:58:59 UTC (137 KB)
[v2] Mon, 24 Jun 2024 14:59:54 UTC (73 KB)

Computer Science > Computation and Language

Title:Limited Out-of-Context Knowledge Reasoning in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Limited Out-of-Context Knowledge Reasoning in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators