This paper review was conducted as part of the seminar "Large Language Models."
Language models are increasingly being used to solve complex reasoning tasks. However, they often struggle with such tasks: for a single prompt they can produce several different reasoning paths and answers that all appear plausible, which makes it hard to identify the correct answer from a single decoded output. In this paper review, I analyze a decoding strategy called self-consistency. Instead of relying on one greedily decoded reasoning path, self-consistency samples a diverse set of reasoning paths and selects the answer that is most consistent across them, i.e., the answer reached by the largest number of sampled paths. The paper evaluates self-consistency decoding on a number of arithmetic and commonsense reasoning benchmarks and shows that it significantly improves the performance of language models on these tasks compared to standard chain-of-thought prompting with greedy decoding. In addition, the limitations of the self-consistency method are discussed, and future research directions are suggested.
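To make the aggregation step concrete, the following is a minimal sketch of self-consistency as a majority vote over sampled answers. The helpers `sample_completion` and `extract_answer` are hypothetical placeholders for the model call (with temperature sampling) and the answer-parsing step; the paper does not prescribe a specific implementation.

```python
from collections import Counter


def self_consistency_answer(prompt, sample_completion, extract_answer, num_samples=10):
    """Sample several reasoning paths and return the most frequent final answer.

    Assumptions (not from the original paper's code):
      - sample_completion(prompt) returns one chain-of-thought completion,
        drawn with temperature-based sampling so that paths differ.
      - extract_answer(completion) parses the final answer from a completion.
    """
    answers = []
    for _ in range(num_samples):
        completion = sample_completion(prompt)      # one sampled reasoning path
        answers.append(extract_answer(completion))  # keep only its final answer
    # Majority vote: the answer supported by the most reasoning paths wins.
    return Counter(answers).most_common(1)[0][0]
```

In this sketch, the individual reasoning paths are discarded after their answers are extracted; only agreement on the final answer matters, which is the core idea behind selecting the "most consistent" answer.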
The original paper can be found here: Wang, X., Wei, J., Schuurmans, D., Le, Q., Chi, E., Narang, S., Chowdhery, A., & Zhou, D. (2023). Self-Consistency Improves Chain of Thought Reasoning in Language Models. arXiv preprint arXiv:2203.11171.