GSM8K Dataset Papers With Code
By a mysterious writer
Last updated 28 March 2025

GSM8K is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems created by human problem writers. The dataset is split into 7.5K training problems and 1K test problems. Each problem takes between 2 and 8 steps to solve, and solutions primarily involve a sequence of elementary calculations using basic arithmetic operations (+, −, ×, ÷) to reach the final answer. A bright middle school student should be able to solve every problem. The dataset can be used to train and evaluate models on multi-step mathematical reasoning.
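To make "a sequence of elementary calculations" concrete, here is a minimal Python sketch of the kind of multi-step solution described above. The word problem, the function name `solve_steps`, and all numbers are made up for illustration; they are not taken from GSM8K and do not reflect the dataset's actual file format.

```python
# Illustrative only: this problem is invented for the sketch, not drawn from GSM8K.
problem = (
    "A baker makes 12 trays of 8 muffins, sells 70 muffins, "
    "and packs the rest into boxes of 2. How many boxes are filled?"
)


def solve_steps():
    """Solve the hypothetical problem, recording each intermediate value."""
    steps = []

    baked = 12 * 8            # step 1: multiplication
    steps.append(("baked", baked))

    remaining = baked - 70    # step 2: subtraction
    steps.append(("remaining", remaining))

    boxes = remaining // 2    # step 3: division (exact here, so // is safe)
    steps.append(("boxes", boxes))

    return steps, boxes


steps, answer = solve_steps()
for name, value in steps:
    print(f"{name} = {value}")
print("final answer:", answer)
```

The solution uses three steps, which falls inside the 2 to 8 step range the description gives, and each step is a single basic arithmetic operation.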

Phi-1.5: 41.4% HumanEval in 1.3B parameters (model download link in comments) : r/LocalLLaMA

niansong1996/lever-gsm8k-codex · Hugging Face

Minerva: Solving Quantitative Reasoning Problems with Language Models – Google Research Blog

WizardMath - Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct - AI Papers Academy

Sparse Fine-Tuning for Accelerating Large Language Models with DeepSparse - Neural Magic

📜 Top LLM Papers of the Week - by Yoon Baek

Aojun Zhou - CatalyzeX

The OIG Dataset

[PDF] ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection

Top Important LLM Papers for the Week from 30/10 to 5/11

MMLU Dataset Papers With Code
Add GSM8K dataset · Issue #3201 · huggingface/datasets · GitHub

Paper Review: LLaMA: Open and Efficient Foundation Language Models – Andrey Lukyanenko