Light

xukp20 / math Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 1.0 56.23 MB

my tries related to MATH LLM, out of use at 2023/8/20

Python 51.01% Jupyter Notebook 48.68% Shell 0.31%

math's Introduction

MATH

Generate captions for asy command

2023/7/8: use gpt-3.5-turbo to generate captions for the test set.

The instruction may could be better.

MATH	problems with asy	total	percentage
train	707	7500	0.094
test	419	5000	0.0838

Category	algebra	counting_and_probability	geometry	intermediate_algebra	number_theory	prealgebra	precalculus
train	55	58	324	40	2	177	51
test	23	47	188	29	1	100	31

2023/7/11

Results with captions of gpt models on 42 samples of prm800k

model	raw LLaMa 65B	w gpt3.5	w gpt4	w text	w gpt 3.5 strip	w gpt4 strip	with text strip
correct (same + recheck)	1 + 1	2 + 4	0 + 4	1 + 4	1 + 1	1 + 1	1 + 2

correct + recheck	raw	3.5	4	text	3.5 strip	4 strip	text strip
	prealgebra_930 prealgebra_1512	geometry_248 prealgebra_378 prealgebra_930 geometry_226 geometry_795 counting_and_probability_731	counting_and_probability_282 geometry_226 counting_and_probability_731 prealgebra_914	geometry_248 algebra_1349 geometry_226 counting_and_probability_731 prealgebra_1114	prealgebra_930 geometry_226	geometry_248 geometry_226	geometry_283 geometry_183 counting_and_probability_731

419 MATH test with asy

MATH	raw LLaMa	w 3.5
correct + recheck	15+9	11 + 11
fail	27	77

LLaMa:
- algebra_489 counting_and_probability_250 counting_and_probability_281 counting_and_probability_288 counting_and_probability_328 geometry_145 geometry_15 geometry_242 geometry_267 geometry_375 geometry_47 prealgebra_1040 prealgebra_1375 prealgebra_1507 prealgebra_930
GPT3.5-turbo

math's People

Contributors

Stargazers

Watchers

Forkers

johndpope

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.