Comments (5)
Okay, but is there any clarification of what actually happened? I am hearing a bunch of crazy rumors, like that it's on par with GPT-4 (which I highly doubt) and that they removed it for that reason. In my opinion, open source models haven't even surpassed GPT 3.5 if you ask them for, say, translations or more obscure facts; they are only better at reasoning and simpler common questions, or, in the case of more focused models, at coding etc. I just have a hard time believing they turned what I would personally call, for my use cases, a sub GPT 3.5 model into a GPT-4 tier one, but I am downloading it now (mirrored). I highly doubt the model is suddenly good at these things, but if it's true, the cat is already out of the bag and has already found a new owner.
from wizardlm.
Yeah, it already used an English word for no reason 💔
Not even close to GPT-4, at least for my uses; going to try other things, I guess. I think Japanese might not be a fair test, as it only has 3200 vocab and is likely not made for this, but it does illustrate my point well. GPT 3.5 doesn't have this issue, btw.
Update: https://twitter.com/WizardLM_AI/status/1780101465950105775
[WizardLM](https://twitter.com/WizardLM_AI)
[@WizardLM_AI](https://twitter.com/WizardLM_AI)
🫡 We are sorry for that.
It’s been a while since we’ve released a model months ago😅, so we’re unfamiliar with the new release process now: We accidentally missed an item required in the model release process - toxicity testing.
We are currently completing this test quickly and then will re-release our model as soon as possible. 🏇
❤️Do not worry, thanks for your kindly caring and understanding.
Holy crap, this model is good; just don't ask it to translate anything, I guess. I have never seen an open source LLM get this right, so I think it might actually replace GPT 3.5 for me, ignoring speed and Japanese. I am seriously impressed: every other model failed this miserably, but this is as good as GPT 3.5, and I would say it formatted the answer better than GPT 3.5.
Not scientific at all, but I have noticed that open source LLMs always get this one wrong, so it's my go-to test. So maybe GPT 3.4? Or sub GPT-4, or on par if you don't care about translating things, but I would say that's still a stretch. It did get some songs wrong at the end when I let it generate more, but that only happened past a certain number, and I noticed my context was set to 512 tokens, so I think that was actually the issue, not the model.
I think this should be closed though, since we now know what the cause was.