vision-cair / chatcaptioner
Official Repository of ChatCaptioner
License: MIT License
Thank you for your great work!
I am currently not receiving any response from ChatGPT during inference in caption.ipynb, although I can use ChatGPT through a web browser. Could you please let me know the expected time for each chat round during inference, and whether using ChatGPT Plus would help speed this up?
Thank you for your assistance!
Hi, I can open the demo URL and upload an image, and I get the first answer from BLIP. After that, the process gets stuck waiting for the ChatGPT reply. Any ideas about this issue?
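One way to tell a genuinely hung ChatGPT call from a slow one is to wrap the call in a timeout. The snippet below is only a minimal sketch using the Python standard library. `call_with_timeout` is a hypothetical helper, not something from this repo, and `fn` stands in for whatever function sends the question to ChatGPT in your setup.

```python
import queue
import threading


def call_with_timeout(fn, timeout_s):
    """Run fn() in a background thread and give up after timeout_s seconds.

    Hypothetical helper for debugging: fn is any zero-argument callable,
    for example a lambda wrapping the request that seems to hang.
    """
    result_q = queue.Queue(maxsize=1)

    def worker():
        # Pass either the return value or the raised exception to the caller.
        try:
            result_q.put(("ok", fn()))
        except Exception as exc:
            result_q.put(("error", exc))

    # Daemon thread: a call that never returns will not block interpreter exit.
    threading.Thread(target=worker, daemon=True).start()

    try:
        status, value = result_q.get(timeout=timeout_s)
    except queue.Empty:
        raise TimeoutError(f"no reply within {timeout_s} s") from None

    if status == "error":
        raise value
    return value
```

If this raises `TimeoutError`, the request itself is not returning, which points at the API call (key, network, or rate limit) rather than at BLIP. Note that the background thread is abandoned, not cancelled, so this is a diagnostic aid and not a fix.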
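Alternatively, is there a detailed guide on how to generate captions for a personally uploaded video? It would be ideal if a Gradio web page could be developed for this, as was done for image captioning. Thanks again to the developers for their work.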
Thank you for the great work!
I understand that running BLIP2 requires a 24G GPU, but I only have four 16G GPUs. Is there a way to run this repo on multiple GPUs?
Hi, thank you very much for your contribution! When I run the code, BLIP2 doesn't answer the question. As a result, the rest of the dialogue is completely blank.
Hi, thanks for the work. I would like to know why the number of frames is hard-coded in video_reader.py and the captioning files. How can it be made dynamic so that it accepts any input, such as a 3-minute video? Does the codebase support this?
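To illustrate what "dynamic" could mean here, the sketch below picks frame indices from the video's length and frame rate instead of using a fixed count. It is not the logic in video_reader.py. `sample_frame_indices` and its parameters are hypothetical names, and the total frame count and FPS are assumed to come from whatever video reader you use.

```python
from typing import List, Optional


def sample_frame_indices(
    total_frames: int,
    fps: float,
    seconds_per_sample: float = 2.0,
    max_samples: Optional[int] = None,
) -> List[int]:
    """Pick one frame every `seconds_per_sample` seconds of video.

    If `max_samples` is given and the video yields more samples than that,
    the samples are thinned evenly so the captioner still sees the whole clip.
    """
    if total_frames <= 0 or fps <= 0 or seconds_per_sample <= 0:
        raise ValueError("total_frames, fps and seconds_per_sample must be positive")
    if max_samples is not None and max_samples < 1:
        raise ValueError("max_samples must be at least 1")

    # Number of frames between two consecutive samples.
    step = max(1, round(fps * seconds_per_sample))
    indices = list(range(0, total_frames, step))

    if max_samples is not None and len(indices) > max_samples:
        # Keep max_samples indices spread evenly over the sampled list.
        stride = len(indices) / max_samples
        indices = [indices[int(i * stride)] for i in range(max_samples)]

    return indices
```

For a 3-minute video at 30 FPS (5400 frames), the defaults give 90 sampled frames. Passing `max_samples=10` thins that to 10 frames spread over the whole clip, which keeps the number of BLIP2 queries bounded for long inputs.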