quester-one Goto Github PK
Name: Men Tianyi
Type: User
Company: XDU CASIA
Location: Beijing, China
Name: Men Tianyi
Type: User
Company: XDU CASIA
Location: Beijing, China
Code for EMNLP 2023 paper "Emergence of Abstract State Representations in Embodied Sequence Modeling"
[Arxiv 2024] Adversarial Attacks on Multimodal Agents
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Robust recipes for to align language models with human and AI preferences
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
BabyAI platform. A testbed for training agents to understand and execute language commands.
a state-of-the-art-level open visual language model | 多模态预训练模型
为ChatGPT/GLM提供图形交互界面,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm2等本地模型。兼容复旦MOSS, llama, rwkv, newbing, claude, claude2等
A toolkit for developing and comparing reinforcement learning algorithms.
gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.
Minimalistic gridworld package for OpenAI Gym
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Unify Efficient Fine-Tuning of 100+ LLMs
A library for advanced large language model reasoning
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"
Emergent world representations: Exploring a sequence model trained on a synthetic task
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.
Trajectory-as-Exemplar Prompting with Memory for Computer Control
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools.
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.