Comments (2)
(For future reference, you can run locally using --runner DirectRunner
)
We don't have any current plans to do better turn segmentation, but I'm sure that direction would greatly improve the quality of that dataset. We would definitely welcome contributions in that direction!
from conversational-datasets.
I figured it out how your script works on local machines.
For OpenSubtitles, you assume that the context of a message is the preceding N
sentences. Do you have any plan to use some segmentation tools to segment the turns of each conversation? E.g., Automatic Turn Segmentation for Movie & TV Subtitles.
Thanks,
Peixiang
from conversational-datasets.
Related Issues (20)
- Would you release the data on google drive? HOT 3
- Chinese Data HOT 1
- AmazonQA Data Size HOT 2
- Why not include posts in the Reddit dataset? HOT 1
- apache-beam==2.5.0 requirements error HOT 1
- Support for python3 HOT 1
- Missing required option: region – in Google Cloud HOT 6
- No module named deprecation_wrapper HOT 2
- Is it possible to get access to the raw data from other storage/computational platform and read/process data there.
- Quota exceeded: Your project exceeded quota for free query bytes scanned. For more information, see https://cloud.google.com/bigquery/troubleshooting-errors HOT 1
- Access not available: "http://models.poly-ai.com/convert/v1/model.tar.gz"
- Large datasets
- how to run ? HOT 2
- "No module named module_wrapper" HOT 1
- Get more workers with Google Cloud's free trial HOT 2
- Local Download
- The app is blocked HOT 1
- 'Comment' is not defined
- Line 114
- AttributeError: 'NoneType' object has no attribute 'Client'
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from conversational-datasets.