Code Monkey home page Code Monkey logo

altibmttsdictionaries's Introduction

AltIBMTTSDictionaries

An alternative set of dictionaries for IBMTTS. Accepts community contributions, but generally aims to be a minimalist project.

IBMTTS dictionary

This project contains dictionary files specifically optimized for IBMTTS 6.7, the same version used on Linux, but also available for Windows, which are located in the IBMTTS 6.7 folder in this repository. The IBMTTS dictionaries fix a number of IBMTTS-specific pronunciation issues, and remove over 300 words that it pronounces correctly. To use the dictionaries on Linux, rename enumain.dic and enuroot.dic to main.dct and root.dct, respectively, and place them in /var/opt/IBM/ibmtts/dict, or /var/opt/IBM/ibmtts/dict/en, or /var/opt/IBM/ibmtts/dict/en_US.

Contributing

If you'd like to contribute words to this dictionary, please create an issue or pull request with the word you would like to be corrected, as well as your suggested pronunciation, and provide sources for it, whether that would be dictionaries, YouTube videos or similar. However, before you suggest a change, make sure that the word you want changed is actually pronounced incorrectly, and that the original pronunciation is not less common but otherwise acceptable, as your entry will get removed if this turns out to be the case. Additionally, entries that will not be accepted include, but are not limited to:

  • Domain names, file names, usernames, social media handles and similar words
  • Compound words that have very few real world examples, E.G. a compound that only appears on one blog post on one website
  • Dictionary entries that could be interpreted as forcing a personal preference on users, E.G. an entry in the main dictionary that expands an abbreviation that you like
  • Abbreviation dictionary entries. For various reasons, including the limitations of the abbreviation dictionary format, and the possibility of abbreviations having several meanings and/or conflicting with real words/acronyms, abbreviation dictionary entries are not accepted for any language
  • Very large dictionary contributions in one issue or pull request. This project values careful sourcing and quality over quantity, so lots of entries at once, e.g. in the hundreds or thousands, are unlikely to be accepted, unless evidence is provided that the entries match the standards that are expected here

Main vs root dictionary

When adding new entries, the roots dictionary should generally be used in most cases. However, the main dictionary should be used if adding:

  • Acronyms, such as UTI or FAQ
  • Entries containing hyphens
  • Entries that would break legitimate words if they were added to the root dictionary

Letter representation in main dictionaries

In entries that use SPRs, I don't really care how they're written, as long as the resulting pronunciation is valid. However, in the main dictionary, for consistency and to guard against sentence bugs, letters should be represented by this set of misspellings.

altibmttsdictionaries's People

Contributors

mohamed00 avatar thequinbox avatar kara-louise avatar masonasons avatar

Stargazers

 avatar RetroBunn avatar Lunkhopao Haokip avatar  avatar Hai Nguyen Ly avatar Yukie Satoshi Nobuharu avatar Pratik Patel avatar Stanislaw Przedzinkowski avatar  avatar  avatar  avatar  avatar  avatar  avatar Noah Carver avatar  avatar Talon avatar  avatar

Watchers

James Cloos avatar  avatar  avatar RetroBunn avatar  avatar

altibmttsdictionaries's Issues

Add name: Tami

I've yet to come across a synth that says this correctly, but the name "Tami", is supposed to be said "Tammy", with a Y.

Say the name Meagan correctly

I just came across the name Megan (spelled Meagan). With these dictionaries, it's pronounced incorrectly, the A is really showing.

Add word: Tauri

Tauri is a new web framework, with the goal of replacing Electron. But IBMTTS doesn't pronounce it correctly. It's supposed to be said as tau re, but as one word, and less stretched out.

How to handle words that have two proper sayings, neither of which are how Eloquence says it?

I'm presently trying to work on correcting the word carabiner. I found at least 3 sources that say it how I do, and put it in my working copy of the dictionary:
https://www.youtube.com/watch?v=Ql1VQP1zgg8
https://www.youtube.com/watch?v=6Ft6KPLPp4c
and https://www.youtube.com/watch?v=HWRPzCamSMs
However, I found another that says it slightly differently:
https://www.youtube.com/watch?v=_RxfgygfNYM
That's not how Eloquence says it, but it is another way. How should I proceed here? Go with the one that seems more common?

Improve name: Nikola

I found an Eloquence dictionary entry that says Nikola as it's supposed to be pronounced, at least that's how I've found it said in YouTube videos and from someone who has the name. Would be great if that could be updated to be: nikola `[.1nI.0kx.0lx]

Don't strech out details as much.

I'm not honestly sure I like how this dictionary handles the word "details." It makes it sound like there's two E's when there's not. Could this be fixed?

Name: Stirland

This should be pronounced like Stirlend and not stir land.

Name: Karlee

The name Karlee should be pronounced like Karly

Add name: Kaliah

This is supposed to be said like kuh, followed by the name leah.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.