- Build with Gradle v7.6.4
- Improve publishing
- Ported runtime Groovy classes to Java (specifically, in Luxembourgish language component)
- Migrate CI/CD from Travis to GitHub Actions
- Add compability with Java 11 and 17
- Lock down Java compatibility to 1.8
- Reduce non-API dependency leakage (particularly
groovy-all
) - Dependency resolution errors following JCenter shutdown
- Javadoc errors in legacy, third-party code
- Legacy Maven POMs
- Upgraded Commons IO to v2.17.0, fixing
- CVE-2021-29425
- CVE-2024-47554
- Upgraded Guava to v33.3.1-jre, fixing
- CVE-2020-8908
- CVE-2023-2976
- Upgraded HSQLDB to v2.7.3, fixing
- CVE-2022-41853
- Upgraded Log4J to v2.24.1, fixing
- CVE-2019-17571
- CVE-2021-4104
- CVE-2022-23302
- CVE-2022-23305
- CVE-2022-23307
- Upgraded SLF4J to v2.0.16, fixing
- CVE-2020-9488
- CVE-2021-44228
- CVE-2021-44832
- CVE-2021-45046
- CVE-2021-45105
v5.2.1 (2022-05-25)
This is a maintenance release to provide artifacts on Maven Central following the shutdown of JCenter, and also to mitigate the Log4Shell vulnerabilities.
- Artifacts are now available on Maven Central, following the shutdown of JCenter/Bintray
- Log4J dependencies have been updated to the latest versions, mitigating Log4Shell vulnerabilities
v5.2 (2016-09-15)
This is expected to be the last milestone release in the 5.x generation of MaryTTS.
- Support for Luxembourgish
- Completely rewritten text preprocessing module for English (removing reliance on FreeTTS)
- Better support for Java 8
- Some migration towards building with Gradle
- All unit-selection voices have been rebuilt using the new Gradle plugin
- All HSMM voices are now hosted on Bintray and can be resolved as dependencies
- Voices with open data now have open-source voicebuilding repositories hosted on GitHub, which include downloadable distribution packages
- #533: support system preferredmodule
- #525: Update dependencies and maven plugins
- #523: solving quotes phonetisation problem
- #506: Duplicate subexpressions in
WeightedCodebookMfccMapper.java
- #499: Upgrade Apache Commons Collections to v3.2.2
- #493: Drop assembly plugin
- #425: various fixes for C++ client source code
- #399: Component installer License download re-code following #395
- #393:
package.html
files converted intopackage-info.java
files - #362: add support for Groovy
- #353: Swap inlined third-party code with dependencies
- #351: update of maven plugins used for the website and build plugins
- #333: remove subscription/post links to archived mary-dev mailing list
- #330: show port number when starting MaryTTS server
- #320: move outdated example code from runtime assembly into doc directory
- #309: try to process tokens if they contain word characters, even when they are tagged as punctuation
- #228: fix on drop FreeTTS dependencies
- #227: Enhanced OutputStreams for Mary Client
- #217: incrementality changes
- #593: Don't split up multiple punctuation marks in tokenization
- #570: Praat TextGrid output is invalid with boundaries (times are not monotonic)
- #564: add missing TOKENS examples
- #555: HMMModel generates malformed XML duration attributes
- #531: java.awt.HeadlessException in (Half)PhoneLabelFeatureAligner
- #516: Single words conduct to have a wrong POS which leads to a crash of the target feature module
- #515: preprocessing contraction and double quotes correction
- #503:
halfphoneUnitFeatureDefinition_ac.txt
does not have any continuous features even thoughhalfphoneFeatures_ac.mry
does - #480: IBAN code → stacktrace
- #469: APML is broken
- #468: SABLE is broken
- #467: SIMPLEPHONEMES is broken
- #465: enable acoustic features by default
- #460: Tokens mistakenly POS-tagged as punctuation cause wrong boundary insertion
- #458: VoiceCompiler generates invalid package name from db.voicename property
- #452: Disable assertions in user startup scripts
- #448: unit selection: final boundary durations synthesized 50% shorter than requested
- #428: error in marytts cart DecisionNode
- #421: Force English locale for parsing date when English language is used
- #409: ensure that ICU4J's resource is read with the correct encoding, regardless of environment
- #398: Use https URLs whenever possible
- #395: component installer hangs if licenses cannot be downloaded
- #375: add Groovy script to generate component descriptor XML and fix POM template
- #369: handle exceptions on missing or malformed userdict entries
- #365: upgrade groovy-maven (formerly gmaven) plugin to solve noClassDefFoundError when running MaryTTS server
- #359: don't append an /6/ to the previous syllable if that syllable is not adjacent
- #354: move custom jtok resources into jtok-user
- #352: javadoc fails with Java 8
- #342: workaround for NullPointerException in syllables that violate sonority constraints
- #341: temporarily handle digit suffix stress notation from legacy LTS CARTs until these are rebuilt
- #322: drop transitional punctuation POS tag logic
- #314: not processing null results from phonemise methods
- #237: fix for incorrect linear interpolation in MathUtils.interpolateNonZeroValues
- #213: fix for rate adjustment
- #206: fix for LTSTrainerTest failure on unexpected file.encoding
- #204: fix for Locale null breaking MaryServer
- #202: URISyntaxException avoids WikipediaMarkupCleanerTest failing if workspace contains space
- #198: fix for closing fileOutputStream after audio save
- #185: fix for EnvironmentTest failure on Java 8
v5.1.2 (2015-01-23)
Another milestone release with several improvements and fixes.
- For French, numbers in the input text were silently ignored; this has been improved using a spellout formatter from ICU4J. This fix can (and will soon) also be applied to other languages that are missing a text Preprocess module.
- A new rule-based Syllabfier class has been implemented.
- All releases are now hosted on GitHub again; references to the Downloads on Bitbucket have been updated accordingly.
- Targets are cached so these do not get recreated, and can be re-used later.
- Words are now added to wordlist in batches for new language components.
- A toString() helper method was added to MaryData class to help with debugging.
- #268: getAllophone() no longer silently returns
null
on invalid input. - #267:
REALISED_DURATIONS
andREALISED_ACOUSTPARAMS
are no longer (mistakenly) available as input types. - #266: Syllabification no longer ignores any provided stress diacritics.
- #258: Byte overflow in TargetFeatureComputer -- issue with limit of array.
- #218: a voice would not be built correctly following the groupId change in v.5.1.1.
- #154: trailing whitespace in config files could prevent correct parsing.
v5.1.1 (2014-10-05)
A milestone release that stabilizes changes in build and hosting infrastructure and fixes several issues.
- Telugu and Turkish: two voices were added which had not been rebuilt since v4.0.0.
- added
TOKENS
example text for Italian, Turkish, British English, Telugu, and Russian. - Voice components are sorted by locale and name in
download/marytts-components.xml
.
- Added to the documentation section on the mary site is a page on MaryTTS's history.
- Also added a point on MP3 support to the FAQ.
- Minor tweaks to navigation menu, earlier version release notes, download links, etc.
- Continuous Integration is now provided by Travis CI.
- The web server running at http://mary.dfki.de has been replaced with a new machine, which hosts the latest stable website content and runs the latest stable build as an online demo.
- The latest release artifacts are hosted at Bintray and indexed in jCenter.
- The latest development (SNAPSHOT) artifacts are hosted at OJO.
- A website built from the latest development (SNAPSHOT) version is hosted at http://marytts.github.io/.
- All supported languages have been moved into a container module
marytts-languages
. - Several maven plugins were updated to the latest versions.
- Building MaryTTS now requires Maven 3.0.2 or later.
- added missing voice resources that had been present in v5.0 before hosting had switched servers, as well as fixing some missing license files.
- #207: Deploying to Bintray/jCenter required changing the groupId from
marytts
tode.dfki.mary
. - #206: LTSTrainerTest would fail if UTF-8 encoding was not specified
- #204: A locale set to
null
no longer breaks the Mary server - #202: WikipediaMarkupCleanerTest failed if workspace contains space
- #185: EnviromentTest no longer fails with Java 8
v5.1 (2014-07-16)
Following significant restructuring introduced in v5.0, this is a milestone release to stabilize numerous new features and fixes over the past two years.
Apart from numerous fixed issues, there are several notable improvements in this version.
Thanks to Sathish Pammi's work at ENST and UPMC in Paris, there is support for French TTS. This supersedes an earlier, experimental implementation, which relied on external, native resources with an incompatible license.
Moreover, thanks to the great work and generosity of the speakers, there are a number of French voices built from three open-source corpora:
- https://github.com/marytts/enst-camille-data
- https://github.com/marytts/upmc-jessica-data
- https://github.com/marytts/upmc-pierre-data
Thanks to Fabio Tesser and Giulio Paci at ISTC in Padova, numerous improvements were made to Italian TTS.
This also includes the resurrection of the HTK Labeler used for voicebuilding as an alternative to the EHMM Labeler, which may produce better results under certain conditions.
Although much of the documentation has yet to be updated, the MaryTTS website itself is now built directly from the MaryTTS source code (using the Maven Site Plugin). This unifies the fossilized legacy content with new documentation, including the GitHub-hosted wiki, and makes it easy to manage.
Finally, thanks to cloud hosting at Bintray, it is now possible to integrate MaryTTS directly into other applications without the requirement to install it first locally, simply by adding the following repository block to your pom.xml
:
<repository>
<id>bintray</id>
<url>https://dl.bintray.com/marytts/marytts</url>
</repository>
There has been some fluctuation in the core development team since the release of v5.0; departures and other responsibilities have resulted in less activity than we would have liked.
Fortunately, we do finally have several new projects for MaryTTS, and several exciting features waiting the wings, ready to be rebased on a stable version (this one), including major overhauls of the voicebuilding toolchain and the new language support, and a new web-based installer. Expect these to land in the near future.
v5.0 (2012-06-14)
MARY TTS 5.0 is the first release from the thoroughly restructured code base.
MARY TTS 5.0 is better supported by automated tests than any previous version of MARY TTS, but it may well be that in practical use some hiccups will appear that have not surfaced in testing. Therefore cautious users are advised to treat this as a beta release.
Installing MARY TTS is now performed by simply unpacking the zip
archive at the target location.
No clicking through installer pages anymore.
In particular, it is now trivial to install MARY TTS on a server without a GUI connection.
The component installer, bin/marytts-component-installer.sh
, still uses a GUI;
see issue #43 for a workaround.
It is now possible to use MARY TTS, with HMM-based voices at least, simply by placing the right files into the classpath.
For example, to use US English voice cmu-slt-hsmm
in your own code, add the following jar files to your classpath:
marytts-server-5.0-jar-with-dependencies.jar
marytts-lang-en-5.0.jar
voice-cmu-slt-hsmm-5.0.jar
Instead of marytts-server-5.0-jar-with-dependencies.jar
you can also include the individual dependencies, which can be automated using maven;
the source code on github includes examples for doing this in the user-examples
folder.
Using MARY TTS programmatically gets a lot simpler with MARY TTS 5.0 through the new MaryInterface API. The same API can be used to access the TTS components running within the same java process or as a separate client-server setup. For details, see MaryInterface.
MARY TTS 5.0 includes an implementation of W3C's Emotion Markup Language as a means of requesting expressive synthetic speech.
The result of course depends on the expressive capabilities of the selected synthesis voice;
try out the EMOTIONML example with the German dfki-pavoque-styles
voice on the demo page.
The MARY TTS source code has been modularized for version 5.0.
The modular structure also makes it easier to see which components belong to a given language:
sub-projects such as marytts-lang-de
, marytts-lang-en
etc. contain all code and data that is language-specific.
The updated New Language Support documentation describes how to create a new language sub-project and integrate it into the system.
The maintenance of the list of installable voices has been decentralized: if you build a voice and wish to make it available to others, you can host it somewhere on the net, e.g. on Google Drive, Dropbox or similar. For details, see Publishing a MARY TTS Voice.
MARY TTS 5.0 mostly results from the aim to establish sound software engineering practices in MARY TTS development. Aspects of this include:
- the build environment was changed to maven, allowing us to write proper unit and integration tests for the code, and run them at every build;
- the code base was moved to github, allowing us to work as a distributed team, to develop features in whichever constellations arise, and to merge them into a stable master repository when ready;
- a Continuous Integration service, kindly provided by BuildHive, automatically checks whether the code, and even the latest GitHub pull requests, build OK including all tests passing.
v4.3.1 (2011-11-30)
This is a minor release, whose main purpose it is to enable Italian TTS (see below). It also includes small but useful improvements enabling the use of MARY TTS with screen readers on Linux (see below).
Thanks to the great work by Fabio Tesser and co-workers at ISTC Padova, Italy, we can now make available support for Italian in MARY TTS. To install the Italian voice, first download and install MARY TTS from the link above, then run the MARY Component installer, click "Update" to get the list of available languages and voices, and select the Italian Language and the istc-lucia-hsmm voice.
Bill Cox has kindly provided a patch resulting in substantial reduction of time-to-audio for HMM-based voices. The key idea is to use the vocoder in streaming mode rather than produce all audio before sending it onwards. Since only the socket server, but not the http server, supports streaming audio, you need to use the socket server or a custom integration mechanism to benefit from this improvement.
Critically, however, this has improved the responsiveness of MARY TTS sufficiently to allow using MARY TTS with screen readers on Linux!
Bill also provided two male US English voices built from Arctic data sets, cmu-rms-hsmm and cmu-bdl-hsmm. According to him, specifically the rms voice is quite intelligible at high speed (with the speedup done in a post-processing step currently).
v4.3.0 (2010-12-20)
This is a feature release, adding new features while maintaining compatibility with existing 4.x voices.
This release marks the final results of work on MARY TTS in the PAVOQUE project, in which we experimented with different technologies for adding expressivity to unit selection synthesis. The release makes available those project results that may be of interest to a wider audience.
- selecting style from a multi-style database using a symbolic style feature;
- imposing target prosody using FD-PSOLA signal modification.
Style can be selected using RAWMARYXML's <prosody style="...">
markup (see new expressive voice, below).
Prosody modification is available for all unit selection voices, including older ones; to activate it, click the checkbox "Apply prosody modification" in the web interface. This feature should be considered experimental, and the quality depends on many factors, including the accuracy of the pitchmarks used for building the unit selection voice. While this feature is likely to lead to reduced quality, it enables research on expressive prosody with unit selection voices.
For more information on the MaryXML <prosody>
markup which can now be applied to all types of MARY voices, see ProsodySpecificationSupport.
- we release the multi-style expressive German voice 'dfki-pavoque-styles' (660 MB) built from the full PAVOQUE corpus;
see Steiner et al. (2010) for a description of this corpus.
The different styles can be requested using RAWMARYXML
<prosody style="A_STYLE">...</prosody>
, whereA_STYLE
is one ofhappy
,angry
,sad
, andpoker
.
- Nickolay Shmyrev has kindly made available language support for Russian, as well as the Russian male unit selection voice voxforge-ru-nsh. Thanks Nickolay!
- This release also includes a number of bugfixes, see https://mary.opendfki.de/trac/query?milestone=4.3&group=status&order=priority
v4.2.0 (2010-12-08)
This is a feature release, adding new features while maintaining compatibility with existing 4.x voices.
This release marks the final results of work on MARY TTS in the SEMAINE project, where our main focus for TTS has been on building expressive British English voices with listener vocalization capabilities.
- Improved support for expressive vocalizations in the British English unit selection voices dfki-poppy, dfki-prudence, dfki-spike and dfki-obadiah. Signal modification is now used to combine intonation contours and segmental forms, as described in Pammi et al. (2010).
- Added vocalization support to the HMM-based versions of these voices: dfki-poppy-hsmm, dfki-prudence-hsmm, dfki-spike-hsmm and dfki-obadiah-hsmm.
For details on the new MaryXML <vocalization>
tag and usage examples, see VocalizationSynthesis.
- For German, we created bits1-hsmm, a female HMM-based voice built from recordings provided by BITS.
- This release also includes a number of bugfixes, see https://mary.opendfki.de/trac/query?milestone=4.2&group=status&order=priority
v4.1.1 (2010-09-09)
This is a bugfix release.
- Fixed a critical bug for Windows users who got an "OutOfMemoryError" when trying to start the server with unit selection voices
- Other small bug fixes -- see https://mary.opendfki.de/trac/query?milestone=4.1.1&group=status&order=priority
- Added Telugu HMM-based voice "cmu-nk-hsmm".
For more background information, please also refer to the Release notes of MARY 4.1.0.
v4.1.0 (2010-09-01)
This is a feature release, adding new features while maintaining compatibility with existing 4.0 voices.
- Prosody control for HMM-based voices. Using the SSML
<prosody>
tag, it is now possible to control the intonation generated for HMM-based voices from markup. Of particular interest is the "contour" attribute, which allows you to change the shape of the intonation curve. For details and examples, see ProsodySpecificationSupport. - Expressive vocalizations in certain unit selection voices.
MaryXML now supports a new
<vocalization>
tag, with which you can request the generation of non-verbal or para-verbal expressions as they are often produced by the listener in a conversation, such as "yeah", "m-hm", laughter, sigh, etc. For details and examples, see VocalizationSynthesis.
- For British English, we release HMM-based versions of the four voices Poppy, Spike, Obadiah and Prudence.
- Updated versions of many of the previously existing voices, with improved quality.
- Faster startup times. MARY 4.1.0 starts nearly twice as fast and needs about 30% less physical memory compared to MARY 4.0.0, due to the use of memory mapping for unit selection voices.
- Quality of German and English components improved. A number of bugs were fixed which had degraded the quality of the synthesis results in MARY 4.0.0:
- Multiple bugs were also fixed under the surface. For a full list, see https://mary.opendfki.de/trac/query?milestone=4.x&group=status&order=priority
For more background information, please also refer to the Release notes of MARY 4.0.0.
v4.0.0 (2009-12-18)
This is the first stable release of the new 4.0 code. MARY 4.0 is a major cleanup over previous versions of MARY TTS.
- 100% Pure Java. All native libraries have been removed from the system. MARY should now run on any platform that has Java 1.5 or newer.
- Fully open source. All code is now open source under the LGPL, including German TTS. Voices are distributed under Creative Commons or BSD licenses.
- Many languages and voices. In addition to several high-quality German and US English voices, this release adds four expressive British English voices built for the SEMAINE project, a Turkish and a Telugu voice. Also, we have made it easy to add more languages and voices in the future.
- New component installer. New languages and voices can be installed using a new component installer tool. If new languages and/or voices are made available for download, they can be installed without having to re-install the full system.
MARY now comes with a toolkit for people who want to add support for new languages or build their own voices. There is detailed documentation for
- creating initial support for a new language;
- recording and building a unit selection voice; and
- creating an HMM-based voice from the same recordings.
- New British English expressive voices. We are releasing the voices of four expressive characters built for the SEMAINE project.
- Improved Installer. You can now install downloaded components without requiring an internet connection at install time. The Installer handles updates of language and voice components correctly.
- Improved German pronounciation. Several thousand words were added to the pronounciation lexicon, bringing the total number of German transcriptions to over 26,000. Some bugs were fixed in the transcription of unknown words, making the synthesis of German more reliable.
- Added MBROLA voices. There are still many people who use MARY with MBROLA diphone voices because of the control over prosody that they provide. Therefore we have added MBROLA voices to this release. They can be installed like the other voices through the MARY component installer.
- Voice creation tools were simplified. The handling of external programs needed, in particular, for the creation of HMM-based voices was simplified. We provide a script now that you can use to find or, if necessary, download and compile third-party software required for training HMM-based voices.
More details on individual issues addressed can be seen in the list of tickets associated with this release and in the list of tickets associated with the 4.0 beta release.
The MARY client and server code was tested on:
- Mac OS X (Intel) 10.5.8 with java 1.6.0_07
- Ubuntu Linux 8.10 and 9.04 with sun java 1.5, 1.6, and openjdk-6
- Windows XP and Vista
The web interface at http://localhost:59125 should work with any recent browser that supports AJAX. We have obtained best results with:
- Firefox 3.5 with built-in audio support
- Firefox 3.0, Internet Explorer 6 and 7, Safari 4, with Quicktime plugin
- On Ubuntu Linux with sun java 1.5 and 1.6, we observed a problem with audio playback in the MARY client: the final section is cut off. With openjdk-6, it works ok.
- On Ubuntu Linux 8.10 with the builtin Firefox 3.0, the audio plugin doesn't work properly.
- Web interface in the browser Chrome suboptimal:
Chrome does not seem to handle the HTML 5
<audio>
tag correctly. It claims it can handle it, but then doesn't play audio. Click on "Save audio file" to get the raw audio data.
Thanks to user feedback, we have fixed a number of bugs found in the beta release. We are therefore confident that the system is ready for production use. Nevertheless it is likely that new bugs will appear in new circumstances.
If you think you have identified a bug, proceed as follows:
- Check in the list of known bugs whether the bug has been reported already;
- If you cannot find it, prepare your request to the mailing list.
Try to find out exactly:
- what to do to reproduce the error;
- expected behaviour;
- Then send an informative email as described in the FAQ.
People interested in adding support for a language can get in touch, e.g. via the MARY developers mailing list.
See all changes since v4.0-beta
v4.0-beta (2009-09-24)
This is a first beta release of the new 4.0 code. MARY 4.0 is a major cleanup over previous versions of MARY TTS.
- 100% Pure Java. All native libraries have been removed from the system. MARY should now run on any platform that has Java 1.5 or newer.
- Fully open source. All code is now open source under the LGPL, including German TTS. Voices are distributed under Creative Commons or BSD licenses.
- New languages. We have added Turkish and Telugu synthesis for now, and have made it easy to add more languages in the future.
MARY now comes with a toolkit for people who want to add support for new languages or build their own voices. There is detailed documentation for
- creating initial support for a new language;
- recording and building a unit selection voice; and
- creating an HMM-based voice from the same recordings.
More details on individual issues addressed can be seen in the list of tickets associated with this release.
The MARY client and server code was tested on:
- Mac OS X (Intel) 10.5.8 with java 1.6.0_07
- Ubuntu Linux 8.10 and 9.04 with sun java 1.5, 1.6, and openjdk-6
- Windows XP and Vista
The web interface at http://localhost:59125 should work with any recent browser that supports AJAX. We have obtained best results with:
- Firefox 3.5 with built-in audio support
- Firefox 3.0, Internet Explorer 6 and 7, Safari 4, with Quicktime plugin
- On Ubuntu Linux with sun java 1.5 and 1.6, we observed a problem with audio playback in the MARY client: the final section is cut off. With openjdk-6, it works ok.
- On Ubuntu Linux 8.10 with the builtin Firefox 3.0, the audio plugin doesn't work properly.
- Web interface in the browser Chrome suboptimal:
Chrome does not seem to handle the HTML 5
<audio>
tag correctly. It claims it can handle it, but then doesn't play audio. Click on "Save audio file" to get the raw audio data.
This is beta software. It is not yet ready for production use. You are likely to find problems, and we would appreciate if you tell us about it.
If you think you have identified a bug, proceed as follows:
- Check in the list of known bugs whether the bug has been reported already;
- If you cannot find it, create a new ticket containing the following information:
- operating system and java version used;
- what to do to reproduce the error;
- expected behaviour;
- detailed log files, i.e. an excerpt of MARY TTS/log/server.log containing the error episode.
- Discuss the problem on the MARY user mailing list.
People interested in adding support for a language can get in touch, e.g. via the MARY developers mailing list.
v3.6.0 (2008-05-21)
This is a minor feature release, providing one important new feature and several bugfixes over previous MARY 3.5.0.
- New tool for creating your own HMM-based voices for MARY (see tutorial at http://mary.opendfki.de/wiki/HMMVoiceCreation) Feedback and questions about building voices in MARY are welcome via the mary-users mailing list (http://www.dfki.de/mailman/listinfo/mary-users).
- Important for many will be that the "self-healing" capabilities of the MARY installation are now working again: if you install an English system but a German voice, the system detects a misconfiguration and offers to download and install the missing components, and will start up normally after that.
- the full list of bugs fixed can be found on http://mary.opendfki.de/query?milestone=3.6
v3.5.0 (2007-12-07)
This release provides substantial new features while maintaining the API compatible to previous versions.
- installer slimmed down to just over 30 MB, including some voices;
- separate voice installer tool, allowing you to download voices comfortably and with an integrity verification prior to installation;
- new synthesis technology: Marcela Charfuelan ported the excellent HMM-based synthesis code from the HTS project (http://hts.sp.nitech.ac.jp/) to Java - several HMM-based voices already ship with MARY;
- audio effects: Oytun Turk implemented a range of audio effects that can be applied to the MARY voices, and can be controlled through the MARYGUIClient;
- voice creation toolkit: Sathish Chandra Pammi and Anna Hunecke have cleaned up and documented the tool we use for building new synthesis voices;
- voice recording tool "Redstart": Mat Wilson has programmed a very nice GUI for recording new voices;
- OGG Vorbis support: under Linux, the MARY server can now generate audio in OGG Vorbis format.
v3.1.0 (2007-08-17)
Ten months after the last stable release, a major milestone release is finally here: MARY 3.1.0.
- state of the art unit selection (English and German);
- support for two more platforms: 64 bit Linux and Mac OS X on Intel platforms;
- a voice creation toolkit (work in progress, see http://mary.opendfki.de/browser/tags/3.1.0/lib/modules/import/README for preliminary documentation if you want to try it out).
Thanks to those who have helped test the beta versions! All the problems that we have become aware of should be fixed in this release. For a reasonably complete list of issues addressed in this release, see http://mary.opendfki.de/milestone/3.1.0
Should you come across additional bugs, please post them to the mary-users mailing list (http://www.dfki.de/mailman/listinfo/mary-users).
The unit selection code released here has performed better-than-average in this year's Blizzard Challenge (http://www.festvox.org/blizzard/), showing that the system can be considered state-of-the-art. For details, see: http://festvox.org/blizzard/bc2007/blizzard_2007/full_papers/blz3_007.pdf
The German voices have been created from the BITS corpora - for details, see: http://www.dfki.de/dfkibib/publications/docs/schroeder_hunecke2007.pdf
See also the full list of bugs fixed
See all changes since v3.1.beta2
v3.1-beta2 (2007-07-15)
- Four German unit selection voices created from recordings in the BITS project;
- added acoustic models, which should also improve the English unit selection voices;
- added support for 64-bit linux and Intel Mac architectures.
- a well-documented and easy-to-use voice creation toolkit;
- smaller voices by using suitable speech coding for the databases.
- For more details, see the development page: https://mary.opendfki.de/trac/query?group=status&milestone=3.1.0
- diphone voices are basically broken. Don't install this version if you want to use diphone voices.
- only very limited testing on various platforms. Please report any errors you may find!
See all changes since v3.1-beta1
v3.1-beta1 (2006-12-13)
First beta release of the new unit selection code. Try the new slt-arctic, bdl-arctic and jmk-arctic voices!
For some details of what has been done and what still needs to be done for the stable release, see the full list of issues on the development portal.
v3.0.3 (2006-10-27)
Third bugfix release. Most relevant changes:
- fixed standalone mode for running MARY (ticket 80)
- improved control over prosody using ToBI tags (tickets 59, 60)
- Several minor improvements of German synthesis (tickets 44, 49, 57, 78)
- MBROLA to AUDIO conversion fixed (tickets 54, 55)
An update is recommended only if you encountered one of these bugs.
See also the full list of bugs fixed
v3.0.2 (2006-07-04)
Second bugfix release. A number of bugs have been fixed, including:
- WAV audio can now (really) be saved with proper audio headers (ticket 38)
- several bugs related to failing validation of intermediate processing results (due to an outdated MaryXML Schema) were fixed (tickets 40, 41, 43)
- Synthesizing APML was improved (tickets 51, 52)
- Pronunciation for German compounds was improved (ticket 36)
An update is recommended if you encountered one of these bugs.
See also the full list of bugs fixed
v3.0.1 (2006-03-07)
First bugfix release. A number of bugs have been fixed which have occurred after the initial release. Most relevant:
- Audio can now be saved with proper audio headers (ticket 32)
- Mary GUI client is now accessible via the keyboard (ticket 3)
An update is recommended.
See also the full list of bugs fixed
v3.0.0 (2006-02-14)
This is the first open-source release. We have put in a lot of effort to test and debug it, but of course the system may have some teething problems. Please help us improve the system by submitting bug reports on the OpenMary development page.