<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8">
<link rel="preconnect" href="https://fonts.gstatic.com">
<link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@500&display=swap" rel="stylesheet">
<style>
body{
font-family: 'Quicksand', sans-serif;
}
div {
margin-bottom: 64px;
}
</style>
</head>
<body>
<h1 style="color: black; border-bottom: 1px solid black;">🐸💬 Coqui TTS - Double Decoder Consistency v2 Samples.</h1>
<div>
<p>
This page presents audio samples from a text-to-speech model trained with the Double Decoder Consistency (DDC) method. DDC trains robust, high-quality TTS models in a short time, with stable attention alignment and faster-than-real-time inference.
</p>
<p>
<b>Model Details:</b> These samples were generated with a DDC model and a HiFiGAN vocoder. The DDC model was trained for 90k steps on a single GPU over 2 days, as explained in the blog post. The vocoder was trained on real spectrograms for 250k steps.
These models synthesize speech with a real-time factor (synthesis time divided by audio duration) of ~0.12 on a GPU and ~1.02 on a CPU.
</p>
<p>
<b>Note that</b> the DDC model is trained on raw characters, which causes pronunciation errors in some examples because English spelling is not phonemic.
</p>
<p>
<b>Coqui TTS:</b> <a href="https://github.com/coqui-ai/TTS">https://github.com/coqui-ai/TTS</a><br/>
<b>Blog Post:</b> <a href='https://coqui.ai/blog/tts/solving-attention-problems-of-tts-models-with-double-decoder-consistency'>here.</a>
</p>
</div>
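<div>
<p>
As a rough illustration of what the real-time factors quoted above mean, the sketch below treats the real-time factor as synthesis time divided by the duration of the generated audio. The function names and the 10-second clip are hypothetical and are not part of Coqui TTS; the only numbers taken from this page are the approximate ~0.12 and ~1.02 figures.
</p>
<pre>
```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Return synthesis time divided by the duration of the generated audio."""
    if not audio_seconds > 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(rtf: float, audio_seconds: float) -> float:
    """Estimate how long synthesis takes for a clip of the given duration."""
    return rtf * audio_seconds


if __name__ == "__main__":
    clip_seconds = 10.0  # hypothetical clip length, chosen only for illustration
    # Approximate real-time factors quoted on this page.
    for device, rtf in (("GPU", 0.12), ("CPU", 1.02)):
        seconds = estimated_synthesis_seconds(rtf, clip_seconds)
        print(f"{device}: about {seconds:.1f} s to synthesize {clip_seconds:.0f} s of audio")
```
</pre>
<p>
A factor below 1 means audio is generated faster than it plays back. Under these assumptions, the GPU figure corresponds to roughly 1.2 seconds of synthesis for a 10-second clip, and the CPU figure to roughly 10.2 seconds.
</p>
</div>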
<div>Try it yourself:</div>
<pre>
<code>
$ pip install -U TTS
$ tts --text "This is my text." --out_path this/is/my/output.wav
</code>
</pre>
<div>
<h2>😀 Basic Samples</h2>
<p>🗨️ Bill got in the habit of asking himself “Is that thought true?” and if he wasn’t absolutely certain it was he just let it go.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s1.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ The Commission also recommends.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s2.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ As a result of these studies, the planning document submitted by the Secretary of the Treasury to the Bureau of the Budget on August thirty-one.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s3.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ The FBI now transmits information on all defectors, a category which would, of course, have included Oswald.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s4.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ The human voice is the most perfect instrument of all.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s5.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ They seem unduly restrictive in continuing to require some manifestation of animus against a Government official.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/s6.wav"></audio></td>
</tr>
</tbody>
</table>
</div>
<div>
<h2>😄 Hard Utterances</h2>
<p>🗨️ someone i know recently combined maple syrup and buttered popcorn thinking it would taste like caramel popcorn it didn't and they don't recommend anyone else do it either the gentleman marches around the principal the divorce attacks near a missing doom the color misprints a circular worry across the controversy.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/hs1.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ if you like tuna and tomato sauce try combining the two it's really not as bad as it sounds the body may perhaps compensates for the loss of a true metaphysics the clock within this blog and the clock on my laptop are on hour different from each other.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/hs2.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ a purple pig and a green donkey flew a kite in the middle of the night and ended up sunburn the contained error poses as a logical target the divorce attacks near a missing doom the opera fines the daily examiner into a murderer.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/hs3.wav"></audio></td>
</tr>
</tbody>
</table>
</div>
<div>
<h2>😁 Long Utterances</h2>
<p>🗨️ Once more unto the breach, dear friends, once more, Or close the wall up with our English dead! In peace there's nothing so becomes a man As modest stillness and humility, But when the blast of war blows in our ears, Then imitate the action of the tiger: Stiffen the sinews, summon up the blood, Disguise fair nature with hard-favored rage; Then lend the eye a terrible aspect: Let it pry through the portage of the head Like the brass cannon; let the brow o'erwhelm it As fearfully as doth a gallèd rock O'erhang and jutty his confounded base, Swilled with the wild and wasteful ocean. Now set the teeth and stretch the nostril wide, Hold hard the breath and bend up every spirit To his full height! On, on, you noble English, Whose blood is fet from fathers of war-proof, Fathers that like so many Alexanders Have in these parts from morn till even fought And sheathed their swords for lack of argument. Dishonor not your mothers; now attest That those whom you called fathers did beget you! Be copy now to men of grosser blood And teach them how to war! And you, good yeomen, Whose limbs were made in England, show us here The mettle of your pasture. Let us swear That you are worth your breeding; which I doubt not, For there is none of you so mean and base That hath not noble lustre in your eyes. I see you stand like greyhounds in the slips, Straining upon the start. The game's afoot! Follow your spirit; and upon this charge Cry 'God for Harry! England and Saint George!.</p>
<table>
<tbody>
<tr><td><iframe width="560" height="315" src="https://www.youtube.com/embed/ADnBCz0Wd1U" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe></td></tr>
<tr>
<td><audio controls preload="none"><source src="wavs/ls1.wav"></audio></td>
</tr>
</tbody>
</table>
<p>🗨️ Everyone is entitled to all the rights and freedoms set forth in this Declaration, without distinction of any kind, such as race, colour, sex, language, religion, political or other opinion, national or social origin, property, birth or other status. Furthermore, no distinction shall be made on the basis of the political, jurisdictional or international status of the country or territory to which a person belongs, whether it be independent, trust, non-self-governing or under any other limitation of sovereignty.</p>
<table>
<tbody>
<tr>
<td><audio controls preload="none"><source src="wavs/ls2.wav"></audio></td>
</tr>
</tbody>
</table>
</div>
</body>
</html>