Skip to content

Commit add9df2

Browse files
authored
Merge pull request #155 from Kitt-AI/devel
v1.2.0
2 parents 9d0af67 + 814cdb7 commit add9df2

File tree

16 files changed

+45
-22
lines changed

16 files changed

+45
-22
lines changed

README.md

Lines changed: 29 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -10,11 +10,17 @@ by [KITT.AI](http://kitt.ai).
1010

1111
(The discussion group is new since September 2016 as we are getting many messages every day. Please send general questions there. For bugs, use Github issues.)
1212

13-
Version: 1.1.1 (3/24/2017)
13+
Version: 1.2.0 (3/25/2017)
1414

1515
## Alexa support
1616

17-
Snowboy now brings hands-free experience to the [Alexa AVS sample app](https://github.com/alexa/alexa-avs-sample-app) on Raspberry Pi! Here is how you can use other Snowboy models
17+
Snowboy now brings hands-free experience to the [Alexa AVS sample app](https://github.com/alexa/alexa-avs-sample-app) on Raspberry Pi! See more info below regarding the performance and how you can use other hotword models.
18+
19+
**Performance**
20+
21+
The performance of hotword detection usually depends on the actually environment, e.g., is it used with a quality microphone, is it used on the street, in a kitchen, or is there any background noise, etc. So we feel it is best for the users to evaluate it in their real environment. For the evaluation purpose, we have prepared an Android app which can be installed and run out of box. The app is here:
22+
23+
* **resources/alexa/SnowboyAlexaDemo.apk**
1824

1925
**Personal model**
2026
* Create your personal hotword model through our [website](https://snowboy.kitt.ai) or [hotword API](https://snowboy.kitt.ai/api/v1/train/)
@@ -125,7 +131,7 @@ Currently we have built wrappers for:
125131

126132
* Java/Android
127133
* Go (thanks to @brentnd)
128-
* Node (thanks to @evancohen)
134+
* Node (thanks to @evancohen and @nekuz0r)
129135
* Perl (thanks to @iboguslavsky)
130136
* Python
131137
* iOS/Swift3 (thanks to @grimlockrocks)
@@ -160,10 +166,15 @@ Here is the list of the models, and the parameters that you have to use for them
160166
* **resources/snowboy.umdl**: Universal model for the hotword "Snowboy". Set
161167
SetSensitivity to 0.5 for better performance.
162168
* **resources/alexa.umdl**: Universal model for the hotword "Alexa". Set
163-
SetSensitivity to 0.5, and preferably set ApplyFrontend to true. This model is
164-
depressed.
165-
* **resources/alexa_02092017.umdl**: Universal model for the hotword "Alexa". This
166-
is still work in progress. Set SetSensitivity to 0.15.
169+
SetSensitivity to 0.5, and preferably set ApplyFrontend (only works on Raspberry
170+
Pi) to true. This model is depressed.
171+
* **resources/alexa/alexa_02092017.umdl**: Universal model for the hotword
172+
"Alexa". This is still work in progress. Set SetSensitivity to 0.15.
173+
* **resources/alexa/alexa-avs-sample-app/alexa.umdl**: Universal model for the
174+
hotword "Alexa" optimized for [Alexa AVS sample app](https://github.com/alexa/alexa-avs-sample-app).
175+
Set SetSensitivity to 0.6, and set ApplyFrontend (only works on Raspberry Pi)
176+
to true. This is so far the best "Alexa" model we released publicly, when
177+
ApplyFrontend is set to true.
167178

168179
## Precompiled node module
169180

@@ -178,13 +189,13 @@ dependencies like `fs`, `wav` or `node-record-lpcm16` depending on which script
178189
you use.
179190

180191
## Precompiled Binaries with Python Demo
181-
* 64 bit Ubuntu [12.04](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/ubuntu1204-x86_64-1.1.1.tar.bz2)
182-
/ [14.04](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/ubuntu1404-x86_64-1.1.1.tar.bz2)
183-
* [MacOS X](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/osx-x86_64-1.1.1.tar.bz2)
192+
* 64 bit Ubuntu [12.04](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/ubuntu1204-x86_64-1.2.0.tar.bz2)
193+
/ [14.04](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/ubuntu1404-x86_64-1.2.0.tar.bz2)
194+
* [MacOS X](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/osx-x86_64-1.2.0.tar.bz2)
184195
* Raspberry Pi with Raspbian 8.0, all versions
185-
([1/2/3/Zero](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/rpi-arm-raspbian-8.0-1.1.1.tar.bz2))
186-
* Pine64 (Debian Jessie 8.5 (3.10.102)) ([download](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/pine64-debian-jessie-1.1.1.tar.bz2))
187-
* Intel Edison (Ubilinux based on Debian Wheezy 7.8) ([download](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/edison-ubilinux-1.1.1.tar.bz2))
196+
([1/2/3/Zero](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/rpi-arm-raspbian-8.0-1.2.0.tar.bz2))
197+
* Pine64 (Debian Jessie 8.5 (3.10.102)) ([download](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/pine64-debian-jessie-1.2.0.tar.bz2))
198+
* Intel Edison (Ubilinux based on Debian Wheezy 7.8) ([download](https://s3-us-west-2.amazonaws.com/snowboy/snowboy-releases/edison-ubilinux-1.2.0.tar.bz2))
188199

189200
If you want to compile a version against your own environment/language, read on.
190201

@@ -372,6 +383,11 @@ See [Full Documentation](http://docs.kitt.ai/snowboy).
372383

373384
## Change Log
374385

386+
**v1.2.0, 3/25/2017**
387+
388+
* Added better Alexa model for [Alexa AVS sample app](https://github.com/alexa/alexa-avs-sample-app)
389+
* New decoder that works well for short hotwords like Alexa
390+
375391
**v1.1.1, 3/24/2017**
376392

377393
* Added Android demo
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
../../../../../resources/alexa_02092017.umdl
1+
../../../../../resources/alexa/alexa_02092017.umdl

examples/iOS/Swift3/SnowboyTest.xcodeproj/project.pbxproj

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@
4343
045F549A1E58A7EA00F8F4A4 /* snowboy-detect.h */ = {isa = PBXFileReference; fileEncoding = 4; lastKnownFileType = sourcecode.c.h; name = "snowboy-detect.h"; path = "../../../../include/snowboy-detect.h"; sourceTree = "<group>"; };
4444
045F549C1E58A8FD00F8F4A4 /* libsnowboy-detect.a */ = {isa = PBXFileReference; lastKnownFileType = archive.ar; name = "libsnowboy-detect.a"; path = "../../../lib/ios/libsnowboy-detect.a"; sourceTree = "<group>"; };
4545
045F549F1E58AAEC00F8F4A4 /* common.res */ = {isa = PBXFileReference; lastKnownFileType = file; name = common.res; path = ../../../../resources/common.res; sourceTree = "<group>"; };
46-
045F54A11E58AAF900F8F4A4 /* alexa_02092017.umdl */ = {isa = PBXFileReference; lastKnownFileType = file; name = alexa_02092017.umdl; path = ../../../../resources/alexa_02092017.umdl; sourceTree = "<group>"; };
46+
045F54A11E58AAF900F8F4A4 /* alexa_02092017.umdl */ = {isa = PBXFileReference; lastKnownFileType = file; name = alexa_02092017.umdl; path = ../../../../resources/alexa/alexa_02092017.umdl; sourceTree = "<group>"; };
4747
04D8BB881E52C242001D71B9 /* SnowboyTest-Bridging-Header.h */ = {isa = PBXFileReference; fileEncoding = 4; lastKnownFileType = sourcecode.c.h; path = "SnowboyTest-Bridging-Header.h"; sourceTree = "<group>"; };
4848
04D8BB891E5382BB001D71B9 /* SnowboyWrapper.h */ = {isa = PBXFileReference; fileEncoding = 4; lastKnownFileType = sourcecode.c.h; path = SnowboyWrapper.h; sourceTree = "<group>"; };
4949
04D8BB8A1E539066001D71B9 /* SnowboyWrapper.mm */ = {isa = PBXFileReference; fileEncoding = 4; lastKnownFileType = sourcecode.cpp.objcpp; path = SnowboyWrapper.mm; sourceTree = "<group>"; };

include/snowboy-detect.h

Lines changed: 13 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -63,9 +63,11 @@ class SnowboyDetect {
6363
// 2: Hotword 2 triggered.
6464
// ...
6565
//
66-
// @param [in] data Small chunk of data to be detected. See
67-
// above for the supported data format.
68-
int RunDetection(const std::string& data);
66+
// @param [in] data Small chunk of data to be detected. See
67+
// above for the supported data format.
68+
// @param [in] is_end Set it to true if it is the end of a
69+
// utterance or file.
70+
int RunDetection(const std::string& data, bool is_end = false);
6971

7072
// Various versions of RunDetection() that take different format of audio. If
7173
// NumChannels() > 1, e.g., NumChannels() == 2, then the array is as follows:
@@ -77,9 +79,14 @@ class SnowboyDetect {
7779
// @param [in] data Small chunk of data to be detected. See
7880
// above for the supported data format.
7981
// @param [in] array_length Length of the data array.
80-
int RunDetection(const float* const data, const int array_length);
81-
int RunDetection(const int16_t* const data, const int array_length);
82-
int RunDetection(const int32_t* const data, const int array_length);
82+
// @param [in] is_end Set it to true if it is the end of a
83+
// utterance or file.
84+
int RunDetection(const float* const data,
85+
const int array_length, bool is_end = false);
86+
int RunDetection(const int16_t* const data,
87+
const int array_length, bool is_end = false);
88+
int RunDetection(const int32_t* const data,
89+
const int array_length, bool is_end = false);
8390

8491
// Sets the sensitivity string for the loaded hotwords. A <sensitivity_str> is
8592
// a list of floating numbers between 0 and 1, and separated by comma. For
-571 KB
Binary file not shown.

lib/edison/libsnowboy-detect.a

111 KB
Binary file not shown.
152 KB
Binary file not shown.

lib/ios/libsnowboy-detect.a

9.28 MB
Binary file not shown.

lib/osx/libsnowboy-detect.a

121 KB
Binary file not shown.

lib/pine64/libsnowboy-detect.a

100644100755
169 KB
Binary file not shown.

0 commit comments

Comments
 (0)