
Monday, October 24, 2016

PiAUISuite Update and Voicecommand v3.1

Voicecommand allows you to control your Raspberry Pi using only your voice. More information and videos can be found on my YouTube channel or in the original blog posts.

I've made a couple of key changes, and there is a new option for people who want to help with the sox implementation, which aims to make speech recognition continuous rather than chunk-based.


  • The bug in the voicecommand -s hardware has been fixed.
  • Added multilingual support with !lang and !language
  • Fixed casing bug when matching multiple variables
  • Install, Uninstall, and Update scripts are now separated by project. If you only want to update youtube, just run UpdateAUISuite.sh youtube
  • tts and tts-nofill have been combined.
  • Moving away from yt.js for browsing YouTube in the browser and adding a node.js YouTube browsing API instead. See https://github.com/StevenHickson/RaspberryPiTV
  • Building https://github.com/StevenHickson/RaspberryPiTV to work with voicecommand and adding omxcontrols using https://github.com/StevenHickson/omxplayer_fifo
  • Together, the above allows a control panel that can control videos, play Pandora, browse YouTube, control music, and run voicecommand. Note that this is in beta and will require a lot of manual installation, as there is no installer or README yet (hopefully soon to come).
  • Added a youtube-dl cron update so that youtube-dl updates automatically every night. When someone says the youtube script doesn't work, it is often because youtube-dl is out of date and YouTube has updated their security algorithms. Running sudo youtube-dl -U often fixes this problem.
  • Added an option in speech-recog.sh to use sox instead of arecord. Simply uncomment the sox portion and comment out the arecord portion in /usr/bin/speech-recog.sh as below:


sox -r 16000 -t alsa $hardware /dev/shm/out.flac silence 1 0.3 1% 1 0.5 1%


wget -q -U "rate=16000" -O - --post-file /dev/shm/out.flac --header="Content-Type: audio/x-flac; rate=16000" "http://www.google.com/speech-api/v1/recognize?lang=en&client=Mozilla/5.0" | sed -e 's/[{}]//g' | awk -v k="text" '{n=split($0,a,","); for (i=1; i<=n; i++) print a[i]; exit }' | awk -F: 'NR==3 { print $3; exit }'


#arecord -D $hardware -f cd -t wav -d $duration -r 16000 | flac - -f --best --sample-rate 16000 -o /dev/shm/out.flac 1>/dev/shm/voice.log 2>/dev/shm/voice.log; wget -O - -o /dev/null --post-file /dev/shm/out.flac --header="Content-Type: audio/x-flac; rate=16000" "http://www.google.com/speech-api/v1/recognize?lang=$lang" | sed -e 's/[{}]//g' | awk -v k="text" '{n=split($0,a,","); for (i=1; i<=n; i++) print a[i]; exit }' | awk -F: 'NR==3 { print $3; exit }'


rm /dev/shm/out.flac
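
If you want to experiment with the sox option, it may help to see what the silence arguments on the sox line mean. The sketch below is only an illustration: the function name sox_record_cmd and its default values are hypothetical, and it prints the command instead of running it so the thresholds can be checked before recording anything.

```shell
#!/bin/sh
# Hypothetical helper (not part of PiAUISuite): print the sox command used
# for silence-based recording so each argument can be tuned by name.
sox_record_cmd() {
    hardware="${1:-plughw:1,0}"   # assumed ALSA device; substitute your own $hardware
    start_secs="${2:-0.3}"        # sound must last this long before recording starts
    stop_secs="${3:-0.5}"         # this much quiet ends the recording
    threshold="${4:-1%}"          # volume level below which audio counts as silence

    # The first "1 <duration> <threshold>" trims leading silence until sound is heard;
    # the second "1 <duration> <threshold>" stops at the first quiet period of that length.
    echo "sox -r 16000 -t alsa $hardware /dev/shm/out.flac silence 1 $start_secs $threshold 1 $stop_secs $threshold"
}

sox_record_cmd "plughw:1,0"
```

Raising the stop duration (the third argument) should make the recording more tolerant of pauses between words, at the cost of a slower response.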




Please let me know how this works for people so I can debug and get this working permanently. 
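
If you are helping debug, one way to separate recording problems from parsing problems is to run the sed/awk steps on a saved response. This is a rough sketch: the function name extract_utterance is hypothetical, and the sample response is only my assumption of what the reply looks like, so a real reply may differ.

```shell
#!/bin/sh
# Hypothetical helper: run the same sed/awk steps as speech-recog.sh on a
# response read from stdin, so parsing can be checked without a microphone.
extract_utterance() {
    # 1. strip the braces, 2. put each comma-separated item on its own line,
    # 3. take the third colon-separated field of the third line.
    sed -e 's/[{}]//g' |
    awk -v k="text" '{n=split($0,a,","); for (i=1; i<=n; i++) print a[i]; exit }' |
    awk -F: 'NR==3 { print $3; exit }'
}

# Assumed response shape, for illustration only.
sample='{"status":0,"id":"abc","hypotheses":[{"utterance":"hello world","confidence":0.9}]}'

printf '%s\n' "$sample" | extract_utterance   # prints "hello world" (quotes included)
```

If this prints nothing for a response you captured yourself, the reply format has probably changed and the NR==3 and $3 positions need adjusting.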
As always, you can find the install, update, and new YouTube videos at my YouTube channel here:


https://www.youtube.com/channel/UCxa9JQjCl8ij_7za1_sRCVQ/videos






If you are wondering why I've been so quiet, it's because I moved, started grad school at Georgia Tech, and have been doing a technical review for a computer vision book.





Since I'm a poor graduate student, please support my tinkering:







Wednesday, June 8, 2016

PiAUISuite Update

Quick update on everything that is going on!

I've been sick (still am, but hopefully going to the doctor on Friday), so updates have been slow.

I just pushed out a bunch of stuff, and voicecommand v3.0 is finally coming out officially tomorrow night.

NEW FEATURES!
  • Add logging to /dev/shm/voice.log
  • Allow filler to be a set string that can be changed
  • Add push-through mode with text so that voice isn't required for testing, using the -I flag (that is a capital i)
  • Remove quotes in Misc scripts
  • Switch flags to make more sense (now you follow them by a 0 or 1 to override config file options, ex: voicecommand -c0 overrides continuous to turn it off)
  • Add language support with !language==NN (NN is a language code such as en_us, es, or de)
  • Added Wolfram Alpha API support. If you have an API key, you can set it with !api==YOUR-API
  • Better YouTube streaming (still working on making this even better, but YouTube is messing with me)
  • Added Hulu/Vimeo support and streaming
  • Removed need for tts-nofill
  • More advanced install scripts
  • Case insensitive commands and keywords
  • Passthrough support to output command instead of running it
  • I am going to finish the install instructions THIS WEEKEND!
  • I am going to make a GPIO example with a video, hopefully this weekend
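
The !language==NN and !api==YOUR-API options in the list above share a simple !name==value form. As a rough sketch of how such lines can be read from a shell script (the function name get_option and the temporary sample file are hypothetical, and this is not how voicecommand itself parses its config):

```shell
#!/bin/sh
# Hypothetical helper: print the value of a "!name==value" option line.
get_option() {
    # $1 = option name without the leading "!", $2 = path to the config file
    awk -F'==' -v key="$1" '$1 == "!" key { print $2; exit }' "$2"
}

# Sample config written to a temporary file, for illustration only.
conf="$(mktemp)"
cat > "$conf" <<'EOF'
!language==es
!api==YOUR-API
EOF

get_option language "$conf"   # prints: es
rm -f "$conf"
```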
Features for 4.0 that are looking hopeful
  • Getting rid of duration and having voicecommand detect when you stop talking using sox (I've almost got this working as well as the current implementation)
  • A nodejs server that is a variant of RPiTV to control voicecommand, youtube, and my other scripts with a pretty HTML front end.
  • Smart phone control
  • Android advanced app to allow you to record voice on phone and send data
  • XBMC plugin (maybe, this is still a stretch goal).
  • emulationstation and video game support (need to figure out how to get this running from within X11)

Thursday, March 24, 2016

Installing and Updating PiAUISuite and voicecommand

Here are some in-depth demo videos on how to install and update PiAUISuite and voicecommand.

If something goes wrong, get the newest code and reinstall. Make sure you read the README file and regularly update to fix errors and get new features.

Install instructions:

sudo apt-get install git-core
git clone https://github.com/StevenHickson/PiAUISuite.git
cd PiAUISuite/Install/
./InstallAUISuite.sh



Voicecommand setup instructions:

voicecommand -s





Update instructions:



cd PiAUISuite
git pull
cd Install
sudo ./UpdateAUISuite.sh
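
If you update often, the steps above can be wrapped in a small script. This is only a sketch under a few assumptions: the names run, update_piauisuite, and DRY_RUN are hypothetical, the clone is assumed to live in your home directory, and the optional project argument is passed to UpdateAUISuite.sh as in UpdateAUISuite.sh youtube. With DRY_RUN=1 it only prints what it would do.

```shell
#!/bin/sh
# Hypothetical wrapper around the update steps; not part of PiAUISuite.
# With DRY_RUN=1 each command is printed instead of being executed.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

update_piauisuite() {
    repo="${1:-$HOME/PiAUISuite}"   # assumed clone location
    project="$2"                     # optional single project name, e.g. youtube

    # Stop at the first step that fails. $project is left unquoted on
    # purpose so that an empty value adds no argument.
    run cd "$repo" &&
    run git pull &&
    run cd Install &&
    run sudo ./UpdateAUISuite.sh $project
}

DRY_RUN=1
update_piauisuite "$HOME/PiAUISuite" youtube
```

Set DRY_RUN=0 once the printed commands look right for your setup.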





If you've watched the videos, read the posts, and read the README and something still doesn't work, feel free to contact me and ask questions.

Thanks!



Consider donating to further my tinkering since I do all this and help people out for free.


 
Copyright 2009 Information Blog