Difference between revisions of "Tech Reflect Voice Log"

From ESE205 Wiki
(Initial Create)
 
m (Updated work for Jan 22-28 for Tony)

Revision as of 22:47, 29 January 2018

Jan. 22 -> Jan. 28 2018

Ethan

  • Did some testing and research into speech-to-text (STT) engines
    • Cannot test Snowboy hotword detection until we have a Raspberry Pi and a hardware microphone, since it only runs on Linux
    • Looks like the Google voice API will be best for STT; most others struggle significantly with numbers and names
  • IBM has a good Text-To-Speech engine which we will probably use for our TTS needs

Tony

  • Looked into the cost range of an appropriately sized one-way mirror (18"-24" x 24"-36")
  • Looked into cheaper monitors that could cover at least 3/4 of the mirror's surface.
  • Did minor research into possible designs for the exterior frame of the mirror.
  • Current findings: The mirror will most likely be 24"x36" unless a significantly cheaper option (18"x24") appears in the immediate future. That mirror currently costs $50. I found some relatively inexpensive monitors for around $70, with diagonals of 30"-36", enough to cover the vast majority of the interior of the mirror. The surrounding frame will probably be flat to minimize complications during the 3D printing process. Also, RGB LEDs, either as strips or as individual LEDs at set locations, will be placed somewhere on the frame, either outside or inside the exterior frame, to indicate whether the mirror is listening, processing, or executing a voice command.
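
The 3/4 coverage goal can be checked against the candidate sizes with a little arithmetic. The sketch below is only illustrative: it assumes a 16:9 monitor panel (the log does not state an aspect ratio) and ignores bezels, and the function names are made up for this example. Under those assumptions, a 36" panel would cover roughly 64% of a 24"x36" mirror, so it is worth confirming the real panel dimensions before buying.

```python
import math


def panel_size(diagonal_in, aspect_w=16, aspect_h=9):
    """Return (width, height) in inches for a panel with the given diagonal.

    The 16:9 default is an assumption; the log does not give an aspect ratio.
    """
    unit = diagonal_in / math.hypot(aspect_w, aspect_h)
    return aspect_w * unit, aspect_h * unit


def coverage(diagonal_in, mirror_w_in, mirror_h_in, aspect_w=16, aspect_h=9):
    """Fraction of the mirror area covered by the panel.

    Tries both panel orientations and clips the panel to the mirror,
    since any part hanging past the mirror edge covers nothing.
    """
    w, h = panel_size(diagonal_in, aspect_w, aspect_h)
    best = 0.0
    for pw, ph in ((w, h), (h, w)):
        covered = min(pw, mirror_w_in) * min(ph, mirror_h_in)
        best = max(best, covered)
    return best / (mirror_w_in * mirror_h_in)


if __name__ == "__main__":
    # Mirror sizes and diagonal range taken from the log entry above.
    for mirror in ((36, 24), (24, 18)):
        for diagonal in (30, 36):
            frac = coverage(diagonal, *mirror)
            print(f"{diagonal}\" panel on {mirror[1]}\"x{mirror[0]}\" mirror: {frac:.0%}")
```

Swapping in the actual width and height from a monitor's spec sheet (instead of deriving them from the diagonal) would give a more reliable answer.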

Jan. 29 -> Feb. 4 2018


Tech Reflect Voice Project Page: https://classes.engineering.wustl.edu/ese205/core/index.php?title=Tech_Reflect_Voice