Speech
Recognition,
Technologies
and Applications
490
therefore allows a continuum of dialogue complexities to suit the changing needs of the
vocal human-computer interaction. The particular vocabulary in use at any one time would
depend upon the current position in the grammar syntax tree.
As a noticeable choice in embedded applications necessary for smart homes, Sphinx II is
available in an embedded version called PocketSphinx. Sphinx II was the baseline system
for creating PocketSphinx because it is was faster than other recognizers currently available
in the Sphinx family (Huggins-Daines et al., 2006). The developers claim PocketSphinx is
able to address several technical challenges in deployment
of speech applications on
embedded devices. These challenges include computational requirements of continuous
speech recognition for a medium to large vocabulary scenario, the need to minimize the size
and power consumption for embedded devices which imposes further restrictions on
capabilities and so on (Huggins-Daines et al., 2006).
Actually, PocketSphinx, by creating a four-layer framework including:
frame layer,
Gaussian mixture model (GMM) layer, Gaussian layer, and component layer, allows for
straightforward categorisation of different speed-up techniques based upon the layer(s)
within which they operate.