Speech Recognition for Smart Homes
487
system). Potentially these two classes of recognition task could be performed with two
different types of ASR software employing isolated word and continuous speech
recognition respectively (Nakano et al., 2006). However both are catered for in Sphinx2
using two density acoustic models, namely semi-continuous and continuous. Some other
flexible speech recognition systems have been introduced.
One system by Furui (Furui, 2001) uses a method of automatically
summarizing speech,
sentence by sentence. This is quite domain-specific (closed-domain) with limited vocabulary
size (designed for news transcription), and may not be directly applicable to a continuously
variable vocabulary size smart home system. However it does perform very well, and
provides an indication of the customisations available in such systems.
Vocabulary size,
V
, impacts recognition accuracy, and needs to be related to accuracy
requirement,
R
. Since 100% accuracy is unlikely, smart home systems
need to be able to cope
with inaccuracies through careful design of the interaction and confirmation processes. In