WO2004104520A1 - Method of operating a voice-controlled navigation system - Google Patents
Method of operating a voice-controlled navigation system Download PDFInfo
- Publication number
- WO2004104520A1 WO2004104520A1 PCT/IB2004/050706 IB2004050706W WO2004104520A1 WO 2004104520 A1 WO2004104520 A1 WO 2004104520A1 IB 2004050706 W IB2004050706 W IB 2004050706W WO 2004104520 A1 WO2004104520 A1 WO 2004104520A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- geographical
- voice
- dialog
- recognition
- user
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/36—Input/output arrangements for on-board computers
- G01C21/3605—Destination input or retrieval
- G01C21/3608—Destination input or retrieval using speech input, e.g. using speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Definitions
- the invention relates to a method of operating a voice-controlled navigation system.
- the invention relates to a voice-data user interface for a navigation system, a navigation system with a voice-data user interface of this kind, and a computer program in order to execute the method on a processor of a voice-data interface of a navigation system.
- the invention relates to a method of generating a geographical database for use in the said method in order to operate a voice-controlled navigation system.
- the user interface generally comprises a keyboard for inputting the location data.
- location data hereby is geographical data on any locations, areas, buildings, roads etc.
- the more convenient navigation systems are equipped, alternatively or in addition, with a voice-data user interface, with which the user can communicate in natural language. Since a voice-data user interface enables the hands-free operation of the particular equipment, the controlling of navigation systems in motor vehicles in this manner is to be preferred from the safety aspect. The driver can operate the navigation system during the journey without having to take his hands off the vehicle's steering wheel in order to do so.
- a spoken response expressed by the user e.g.
- the word list currently active may also be compiled as a function of recognition results of previous spoken responses within the dialog by the user.
- recognition results of previous spoken responses within the dialog by the user One example here would be that the user has already input, in a previous stage of the dialog, that the destination is located in the federal state of North Rhine-Westphalia. For a voice recognition of the user's spoken response to the subsequent input request "In what town is your destination?" it is then sufficient for all names of towns within the federal state of North Rhine-Westphalia to be included in the word list.
- different recognition hypotheses that have been determined during a voice recognition of a spoken response by the user may be evaluated, using the geographical database, by means of the geographical criteria taken into account in the generation of the previous prompt.
- An evaluation of this kind can also take place as a function of recognition results of spoken responses by the user previously and/or subsequently within the dialog.
- a geographical database with data entries that each have assigned to them one or more markers representing a type of the data entry concerned is particularly preferred.
- a geographical type of data entry would be, for example, whether the data entry concerned relates to a country, a federal state, a town or a large conurbation, or also, in which federal state a town is located, etc.
- the markers may also represent a geographical hierarchy level. Using these markers, a restriction of the database for further steps can be accomplished considerably faster, and/or a word list can be extracted more quickly or post-processed more effectively, since searching is restricted to entries with specific markers, wherein the type of marker, e.g. the current hierarchy level or the currently queried geographical type, is defined for a recognition or evaluation of a specific spoken response by the previous prompt or dialog stage.
- Fig. 2 shows a dialog block diagram to explain one possible dialog sequence between a user and the system in accordance with the invention.
- a voice recognition device 6 which pre-processes the incoming spoken responses S, processes them and supplies recognition hypotheses EH at an output. These recognition hypotheses EH are then further processed in an analysis unit 7 so that the contents of the spoken responses - for instance commands or location details - can be understood.
- a dialog generally begins - following a normal activation, e.g. by a voice command or by manual operation of the equipment - by the dialog manager 3 outputting a prompt output command PB to the prompt generator 5 in order for a particular prompt P to be output to the user.
- the generation of this prompt P takes account of specific geographical criteria GK, which are predetermined in the dialog program or which the dialog manager 3 can retrieve from the geographical database 8.
- data entries DE e.g. the names and further geographical data on countries, regions, federal states, towns, streets, significant sights, full addresses, etc.
- the data entries DE may hereby be entered in different ways into the database 8.
- the individual data entries DE may each contain markers M, which indicate the geographical category or the type to which the data entry DE is assigned, such as ⁇ country>, ⁇ federal state>, ⁇ town>, administrative district of a town>, etc. or ⁇ small town>, ⁇ large conurbation>, ⁇ town of over 1 million inhabitants>, etc.
- the database may also be hierarchically organized and/or divided into different parts.
- the town is queried or, if applicable, a particular region is queried in an intervening hierarchical step.
- the administrative district may be queried in the case of larger towns, and finally, at one of the lower stages, the street name and a house number, or a particular building, etc.
- the dialog sequence is not strictly hierarchically structured from large down to small geographical units per se, being relatively flexible.
- a dialog sequence of this kind may, in certain circumstances, i.e. under good recognition conditions, reach the destination in fewer steps than a dialog sequence with a strictly hierarchical structure.
- the dialog control unit 3 firstly selects, for example, a prompt "To which town do you want to travel?". Then, if applicable, a word list with all town entries available in the database 8 will be compiled. To the extent that no further restrictions have been undertaken previously, this will, of course, be a relatively long list.
- all data entries DE in the database 8 that are located in the vicinity of the large conurbation sought can then be extracted. If applicable, all data entries DE that fulfil the condition of being located near the recognized large conurbation may also be marked in a first step.
- the new word list is then compiled, containing all towns that fulfil the condition. If the spoken response of the user to the previous query as to the desired town has been stored, it is now possible to undertake a voice recognition for this first spoken response once again with the restricted word list in order to arrive at a better recognition result.
- the dialog manager 3 may also induce the prompt generation device 5 to output the first prompt "To which town do you want to travel?" once again and then to undertake the voice recognition of the subsequent spoken response with the restricted word list.
- the invention is not restricted to the above-described embodiment examples - in particular the precise structure of the voice- data user interface or the precise sequence of the explained dialogs - but may be varied to a large degree by a person skilled in the field without exceeding the bounds of the invention.
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP04733066A EP1631791A1 (en) | 2003-05-26 | 2004-05-14 | Method of operating a voice-controlled navigation system |
JP2006530859A JP2007505365A (en) | 2003-05-26 | 2004-05-14 | Voice control navigation system operation method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03101523.3 | 2003-05-26 | ||
EP03101523 | 2003-05-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004104520A1 true WO2004104520A1 (en) | 2004-12-02 |
Family
ID=33462217
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2004/050706 WO2004104520A1 (en) | 2003-05-26 | 2004-05-14 | Method of operating a voice-controlled navigation system |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1631791A1 (en) |
JP (1) | JP2007505365A (en) |
CN (1) | CN1795367A (en) |
WO (1) | WO2004104520A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1860918A1 (en) * | 2006-05-23 | 2007-11-28 | Harman/Becker Automotive Systems GmbH | Communication system and method for controlling the output of an audio signal |
GB2440766A (en) * | 2006-08-10 | 2008-02-13 | Denso Corp | Voice recognition controlled system for providing a disclaimer to be acknowledged before allowing operation of a vehicle navigation system |
EP2003641A2 (en) * | 2006-03-31 | 2008-12-17 | Pioneer Corporation | Voice input support device, method thereof, program thereof, recording medium containing the program, and navigation device |
US20100235091A1 (en) * | 2009-03-13 | 2010-09-16 | Qualcomm Incorporated | Human assisted techniques for providing local maps and location-specific annotated data |
US8938211B2 (en) | 2008-12-22 | 2015-01-20 | Qualcomm Incorporated | Providing and utilizing maps in location determination based on RSSI and RTT data |
US9080882B2 (en) | 2012-03-02 | 2015-07-14 | Qualcomm Incorporated | Visual OCR for positioning |
WO2016133658A1 (en) * | 2015-02-16 | 2016-08-25 | Jaybridge Robotics, Inc. | Assistive vehicular guidance system and method |
US9500492B2 (en) | 2014-03-03 | 2016-11-22 | Apple Inc. | Map application with improved navigation tools |
US10113879B2 (en) | 2014-03-03 | 2018-10-30 | Apple Inc. | Hierarchy of tools for navigation |
CN113364920A (en) * | 2021-06-09 | 2021-09-07 | 中国银行股份有限公司 | Incoming line request processing method and device and electronic equipment |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1939860B1 (en) * | 2006-11-30 | 2009-03-18 | Harman Becker Automotive Systems GmbH | Interactive speech recognition system |
CN105302082A (en) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | Controller apparatus for on-site automatic navigation and car driving by non-specific person foreign language speech |
CN105302079A (en) * | 2014-06-08 | 2016-02-03 | 上海能感物联网有限公司 | Controller apparatus for controlling on-site car driving by Chinese speech |
JP6250121B1 (en) * | 2016-09-16 | 2017-12-20 | ヤフー株式会社 | Map search apparatus, map search method, and map search program |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6230132B1 (en) * | 1997-03-10 | 2001-05-08 | Daimlerchrysler Ag | Process and apparatus for real-time verbal input of a target address of a target address system |
DE19962048A1 (en) * | 1999-12-22 | 2001-07-12 | Detlef Zuendorf | Voice controlled target address recognition for route guidance system for vehicle, involves entering target location using voice and outputting target by voice for verification |
EP1233407A1 (en) * | 2001-02-15 | 2002-08-21 | Navigation Technologies Corporation | Spatially built word list for automatic speech recognition program and method for formation thereof |
EP1298415A2 (en) * | 2001-09-27 | 2003-04-02 | Robert Bosch Gmbh | Navigation system with speech recognition |
-
2004
- 2004-05-14 CN CNA2004800143866A patent/CN1795367A/en active Pending
- 2004-05-14 WO PCT/IB2004/050706 patent/WO2004104520A1/en not_active Application Discontinuation
- 2004-05-14 EP EP04733066A patent/EP1631791A1/en not_active Withdrawn
- 2004-05-14 JP JP2006530859A patent/JP2007505365A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6230132B1 (en) * | 1997-03-10 | 2001-05-08 | Daimlerchrysler Ag | Process and apparatus for real-time verbal input of a target address of a target address system |
DE19962048A1 (en) * | 1999-12-22 | 2001-07-12 | Detlef Zuendorf | Voice controlled target address recognition for route guidance system for vehicle, involves entering target location using voice and outputting target by voice for verification |
EP1233407A1 (en) * | 2001-02-15 | 2002-08-21 | Navigation Technologies Corporation | Spatially built word list for automatic speech recognition program and method for formation thereof |
EP1298415A2 (en) * | 2001-09-27 | 2003-04-02 | Robert Bosch Gmbh | Navigation system with speech recognition |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2003641A2 (en) * | 2006-03-31 | 2008-12-17 | Pioneer Corporation | Voice input support device, method thereof, program thereof, recording medium containing the program, and navigation device |
EP2003641A4 (en) * | 2006-03-31 | 2012-01-04 | Pioneer Corp | Voice input support device, method thereof, program thereof, recording medium containing the program, and navigation device |
EP1860918A1 (en) * | 2006-05-23 | 2007-11-28 | Harman/Becker Automotive Systems GmbH | Communication system and method for controlling the output of an audio signal |
US8019454B2 (en) | 2006-05-23 | 2011-09-13 | Harman Becker Automotive Systems Gmbh | Audio processing system |
GB2440766A (en) * | 2006-08-10 | 2008-02-13 | Denso Corp | Voice recognition controlled system for providing a disclaimer to be acknowledged before allowing operation of a vehicle navigation system |
US7881940B2 (en) | 2006-08-10 | 2011-02-01 | Denso Corporation | Control system |
GB2440766B (en) * | 2006-08-10 | 2011-02-16 | Denso Corp | Control system |
US8938211B2 (en) | 2008-12-22 | 2015-01-20 | Qualcomm Incorporated | Providing and utilizing maps in location determination based on RSSI and RTT data |
US20100235091A1 (en) * | 2009-03-13 | 2010-09-16 | Qualcomm Incorporated | Human assisted techniques for providing local maps and location-specific annotated data |
US8938355B2 (en) * | 2009-03-13 | 2015-01-20 | Qualcomm Incorporated | Human assisted techniques for providing local maps and location-specific annotated data |
US9080882B2 (en) | 2012-03-02 | 2015-07-14 | Qualcomm Incorporated | Visual OCR for positioning |
US9500492B2 (en) | 2014-03-03 | 2016-11-22 | Apple Inc. | Map application with improved navigation tools |
US10113879B2 (en) | 2014-03-03 | 2018-10-30 | Apple Inc. | Hierarchy of tools for navigation |
US10161761B2 (en) | 2014-03-03 | 2018-12-25 | Apple Inc. | Map application with improved search tools |
US11035688B2 (en) | 2014-03-03 | 2021-06-15 | Apple Inc. | Map application with improved search tools |
US11181388B2 (en) | 2014-03-03 | 2021-11-23 | Apple Inc. | Hierarchy of tools for navigation |
WO2016133658A1 (en) * | 2015-02-16 | 2016-08-25 | Jaybridge Robotics, Inc. | Assistive vehicular guidance system and method |
US9464913B2 (en) | 2015-02-16 | 2016-10-11 | Jaybridge Robotics, Inc. | Assistive vehicular guidance system and method |
CN113364920A (en) * | 2021-06-09 | 2021-09-07 | 中国银行股份有限公司 | Incoming line request processing method and device and electronic equipment |
CN113364920B (en) * | 2021-06-09 | 2023-01-20 | 中国银行股份有限公司 | Incoming line request processing method and device and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
JP2007505365A (en) | 2007-03-08 |
CN1795367A (en) | 2006-06-28 |
EP1631791A1 (en) | 2006-03-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6411893B2 (en) | Method for selecting a locality name in a navigation system by voice input | |
US6598018B1 (en) | Method for natural dialog interface to car devices | |
EP1233407B1 (en) | Speech recognition with spatially built word list | |
US7184957B2 (en) | Multiple pass speech recognition method and system | |
US7328155B2 (en) | Method and system for speech recognition using grammar weighted based upon location information | |
EP2226793B1 (en) | Speech recognition system and data updating method | |
US8996385B2 (en) | Conversation system and conversation software | |
EP1050872A2 (en) | Method and system for selecting recognized words when correcting recognized speech | |
US20080177541A1 (en) | Voice recognition device, voice recognition method, and voice recognition program | |
US20080059199A1 (en) | In-vehicle apparatus | |
EP1631791A1 (en) | Method of operating a voice-controlled navigation system | |
US7209884B2 (en) | Speech input into a destination guiding system | |
CN108871370A (en) | Air navigation aid, device, equipment and medium | |
US20120253822A1 (en) | Systems and Methods for Managing Prompts for a Connected Vehicle | |
GB2422011A (en) | Vehicle navigation system and method using speech | |
KR100770644B1 (en) | Method and system for an efficient operating environment in a real-time navigation system | |
JP2001022779A (en) | Interactive information retrieval device, method for interactive information retrieval using computer, and computer-readable medium where program performing interactive information retrieval is recorded | |
JPH0764480A (en) | Voice recognition device for on-vehicle processing information | |
US20090210144A1 (en) | Method for selecting a destination | |
JP3645104B2 (en) | Dictionary search apparatus and recording medium storing dictionary search program | |
Bernsen | On-line user modelling in a mobile spoken dialogue system. | |
WO2006028171A1 (en) | Data presentation device, data presentation method, data presentation program, and recording medium containing the program | |
KR200328847Y1 (en) | Geographical information provider which gives user's previously input schedule together | |
JP4822993B2 (en) | Point search device and navigation device | |
KR100465827B1 (en) | Geographical information provider which gives user's previously input schedule together |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2004733066 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006530859 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20048143866 Country of ref document: CN |
|
WWP | Wipo information: published in national office |
Ref document number: 2004733066 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2004733066 Country of ref document: EP |