Author: Deb Roy
Publications
Co-authors
Productive Colleagues
Publications
Roy, Deb, Pentland, Alex (2002): Learning words from sights and sounds: a computational model. In Cognitive Science, 26 (1) pp. 113-146. https://dx.doi.org/10.1016/S0364-0213(01)00061-1
Roy, Deb, Pentland, Alex (1998): A Phoneme Probability Display for Individuals with Hearing Disabilities. In: Third Annual ACM Conference on Assistive Technologies , 1998, . pp. 165-168. https://www.acm.org/pubs/articles/proceedings/assets/274497/p165-roy/p165-roy.txt
Roy, Deb (2002): Towards Visually-Grounded Spoken Language Acquisition. In: 4th IEEE International Conference on Multimodal Interfaces - ICMI 2002 14-16 October, 2002, Pittsburgh, PA, USA. pp. 105-110. https://csdl.computer.org/comp/proceedings/icmi/2002/1834/00/18340105abs.htm
Gorniak, Peter, Roy, Deb (2003): Augmenting user interfaces with adaptive speech commands. In: Oviatt, Sharon L., Darrell, Trevor, Maybury, Mark T., Wahlster, Wolfgang (eds.) Proceedings of the 5th International Conference on Multimodal Interfaces - ICMI 2003 November 5-7, 2003, Vancouver, British Columbia, Canada. pp. 176-179. https://doi.acm.org/10.1145/958432.958467
Gorniak, Peter, Roy, Deb (2003): A visually grounded natural language interface for reference to spatial scenes. In: Oviatt, Sharon L., Darrell, Trevor, Maybury, Mark T., Wahlster, Wolfgang (eds.) Proceedings of the 5th International Conference on Multimodal Interfaces - ICMI 2003 November 5-7, 2003, Vancouver, British Columbia, Canada. pp. 219-226. https://doi.acm.org/10.1145/958432.958474
Gorniak, Peter, Roy, Deb (2005): Probabilistic grounding of situated speech using plan recognition and reference resolution. In: Lazzari, Gianni, Pianesi, Fabio, Crowley, James L., Mase, Kenji, Oviatt, Sharon L. (eds.) Proceedings of the 7th International Conference on Multimodal Interfaces - ICMI 2005 October 4-6, 2005, Trento, Italy. pp. 138-143. https://doi.acm.org/10.1145/1088463.1088489
Juster, Joshua, Roy, Deb (2004): Elvis: situated speech and gesture understanding for a robotic chandelier. In: Sharma, Rajeev, Darrell, Trevor, Harper, Mary P., Lazzari, Gianni, Turk, Matthew (eds.) Proceedings of the 6th International Conference on Multimodal Interfaces - ICMI 2004 October 13-15, 2004, State College, PA, USA. pp. 90-96. https://doi.acm.org/10.1145/1027933.1027950
Massaro, Dominic W., Takeda, Kazuya, Roy, Deb, Potamianos, Alexandros (eds.) Proceedings of the 9th International Conference on Multimodal Interfaces - ICMI 2007 November 12-15, 2007, Nagoya, Aichi, Japan.
Pentland, Alex, Roy, Deb, Wren, Christopher Richard (1999): Perceptual Intelligence: learning gestures and words for individualized, adaptive interfac. In: Bullinger, Hans-Jorg (eds.) HCI International 1999 - Proceedings of the 8th International Conference on Human-Computer Interaction August 22-26, 1999, Munich, Germany. pp. 286-290.
Fleischman, Michael, Evans, Humberto, Roy, Deb (2007): Unsupervised content-based indexing for sports video retrieval. In: Lienhart, Rainer, Prasad, Anand R., Hanjalic, Alan, Choi, Sunghyun, Bailey, Brian P., Sebe, Nicu (eds.) Proceedings of the 15th International Conference on Multimedia 2007 September 24-29, 2007, Augsburg, Germany. pp. 473-474. https://doi.acm.org/10.1145/1291233.1291347
Fleischman, Michael, Roy, Brandon, Roy, Deb (2007): Temporal feature induction for baseball highlight classification. In: Lienhart, Rainer, Prasad, Anand R., Hanjalic, Alan, Choi, Sunghyun, Bailey, Brian P., Sebe, Nicu (eds.) Proceedings of the 15th International Conference on Multimedia 2007 September 24-29, 2007, Augsburg, Germany. pp. 333-336. https://doi.acm.org/10.1145/1291233.1291305
Tellex, Stefanie, Roy, Deb (2006): Spatial routines for a simulated speech-controlled vehicle. In: Proceedings of the 1st ACM SIGCHI/SIGART Conference on Human-Robot Interaction , 2006, . pp. 156-163. https://doi.acm.org/10.1145/1121241.1121269
Hsiao, Kai-yuh, Vosoughi, Soroush, Tellex, Stefanie, Kubat, Rony, Roy, Deb (2008): Object schemas for responsive robotic language use. In: Proceedings of the 3rd ACM/IEEE International Conference on Human Robot Interaction , 2008, . pp. 233-240. https://doi.acm.org/10.1145/1349822.1349853
Kollar, Thomas, Tellex, Stefanie, Roy, Deb, Roy, Nicholas (2010): Toward understanding natural language directions. In: Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction , 2010, . pp. 259-266. https://doi.acm.org/10.1145/1734454.1734553
Tellex, Stefanie, Kollar, Thomas, Shaw, George, Roy, Nicholas, Roy, Deb (2010): Grounding spatial language for video search. In: Proceedings of the 2010 International Conference on Multimodal Interfaces , 2010, . pp. 31. https://dx.doi.org/10.1145/1891903.1891944
Roy, Deb (2002): Towards Visually-Grounded Spoken Language Acquisition. In: Proceedings of the 2002 International Conference on Multimodal Interfaces , 2002, . pp. 105. https://doi.acm.org/10.1145/846222.847738
Gorniak, Peter, Roy, Deb (2003): Augmenting user interfaces with adaptive speech commands. In: Proceedings of the 2003 International Conference on Multimodal Interfaces , 2003, . pp. 176-179. https://doi.acm.org/10.1145/958432.958467
Gorniak, Peter, Roy, Deb (2003): A visually grounded natural language interface for reference to spatial scenes. In: Proceedings of the 2003 International Conference on Multimodal Interfaces , 2003, . pp. 219-226. https://doi.acm.org/10.1145/958432.958474
Juster, Joshua, Roy, Deb (2004): Elvis: situated speech and gesture understanding for a robotic chandelier. In: Proceedings of the 2004 International Conference on Multimodal Interfaces , 2004, . pp. 90-96. https://doi.acm.org/10.1145/1027933.1027950
Gorniak, Peter, Roy, Deb (2005): Probabilistic grounding of situated speech using plan recognition and reference resolution. In: Proceedings of the 2005 International Conference on Multimodal Interfaces , 2005, . pp. 138-143. https://doi.acm.org/10.1145/1088463.1088489
Tellex, Stefanie, Roy, Deb (2009): Grounding spatial prepositions for video search. In: Proceedings of the 2009 International Conference on Multimodal Interfaces , 2009, . pp. 253-260. https://doi.acm.org/10.1145/1647314.1647369
Vosoughi, Soroush, Goodwin, Matthew S., Washabaugh, Bill, Roy, Deb (2012): A portable audio/video recorder for longitudinal study of child development. In: Proceedings of the 2012 International Conference on Multimodal Interfaces , 2012, . pp. 193-200. https://dx.doi.org/10.1145/2388676.2388715