The paper "Put-That-There": Voice and Gesture at the Graphics Interface, by Richard A. Bolt, explores the integration of voice and gesture inputs in a graphics interface to create a natural and intuitive user experience. The work is conducted by the Architecture Machine Group at MIT, focusing on commanding simple shapes on a large-screen graphics display using voice and pointing gestures. The system leverages connected-speech recognition and position sensing technology to enable precise and natural interactions. Key features include the use of pronouns as "temporary variables" to reference items on the display, enhancing the expressiveness and efficiency of commands. The Media Room, a physical facility designed for this research, serves as a physical embodiment of the user's terminal, integrating a large projection screen, color TV monitors, and interactive controls. The paper discusses specific commands such as "Create," "Move," "Copy," "Make," and "Delete," demonstrating how these can be executed through voice and gesture. The system also supports naming objects, allowing users to label items on the screen. The authors highlight the versatility and ease of use of the interface, suggesting potential applications in various real-world scenarios, such as planning harbor facilities or terrain maps. The paper concludes by emphasizing the natural and spontaneous nature of the interactions, which are compatible with the spirit and nature of the display.The paper "Put-That-There": Voice and Gesture at the Graphics Interface, by Richard A. Bolt, explores the integration of voice and gesture inputs in a graphics interface to create a natural and intuitive user experience. The work is conducted by the Architecture Machine Group at MIT, focusing on commanding simple shapes on a large-screen graphics display using voice and pointing gestures. The system leverages connected-speech recognition and position sensing technology to enable precise and natural interactions. Key features include the use of pronouns as "temporary variables" to reference items on the display, enhancing the expressiveness and efficiency of commands. The Media Room, a physical facility designed for this research, serves as a physical embodiment of the user's terminal, integrating a large projection screen, color TV monitors, and interactive controls. The paper discusses specific commands such as "Create," "Move," "Copy," "Make," and "Delete," demonstrating how these can be executed through voice and gesture. The system also supports naming objects, allowing users to label items on the screen. The authors highlight the versatility and ease of use of the interface, suggesting potential applications in various real-world scenarios, such as planning harbor facilities or terrain maps. The paper concludes by emphasizing the natural and spontaneous nature of the interactions, which are compatible with the spirit and nature of the display.