lesson 02 - speech and natural language interfaces
Speech interfaces, which include advanced natural language interfaces, offer hands-free, accessible, and intuitive control for devices like virtual assistants and in-car systems, but they also present challenges related to accuracy, privacy, and suitability for public or complex tasks.
Have you ever asked your phone for the weather, told your TV to play a movie, or asked a smart speaker to set a timer? If so, you've used a Speech-based User Interface. These interfaces are becoming more and more common, moving beyond our phones and into our cars, homes, and even our workplaces.
Learning Outcomes
The Building Blocks (Factual Knowledge)
The Connections and Theories (Conceptual Knowledge)
The Skills and Methods (Procedural Knowledge)
Recall that a speech or natural language interface allows a user to interact with a device using spoken commands.
Describe examples of devices that use speech interfaces, such as smart speakers, smartphones, and in-car entertainment systems.
Describe the difference between a simple voice command (e.g., "Play music") and a natural language query (e.g., "What's the weather like in London tomorrow?").
The Connections and Theories (Conceptual Knowledge)
Describe how a speech interface could enhance the user experience for 'The Majestic Cinema' project, particularly for users with visual or motor accessibility needs.
Analyse the potential challenges of implementing a speech interface for the cinema project, considering factors like background noise in a public space and the accuracy of speech recognition.
Evaluate the suitability of a speech interface for the 'Ticket Booking' screen, considering the need to accurately capture specific and detailed information (e.g., names, dates, number of tickets).
Skill Focus: Digital Communication: Using communication tools and applying digital etiquette ('netiquette').
The Skills and Methods (Procedural Knowledge)
Apply 'netiquette' principles to formulate clear, polite, and unambiguous spoken commands suitable for a public-facing speech interface.
Apply knowledge of the Majestic Cinema scenario to create a list of five specific voice commands that would be useful for a customer (e.g., "What time is 'Jungle Adventure' showing tonight?").
Create a simple script or dialogue showing an interaction between a user and a hypothetical speech-based booking system for the Majestic Cinema.
What's the Difference: Speech vs. Natural Language
While we often use the terms interchangeably, there's a subtle difference:
Speech Interface: This is the broader category. It refers to any interface that can recognise spoken words. Early versions were very basic and could only understand very specific, pre-programmed command words. For example, an early in-car system might only understand "Call David," but not "Ring David."
Natural Language Interface: This is a much smarter type of speech interface. It uses Artificial Intelligence (AI) to understand not just the words you say, but the meaning and context behind them. This is why you can ask your Google Assistant or Siri, "Will I need an umbrella today?" and it knows you're asking for a weather forecast and will check for rain. This ability to understand conversational language is what makes modern virtual assistants so powerful.
For our Majestic Cinema project, incorporating a simple speech interface could be a fantastic way to meet the accessibility needs of our audience, allowing users to easily ask for film times or booking information.

Task Cinema Consultants: The Voice Assistant Pitch
The Brief: You are a Digital Consultant. 'The Majestic Cinema' has hired you to provide advice. They are considering adding a voice assistant feature to their app so customers can ask for film times and information hands-free. They need to know if this technology is a good investment.
Your task is to research speech interfaces and create a one-page digital infographic to advise the cinema's management team.
1
Research
Open the 'Speech & NLI Resource Pack' document.
Read the information carefully to understand the technology.
As you read, fill in the 'Infographic Planning Sheet'. This will become the plan for your infographic.
2
Design your Infographic
Open a new one-page document in Google Slides, PowerPoint, or Canva.
Using your plan, create a professional and visually engaging infographic.
Your infographic must include:
A clear title and a short explanation of what a speech interface is.
At least three real-world examples of where they are used.
A section for Advantages (the 'Pros').
A section for Disadvantages (the 'Cons').
Your final recommendation: A clear "Yes" or "No" to the cinema, with a sentence explaining why, using your research to back it up.
Outcome: I will have produced a professional, one-page digital infographic that analyses speech and natural language interfaces and provides a justified recommendation for a business client.

Application to the component sample PSA
While the Component 1 PSA for "The Majestic Cinema" focuses on creating a visual prototype, understanding Speech and Natural Language Interfaces is highly relevant, especially for justifying your design decisions and considering the diverse needs of users.
Accessibility: A primary application of speech and natural language interfaces is to improve accessibility. For the cinema's information point, you could propose a voice-activated navigation system as a key feature on the 'Facilities & Accessibility' screen. This would be a crucial tool for users with visual impairments or those who have difficulty using a touchscreen. In your design justification for Task 2 (Interface Designs), you could explain that including a microphone icon would allow users to speak commands like "Find showtimes for this evening" or "Read out the accessibility information," thereby making the interface inclusive for a wider audience.
Enhanced User Experience: You could also argue that a natural language chatbot feature would make the ticket booking process faster and more intuitive. Instead of clicking through multiple menus on the 'Ticket Booking' screen, a user could simply type or say, "I'd like two adult tickets for the 8 pm showing of the new superhero movie." Your system could then process this request and fill in the form fields automatically. While you won't build a functional version of this in Task 3, describing this advanced functionality in your report would demonstrate a thorough understanding of modern, user-centred interface design.
Last modified: October 8th, 2025