Projects / Lego Rover

Introduction | Construction | Electronics | Programming | Remote Control

Remote Control

After I got the rover driving around using the keyboard, I made some improvements so it could be controlled remotely from a web browser using touch controls and show a live camera feed from the rover. Here are a video and pictures of the rover being controlled from the web browser with the camera view from the rover.

Remote control from browser

The browser remote control needs quite a few extra bits and changes to the control program to make it all work, which I will explain in the sections below. You can download a zip file containing the code here.

Camera - Take photos using a camera attached to the Raspberry Pi.
Web Server - Serve the web page used to control the rover remotely and handle control commands from the page.
Web Page - Display the camera feed in a web page along with touch controls for the rover. Sends control commands back to the web server on the Rasberry Pi.
Control Program - Take control input for the rover from the web page, rather than directly from the keyboard, and use it to control the motors to drive the rover.
Start/Stop Scripts - Start and stop the camera, web server and rover control program.

Camera

I am using a ZeroCam camera (£15 from PiHut). You could use any camera that works with the Raspberry Pi. The Pi Zero uses a smaller camera connector than than full size Raspberry Pis, so you have to get a camera with the correct connector for whatever Pi version you have or use an adapter cable. As the ZeroCam has a very short cable on it, I've just taped it to the bottom of my Pi Zero case and mount it so the camera is pointing in the right direction.

Once you have a suitable camera attached to you Raspberry Pi, you can take pictures using the raspistill command. Use raspistill -o photo.jpeg to save to a file. You then need to download the photo to view it, unless you have a screen and GUI running on the Pi.

The raspistill command can also be setup to keep taking pictures at a regular interval. The -tl (timelapse) parameter tells it to take a picture every specified number of milliseconds and keep doing it for the time given by -t. E.g. raspistill -o photo-%02d.jpeg -tl 1000 -t 10000 makes it take a series of photos every second for 10 seconds, with a frame number (01 to 10) added to each file name.

In my script to start the rover (start.sh), I am using raspistill -w 640 -h 480 -o photo.jpeg -tl 100 -t 9999999 to take a 640x480 pixel photo every 100ms for a long time and overwrite the same photo.jpeg file every time. I then display this photo in the webpage used to control the rover (see below). I decided to use this approach, rather than getting into proper video streaming, as it is easy to setup and doesn't need extra software installing on the Raspberry Pi.

⇧

Web Server

I am using Node.js for the web server that will run on the Raspberry Pi and serve up the web page that will be used to view the camera feed and control the rover from a web browser on another device (like a phone or tablet). I like Node.js because it allows you to write the code for the web server in JavaScript, which is the same programming language used in web pages and similar to the C++ code used to program the rover. Type node -v at the terminal to check it is installed and what version you have. You can use apt-get to install and update it if needed.

In the code zip file there is a folder called "server" containing two files:

server.js - This is the JavaScript code to launch the web server on the Rasberry Pi. This will serve the web page and camera photos to the browser and respond to control commands sent back from the web page.
index.html - Is the web page that is served by server.js to the web browser. It contains the JavaScript code that runs in the browser to display the camera feed and send back control commands to the server on the Rasberry Pi.

These two bits of code work together to communicate between the browser on your control device (phone/tablet) and the Rasberry Pi on the rover. This is basically how all web pages on the internet work, sending requests from your browser to a server on the internet (or in this case the Raspberry Pi on your local Wifi network), which responds with a HTML page, JPEG photo or some other data to display in your browser. The JavaScript code in the "server.js" file is shown below, followed by an explanation:

var http = require('http');
var url = require('url');
var fs = require('fs');

// Setup the web server (on standard port 80)
http.createServer(function (req, res) {
  
  // Check the requested URL
  var query = url.parse(req.url).search;
  if (query && query.indexOf("?control=") == 0)
  {
  	// Control input sent back from page
  	var control = query.substr(query.indexOf("=") + 1, 1).toUpperCase() + "\n";
  	
  	// Update the control.txt file with the direction (to tell the rover what to do)
  	res.writeHead(200, {'Content-Type': 'text/plain'});
  	fs.writeFile("control.txt", control, function () {});
  	
  	res.end(control);
  }
  else if (req.url.indexOf("/photo.jpeg") == 0)
  {
  	// Request for photo
  	res.writeHead(200, {'Content-Type': 'image/jpeg'});
  	res.end(fs.readFileSync('photo.jpeg'));
  }
  else
  {
  	// Treat anything else as request for the web page
  	res.writeHead(200, {'Content-Type': 'text/html'});
  	res.end(fs.readFileSync('index.html'));
  }
}).listen(80);

The first thing the code above does, after including some modules it needs, is create a HTTP web server listening on port 80 (the standard port for web pages). All the other code inside the curly braces ({}) is the code for our web server, which decides how to respond to requests from the browser. This code checks the request URL (the address sent by the browser) and responds in one of three different ways depending on the URL.

The first "if" block of code, checks if the request is a control command sent back from the page, e.g. "?control=U". (Anything after a ? in the URL is data sent back from a page to the server). If it is a control command, it reads the control character (U = Up, D = Down, L = Left or R = Right) from the request and writes it to a file called "control.txt" in the "server" folder on the Raspberry Pi. The control program on the rover will check this file for commands to control it. I'll explain this later on.
The second "else if" checks if the request is for the "photo.jpeg" image file. If it is it sends back the image file in the response to the web page. This is the last photo taken by the "raspistill" command running on the Raspberry Pi. The web page will keep requesting this photo, so it updates the camera feed in the browser.
The third "else" block, responds to any other request and sends back the "index.html" web page to the browser.

⇧

Web Page

The "index.html" page is served by our web server on the Rasberry Pi and displayed in the browser on our control device. This file contains the HTML code for the web page. HTML is the markup language used for the content of web pages. I won't explain HTML in detail here, if you're new to it have a look at w3schools.com for a quick introduction. These are the main parts of the HTML code in the page:

The <img id="photo" src="photo.jpeg" /> tag displays the latest camera photo in the page.
The <div class="controls"> tag displays the rover control buttons on the screen.
The <style> block near the top contains the CSS styling to layout the page (mainly for the control buttons).
The <script> block contains the JavaScript code that runs in the browser to update the photo and respond to the controls.

Below is the JavaScript code from the page, followed by explanation:

// Set interval to update photo			
setInterval(function () {
	let photo = document.getElementById("photo");
	
	// Add the date and time to the end to make the photo update
	photo.src = "photo.jpeg?v=" + (new Date());
}, 250);

// Setup the controls to respond to touch
document.querySelectorAll(".controls div").forEach(control => {

	control.addEventListener("touchstart", (ev) => {
		// Touch started, change the direction
		directionChange(ev.target.className.substr(0, 1).toUpperCase());
	}, false);
	
	control.addEventListener("touchend", (ev) => {
		// Touch ended, clear the direction
		directionChange("");
	}, false);
});

// Change direction when the controls are touched
function directionChange (direction) {
	
	// Show the selected direction in the center of the controls
	document.getElementById("control").innerText = direction;
	
	// Send the control direction back to the server (Raspberry Pi)
	var http = new XMLHttpRequest();
	http.open("GET", "/?control=" + direction, true);
	http.send();
}

There are three main parts to the JavaScript code in the page:

The call to setInterval sets up a regular update of the photo every 250 milliseconds (4 times a second). Each time it "finds" the photo img element in the page and then updates it's "src" attribute, adding the date to the end, so it is different each time, causing the photo to update.
The next block of code calls document.querySelectorAll to find the control buttons in the page. It sets up event handlers for the "touchstart" and "touchend" events, that occur when you start or stop touching the buttons on the screen, and calls the directionChange function with the control code (U,D,L,R) from the button.
The directionChange function called when you touch the controls, displays the control code in the centre of the controls. It then makes a "request" back to the web server, to send back the control code to the rover. The web server then writes this to a file which is read by the control program (explained below).

⇧

Control Program

The control program is mostly the same as the original explained in the Programming section. The updated code is in the zip file download and the process for compiling and running it is the same as before. The updated "main.cpp" file is shown below. The main change is that the program now runs in the background and doesn't respond to keyboard input like before. It checks for control signals so you can end it with Ctrl + C from the terminal. Rather than checking for key presses with getch, it now reads the "server/control.txt" file that contains the control command code (U,D,L,R) sent back from the webpage. It reads this control code into the "control" variable and then turns on and off the motors depending on the code. U (Up) = both motors forwards, D (Down) = both motors backwards, L = turn left, R = turn right.

#include <iostream>
#include <fstream>
#include <string>
#include "screen.h"
#include "output.h"

using namespace std;

// Main program
int main() {

	// Catch any errors while the program is running
	try {
		// Setup in background mode (no terminal output)
		Screen io = Screen(true);
		
		// Setup the motor outputs
		Output leftForward = Output(17);
		Output leftReverse = Output(27);
		Output rightForward = Output(22);
		Output rightReverse = Output(10);
		
		// Loop until program is stopped by terminate signal
		while (true) {
			
			// Get the control input from the file
			string control = "";
			ifstream file("server/control.txt");
			if (file.is_open()) {
				getline(file, control);
				file.close();
			}
			else {
				// Failed to open file
				throw runtime_error("Failed to open control.txt");
			}
			
			// Check which key is pressed and turn on the correct motors
			if (control == "U") {
				leftForward.on();
				leftReverse.off();
				rightForward.on();
				rightReverse.off();
			}
			else if (control == "D") {
				leftForward.off();
				leftReverse.on();
				rightForward.off();
				rightReverse.on();
			}
			else if (control == "L") {
				leftForward.off();
				leftReverse.on();
				rightForward.on();
				rightReverse.off();;
			}
			else if (control == "R") {
				leftForward.on();
				leftReverse.off();
				rightForward.off();
				rightReverse.on();
			}
			else {
				// Stop, turn all the motors off
				leftForward.off();
				leftReverse.off();
				rightForward.off();
				rightReverse.off();
			}
			
			// Check for signal to end program
			io.checkSignal();
			
			// Sleep for a short time before the next update
			io.sleep(0.1);
		}
	}
	catch (const exception& e) {
		// Error or program aborted
		cerr << e.what() << "\n";
	}
	
	return 0;
}

⇧

Start/Stop Scripts

In the zip file with the code there are also two script files: "start.sh" and "stop.sh". The "start.sh" script does the following:

Clears the "control.txt" file, so that the rover doesn't start moving until it gets a command from the web page.
Starts the web server running: node server.js, to serve the web page and camera feed and respond to controls.
Starts the camera with raspistill in timelapse mode, to take pictures for the camera feed.
Starts the rover control program "drive".

These processes are all started in the background, rather than in the current terminal session, by adding "&" after each command. This means that you can close the terminal after starting everything and control the rover from the web page in the browser. Run sudo sh start.sh to start the rover processes. You can see the running processes on the Raspberry Pi by typing ps at the terminal. Once they have all started up (which can take a few seconds), open a browser on your control device (I use my Android phone or iPad) and enter the local IP address of your Raspberry Pi (e.g. mine is 192.168.0.11) in the address bar. You can find the IP address from the Pi by entering hostname -I at the terminal. You should then see the web page with the image from the camera feed updating and the controls should feed back to the rover and move it. Once you've finished driving, you can run sudo sh stop.sh to stop all the processes running on the Raspberry Pi.