<h1>Traffic Sign Recognition with Tensorflow (2017-03-31), by Giovanni Claudio</h1>
<h1 id="introduction">Introduction</h1>
<p>In this project, I used a convolutional neural network (CNN) to classify traffic signs. I trained and validated a model so it can classify traffic sign images using the <a href="http://benchmark.ini.rub.de/">German Traffic Sign Dataset</a>. After the model is trained, I tried out the model on images of traffic signs that I took with my smartphone camera.</p>
<p>My final model results are:</p>
<ul>
<li>Training set accuracy of 97.5%</li>
<li>Validation set accuracy of 98.5%</li>
<li>Test set accuracy of 97.2%</li>
<li>New Test set accuracy of 100% (6 new images taken by me)</li>
</ul>
<p>Here is the <a href="https://github.com/jokla/CarND-Traffic-Sign-Classifier-Project/blob/master/Traffic_Sign_Classifier.ipynb">project code</a>. Please note that I used only the CPU of my laptop to train the network.</p>
<p>The steps of this project are the following:</p>
<ul>
<li>Load the data set</li>
<li>Explore, summarize and visualize the data set</li>
<li>Design, train and test a model architecture</li>
<li>Use the model to make predictions on new images</li>
<li>Analyze the softmax probabilities of the new images</li>
<li>Summarize the results with a written report</li>
</ul>
<h1 id="data-set-summary--exploration">Data Set Summary & Exploration</h1>
<h2 id="1-basic-summary-of-the-data-set">1. Basic summary of the data set</h2>
<p>I used the Pandas library to calculate summary statistics of the traffic
signs data set:</p>
<ul>
<li>The size of original training set is 34799</li>
<li>The size of the validation set is 4410</li>
<li>The size of test set is 12630</li>
<li>The shape of a traffic sign image is 32x32x3 represented as integer values (0-255) in the RGB color space</li>
<li>The number of unique classes/labels in the data set is 43</li>
</ul>
<p>We have to work with images with a resolution of 32x32x3 representing 43 different types of German traffic signs.</p>
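<p>The summary above can be reproduced with a few lines of code. The sketch below uses only the Python standard library and hypothetical label lists standing in for the real dataset splits (the original notebook uses Pandas):</p>

```python
from collections import Counter

# Hypothetical label lists standing in for the real training/validation splits.
y_train = [0] * 180 + [1] * 1980 + [14] * 690
y_valid = [0] * 30 + [1] * 240 + [14] * 90

n_train = len(y_train)                 # size of the training set
n_valid = len(y_valid)                 # size of the validation set
n_classes = len(set(y_train))          # number of unique classes

print("Training examples:  ", n_train)
print("Validation examples:", n_valid)
print("Unique classes:     ", n_classes)

# Per-class sample counts, as in the class listing further down.
for cls, count in sorted(Counter(y_train).items()):
    print("Class %d: %d samples" % (cls, count))
```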
<h2 id="2-exploratory-visualization-of-the-dataset">2. Exploratory visualization of the dataset</h2>
<p>Here is an exploratory visualization of the data set. It is a bar chart showing how many samples we have for each class.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/dist_class_training.png" width="500" alt="Distribution Class Training" /></p>
<p>We can notice that the distribution is not balanced. Some classes have fewer than 300 examples, while others are well represented with more than 1000 examples. Let us now analyze the validation dataset distribution:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/dist_class_validation.png" width="500" alt="Distribution Class Validation" /></p>
<p>The distributions are very similar. Although it might be wise to balance the dataset, I am not sure it would be very useful in this case. In fact, some traffic signs (for example, the 20&nbsp;km/h speed limit) may simply occur less frequently than others (the stop sign, for example). For this reason, I decided not to balance the dataset.</p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Class 0: Speed limit (20km/h) 180 samples
Class 1: Speed limit (30km/h) 1980 samples
Class 2: Speed limit (50km/h) 2010 samples
Class 3: Speed limit (60km/h) 1260 samples
Class 4: Speed limit (70km/h) 1770 samples
Class 5: Speed limit (80km/h) 1650 samples
Class 6: End of speed limit (80km/h) 360 samples
Class 7: Speed limit (100km/h) 1290 samples
Class 8: Speed limit (120km/h) 1260 samples
Class 9: No passing 1320 samples
Class 10: No passing for vehicles over 3.5 metric tons 1800 samples
Class 11: Right-of-way at the next intersection 1170 samples
Class 12: Priority road 1890 samples
Class 13: Yield 1920 samples
Class 14: Stop 690 samples
Class 15: No vehicles 540 samples
Class 16: Vehicles over 3.5 metric tons prohibited 360 samples
Class 17: No entry 990 samples
Class 18: General caution 1080 samples
Class 19: Dangerous curve to the left 180 samples
Class 20: Dangerous curve to the right 300 samples
Class 21: Double curve 270 samples
Class 22: Bumpy road 330 samples
Class 23: Slippery road 450 samples
Class 24: Road narrows on the right 240 samples
Class 25: Road work 1350 samples
Class 26: Traffic signals 540 samples
Class 27: Pedestrians 210 samples
Class 28: Children crossing 480 samples
Class 29: Bicycles crossing 240 samples
Class 30: Beware of ice/snow 390 samples
Class 31: Wild animals crossing 690 samples
Class 32: End of all speed and passing limits 210 samples
Class 33: Turn right ahead 599 samples
Class 34: Turn left ahead 360 samples
Class 35: Ahead only 1080 samples
Class 36: Go straight or right 330 samples
Class 37: Go straight or left 180 samples
Class 38: Keep right 1860 samples
Class 39: Keep left 270 samples
Class 40: Roundabout mandatory 300 samples
Class 41: End of no passing 210 samples
Class 42: End of no passing by vehicles over 3.5 metric tons 210 samples
</code></pre></div></div>
<h1 id="design-and-test-a-model-architecture">Design and Test a Model Architecture</h1>
<h2 id="1-pre-processing">1. Pre-processing</h2>
<p>This phase is crucial to improving the performance of the model. First of all, I decided to convert the RGB images to grayscale. This reduces the number of channels in the input of the network without decreasing performance. In fact, as Pierre Sermanet and Yann LeCun mention in their paper <a href="http://yann.lecun.com/exdb/publis/pdf/sermanet-ijcnn-11.pdf">“Traffic Sign Recognition with Multi-Scale Convolutional Networks”</a>, using color channels did not seem to improve the classification accuracy. Also, to help the training phase, I normalized each image to the range 0 to 1 and translated it to obtain zero mean. I also applied <a href="https://en.wikipedia.org/wiki/Adaptive_histogram_equalization">Contrast Limited Adaptive Histogram Equalization</a> (CLAHE), an algorithm for local contrast enhancement, which uses histograms computed over different tile regions of the image. Local details can therefore be enhanced even in regions that are darker or lighter than most of the image. This should help feature extraction.</p>
<p>Here is the function I used to pre-process each image in the dataset:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">def</span> <span class="nf">pre_processing_single_img</span> <span class="p">(</span><span class="n">img</span><span class="p">):</span>
<span class="n">img_y</span> <span class="o">=</span> <span class="n">cv2</span><span class="p">.</span><span class="n">cvtColor</span><span class="p">(</span><span class="n">img</span><span class="p">,</span> <span class="p">(</span><span class="n">cv2</span><span class="p">.</span><span class="n">COLOR_BGR2YUV</span><span class="p">))[:,:,</span><span class="mi">0</span><span class="p">]</span>
<span class="n">img_y</span> <span class="o">=</span> <span class="p">(</span><span class="n">img_y</span> <span class="o">/</span> <span class="mf">255.</span><span class="p">).</span><span class="n">astype</span><span class="p">(</span><span class="n">np</span><span class="p">.</span><span class="n">float32</span><span class="p">)</span>
<span class="n">img_y</span> <span class="o">=</span> <span class="p">(</span><span class="n">exposure</span><span class="p">.</span><span class="n">equalize_adapthist</span><span class="p">(</span><span class="n">img_y</span><span class="p">,)</span> <span class="o">-</span> <span class="mf">0.5</span><span class="p">)</span>
<span class="n">img_y</span> <span class="o">=</span> <span class="n">img_y</span><span class="p">.</span><span class="n">reshape</span><span class="p">(</span><span class="n">img_y</span><span class="p">.</span><span class="n">shape</span> <span class="o">+</span> <span class="p">(</span><span class="mi">1</span><span class="p">,))</span>
<span class="k">return</span> <span class="n">img_y</span>
</code></pre></div></div>
<p>Steps:</p>
<ul>
<li>
<p>Convert the image to <a href="https://en.wikipedia.org/wiki/YUV">YUV</a> and extract the Y channel, which corresponds to the grayscale image:<br />
<code class="language-plaintext highlighter-rouge">img_y = cv2.cvtColor(img, (cv2.COLOR_BGR2YUV))[:,:,0]</code>
Y stands for the luma component (the brightness), while U and V are the chrominance (color) components.</p>
</li>
<li>
<p>Normalize the image to the range 0 to 1: <br />
<code class="language-plaintext highlighter-rouge">img_y = (img_y / 255.).astype(np.float32)</code></p>
</li>
<li>
<p>Contrast Limited Adaptive Histogram Equalization (see <a href="http://scikit-image.org/docs/dev/api/skimage.exposure.html#skimage.exposure.equalize_adapthist">here</a> for more information) and translate the result to have mean zero: <br />
<code class="language-plaintext highlighter-rouge">img_y = (exposure.equalize_adapthist(img_y,) - 0.5)</code></p>
</li>
<li>
<p>Finally, reshape the image from (32x32) to (32x32x1), the format required by TensorFlow: <br />
<code class="language-plaintext highlighter-rouge">img_y = img_y.reshape(img_y.shape + (1,))</code></p>
</li>
</ul>
<p>Here is an example of a traffic sign image before and after the processing:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/original_samples3.png" width="360" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/prepro_train.png" width="360" /></p>
<p>Initially, I used <code class="language-plaintext highlighter-rouge">exposure.adjust_log</code>, which is quite fast, but I finally decided to use <code class="language-plaintext highlighter-rouge">exposure.equalize_adapthist</code>, which gives better accuracy.</p>
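<p>Setting CLAHE aside (it needs scikit-image), the normalization and zero-centring steps above can be sketched in plain Python on a toy grayscale image; the toy values are made up for illustration:</p>

```python
def normalize_and_center(img):
    """Scale 8-bit pixel values to [0, 1], then shift them to [-0.5, 0.5].

    A dependency-free stand-in for the normalization steps above; the YUV
    conversion and CLAHE (cv2 / skimage.exposure) are omitted.
    """
    return [[(p / 255.0) - 0.5 for p in row] for row in img]

toy = [[0, 128, 255],
       [64, 192, 32]]
out = normalize_and_center(toy)
print(out[0])  # first row, now centred around zero
```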
<h2 id="2-augmentation">2. Augmentation</h2>
<p>To add more data, I created two new datasets starting from the original training dataset, which is composed of 34799 examples. In this way, I obtained 34799x3 = 104397 samples in the training dataset.</p>
<h2 id="keras-imagedatagenerator">Keras ImageDataGenerator</h2>
<p>I used the Keras function <a href="https://keras.io/preprocessing/image/">ImageDataGenerator</a> to generate new images with the following settings:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>
<span class="n">datagen</span> <span class="o">=</span> <span class="n">ImageDataGenerator</span><span class="p">(</span>
<span class="n">rotation_range</span><span class="o">=</span><span class="mi">17</span><span class="p">,</span>
<span class="n">width_shift_range</span><span class="o">=</span><span class="mf">0.1</span><span class="p">,</span>
<span class="n">height_shift_range</span><span class="o">=</span><span class="mf">0.1</span><span class="p">,</span>
<span class="n">shear_range</span><span class="o">=</span><span class="mf">0.3</span><span class="p">,</span>
<span class="n">zoom_range</span><span class="o">=</span><span class="mf">0.15</span><span class="p">,</span>
<span class="n">horizontal_flip</span><span class="o">=</span><span class="bp">False</span><span class="p">,</span>
<span class="n">dim_ordering</span><span class="o">=</span><span class="s">'tf'</span><span class="p">,</span>
<span class="n">fill_mode</span><span class="o">=</span><span class="s">'nearest'</span><span class="p">)</span>
<span class="c1"># configure batch size and retrieve one batch of images
</span>
<span class="k">for</span> <span class="n">X_batch</span><span class="p">,</span> <span class="n">y_batch</span> <span class="ow">in</span> <span class="n">datagen</span><span class="p">.</span><span class="n">flow</span><span class="p">(</span><span class="n">X_train</span><span class="p">,</span> <span class="n">y_train</span><span class="p">,</span> <span class="n">batch_size</span><span class="o">=</span><span class="n">X_train</span><span class="p">.</span><span class="n">shape</span><span class="p">[</span><span class="mi">0</span><span class="p">],</span> <span class="n">shuffle</span><span class="o">=</span><span class="bp">False</span><span class="p">):</span>
<span class="k">print</span><span class="p">(</span><span class="n">X_batch</span><span class="p">.</span><span class="n">shape</span><span class="p">)</span>
<span class="n">X_train_aug</span> <span class="o">=</span> <span class="n">X_batch</span><span class="p">.</span><span class="n">astype</span><span class="p">(</span><span class="s">'uint8'</span><span class="p">)</span>
<span class="n">y_train_aug</span> <span class="o">=</span> <span class="n">y_batch</span>
<span class="k">break</span>
</code></pre></div></div>
<p>A rotation, a translation, a zoom and a shear transformation are applied to each picture in the training dataset.</p>
<p>Here is an example of an original image and an augmented image:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/original_samples.png" width="360" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/keras_prepro_samples.png" width="360" /></p>
<h2 id="motion-blur">Motion Blur</h2>
<p>Motion blur is the apparent streaking of rapidly moving objects in a still image. I thought it would be a good idea to add motion blur to the images, since they are taken by a camera placed on a moving car.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/original_samples1.png" width="360" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/mb_prepro_samples.png" width="360" /></p>
<h2 id="3-final-model-architecture">3. Final model architecture</h2>
<p>I started from the LeNet network and modified it to use multi-scale features, taking inspiration from the model presented in the paper by <a href="http://yann.lecun.com/exdb/publis/pdf/sermanet-ijcnn-11.pdf">Pierre Sermanet and Yann LeCun</a>. Finally, I increased the number of filters used in the first two convolutions.
We have 3 layers in total: 2 convolutional layers for feature extraction and one fully connected layer for classification. Note that my network has one convolutional layer fewer than the <a href="http://yann.lecun.com/exdb/publis/pdf/sermanet-ijcnn-11.pdf">Pierre Sermanet and Yann LeCun</a> version.</p>
<table>
<thead>
<tr>
<th style="text-align: center">Layer</th>
<th style="text-align: center">Description</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: center">Input</td>
<td style="text-align: center">32x32x1 Grayscale image</td>
</tr>
<tr>
<td style="text-align: center">Convolution 5x5</td>
<td style="text-align: center">1x1 stride, valid padding, outputs 28x28x12</td>
</tr>
<tr>
<td style="text-align: center">RELU</td>
<td style="text-align: center"> </td>
</tr>
<tr>
<td style="text-align: center">Max pooling</td>
<td style="text-align: center">2x2 stride, outputs 14x14x12</td>
</tr>
<tr>
<td style="text-align: center">Dropout (a)</td>
<td style="text-align: center">0.7</td>
</tr>
<tr>
<td style="text-align: center">Convolution 5x5</td>
<td style="text-align: center">1x1 stride, valid padding, outputs 10x10x24</td>
</tr>
<tr>
<td style="text-align: center">RELU</td>
<td style="text-align: center"> </td>
</tr>
<tr>
<td style="text-align: center">Max pooling</td>
<td style="text-align: center">2x2 stride, outputs 5x5x24</td>
</tr>
<tr>
<td style="text-align: center">Dropout (b)</td>
<td style="text-align: center">0.6</td>
</tr>
<tr>
<td style="text-align: center">Fully connected</td>
<td style="text-align: center">max_pool(a) + (b) flattened. Input = 1188. Output = 320</td>
</tr>
<tr>
<td style="text-align: center">Dropout (c)</td>
<td style="text-align: center">0.5</td>
</tr>
<tr>
<td style="text-align: center">Fully connected</td>
<td style="text-align: center">Input = 320. Output = n_classes</td>
</tr>
<tr>
<td style="text-align: center">Softmax</td>
<td style="text-align: center"> </td>
</tr>
</tbody>
</table>
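<p>The output sizes in the table can be checked with a little shape arithmetic. Assuming 5x5 “valid” convolutions with stride 1 and 2x2 pooling (the combination that matches the 32 → 28 → 14 → 10 → 5 progression listed), the flattened multi-scale input to the fully connected layer works out as follows:</p>

```python
def conv_valid(size, kernel):
    """Output size of a 'valid' convolution with stride 1."""
    return size - kernel + 1

def pool(size, stride=2):
    """Output size of 2x2 max pooling with stride 2."""
    return size // stride

c1 = conv_valid(32, 5)   # conv1: 32 -> 28
a = pool(c1)             # pool1: 28 -> 14  (branch a)
c2 = conv_valid(a, 5)    # conv2: 14 -> 10
b = pool(c2)             # pool2: 10 -> 5   (branch b)

# Multi-scale trick: pool branch (a) once more, flatten both branches,
# and concatenate them before the fully connected layer.
flat = pool(a) ** 2 * 12 + b ** 2 * 24
print(flat)  # 7*7*12 + 5*5*24 = 588 + 600 = 1188
```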
<p>To train the model I used 20 epochs with a batch size of 128 and the <a href="https://www.tensorflow.org/api_docs/python/tf/train/AdamOptimizer">AdamOptimizer</a> (see the paper <a href="https://arxiv.org/pdf/1412.6980v8.pdf">here</a>) with a learning rate of 0.001. The training phase is quite slow using only the CPU, which is why I limited it to 20 epochs.</p>
<p>My final model results were:</p>
<ul>
<li>Training set accuracy of 97.5%</li>
<li>Validation set accuracy of 98.5%</li>
<li>Test set accuracy of 97.2%</li>
</ul>
<h3 id="first-attempt-validation-accuracy-915">First attempt: validation accuracy 91.5%</h3>
<p>Initially, I started with the <a href="http://yann.lecun.com/exdb/lenet/">LeNet architecture</a>, a convolutional network designed for handwritten and machine-printed character recognition.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/lenet5.png" width="900" /></p>
<p>I used the following preprocessing pipeline:</p>
<ul>
<li>Convert in YUV, keep the Y</li>
<li>Adjust the exposure</li>
<li>Normalization</li>
</ul>
<p>Parameters:</p>
<ul>
<li>EPOCHS = 10</li>
<li>BATCH_SIZE = 128</li>
<li>Learning rate = 0.001</li>
</ul>
<p>Number of training examples = 34799 <br />
Number of validation examples = 4410 <br />
Number of testing examples = 12630</p>
<p>At each step, I will mention only the changes I adopted to improve the accuracy.</p>
<h3 id="second-attempt-validation-accuracy-931">Second attempt: validation accuracy 93.1%</h3>
<p>I added Dropout after each layer of the LeNet network, with the following keep probabilities: <br />
1) <code class="language-plaintext highlighter-rouge">0.9</code> (after C1) <br />
2) <code class="language-plaintext highlighter-rouge">0.7</code> (after C3) <br />
3) <code class="language-plaintext highlighter-rouge">0.6</code> (after C5) <br />
4) <code class="language-plaintext highlighter-rouge">0.5</code> (after F6)</p>
<h3 id="third-attempt-validation-accuracy-933">Third attempt: validation accuracy 93.3%</h3>
<p>I changed the network using multi-scale features as suggested in the paper <a href="https://www.google.fr/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwi079aWzOjSAhWHJ8AKHUx_ARkQFggdMAA&url=http%3A%2F%2Fyann.lecun.org%2Fexdb%2Fpublis%2Fpsgz%2Fsermanet-ijcnn-11.ps.gz&usg=AFQjCNGTHlNOHKmIxaKYw3_h-VYrsgpCag&sig2=llvR7_9QizK3hkAgkmUKTw">Traffic Sign Recognition with Multi-Scale Convolutional Networks</a> and use only one fully connected layer at the end of the network.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/net.png" width="800" /></p>
<h3 id="fourth-attempt-validation-accuracy-946">Fourth attempt: validation accuracy 94.6%</h3>
<p>I augmented the training set using the Keras function <a href="https://keras.io/preprocessing/image/">ImageDataGenerator</a>. In this way, I doubled the training set.</p>
<p>Number of training examples = 34799x2 = 69598 <br />
Number of validation examples = 4410</p>
<p>I used Dropout with the following keep probabilities (referring to <a href="https://github.com/jokla/CarND-Traffic-Sign-Classifier-Project/blob/master/writeup.md#3-final-model-architecture">this table</a>): <br />
a) <code class="language-plaintext highlighter-rouge">0.8</code> <br />
b) <code class="language-plaintext highlighter-rouge">0.7</code> <br />
c) <code class="language-plaintext highlighter-rouge">0.6</code></p>
<h3 id="fifth-attempt-validation-accuracy-961">Fifth attempt: validation accuracy 96.1%</h3>
<p>Since the training accuracy was not very high, I decided to increase the number of filters in the first two convolutional layers. <br />
First layer: from 6 to 12 filters. <br />
Second layer: from 16 to 24 filters.</p>
<h3 id="final-attempt-validation-accuracy-985">Final attempt: validation accuracy 98.5%</h3>
<p>I augmented the data by adding motion blur to each sample of the training data, tripling the number of samples in the training set. In addition, I added L2 regularization and used the function <code class="language-plaintext highlighter-rouge">equalize_adapthist</code> instead of <code class="language-plaintext highlighter-rouge">exposure.adjust_log</code> during the image preprocessing.</p>
<h2 id="performance-on-the-test-set">Performance on the test set</h2>
<p>Finally, I evaluated the performance of my model on the test set.</p>
<h3 id="accuracy">Accuracy</h3>
<p>The accuracy was equal to 97.2%.</p>
<h3 id="precision">Precision</h3>
<p>The precision was equal to 96.6%. <br />
The precision is the ratio <code class="language-plaintext highlighter-rouge">tp / (tp + fp)</code> where <code class="language-plaintext highlighter-rouge">tp</code> is the number of true positives and <code class="language-plaintext highlighter-rouge">fp</code> the number of false positives. The precision is intuitively the ability of the classifier not to label as positive a sample that is negative.</p>
<h3 id="recall">Recall</h3>
<p>The recall was equal to 97.2%. <br />
The recall is the ratio <code class="language-plaintext highlighter-rouge">tp / (tp + fn)</code> where <code class="language-plaintext highlighter-rouge">tp</code> is the number of true positives and <code class="language-plaintext highlighter-rouge">fn</code> the number of false negatives. The recall is intuitively the ability of the classifier to find all the positive samples.</p>
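<p>Both metrics follow directly from the true-positive, false-positive and false-negative counts. A minimal sketch with made-up labels, treating one class as “positive” in a one-vs-rest fashion:</p>

```python
def precision_recall(y_true, y_pred, positive):
    """Compute precision = tp/(tp+fp) and recall = tp/(tp+fn) for one class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if p == positive and t != positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if p != positive and t == positive)
    return tp / (tp + fp), tp / (tp + fn)

# Made-up labels: class 14 (Stop) vs everything else.
y_true = [14, 14, 14, 2, 2, 14]
y_pred = [14, 14, 2, 2, 14, 14]
prec, rec = precision_recall(y_true, y_pred, positive=14)
print(prec, rec)  # 0.75 0.75
```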
<h3 id="confusion-matrix">Confusion matrix</h3>
<p>Let’s analyze the <a href="https://en.wikipedia.org/wiki/Confusion_matrix">confusion matrix</a>:</p>
<div>
<a href="https://plot.ly/~jokla/1/?share_key=DmJjGBAv9EQXNjMc5jDWCT" target="_blank" title="Plot 1" style="display: block; text-align: center;"><img src="https://plot.ly/~jokla/1.png?share_key=DmJjGBAv9EQXNjMc5jDWCT" alt="Plot 1" style="max-width: 100%;width: 600px;" width="900" onerror="this.onerror=null;this.src='https://plot.ly/404.png';" /></a>
</div>
<p>You can click on the picture to interact with the plot.</p>
<p>We can notice that:</p>
<ul>
<li>28/60 samples of class 19 (Dangerous curve to the left) are misclassified as class 23 (Slippery road). This can be explained by the fact that class 19 is underrepresented in the training set: it has only 180 samples.</li>
<li>34/630 samples of class 5 (Speed limit 80km/h) are misclassified as class 2 (Speed limit 50km/h).</li>
<li>The model is not very good at classifying class 30 (Beware of ice/snow): it misclassified its samples 61 times.</li>
<li>The model produces 80 false positives for the class 23.</li>
</ul>
<h1 id="test-a-model-on-new-images">Test a Model on New Images</h1>
<p>Here are six traffic signs from pictures I took in France with my smartphone: <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/11_Rightofway.jpg" width="100" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/25_RoadWork.jpg" width="100" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/14_Stop.jpg" width="100" /> <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/17_Noentry.jpg" width="100" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/12_PriorityRoad.jpg" width="100" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/33_RightOnly.jpg" width="100" /></p>
<p>Here are the results of the prediction:</p>
<ul>
<li>the first image is the test image</li>
<li>the second one is a random picture of the same class of the prediction</li>
<li>the third one is a plot showing the top five softmax probabilities</li>
</ul>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign1.png" width="480" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign2.png" width="480" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign3.png" width="480" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign4.png" width="480" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign5.png" width="480" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/new_sign6.png" width="480" /></p>
<p>The model was able to correctly guess 6 of the 6 traffic signs, which gives an accuracy of 100%. Nice!</p>
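<p>The top-five softmax probabilities shown in the plots come from the logits of the final layer. A standard-library sketch (the logits below are made up):</p>

```python
import math

def softmax(logits):
    m = max(logits)                       # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def top_k(probs, k=5):
    """Return (class index, probability) pairs sorted by descending probability."""
    return sorted(enumerate(probs), key=lambda pair: pair[1], reverse=True)[:k]

logits = [0.5, 3.2, -1.0, 8.9, 0.1, 2.2, 7.4]   # made-up network outputs
probs = softmax(logits)
for cls, p in top_k(probs):
    print("class %d: %.4f" % (cls, p))
```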
<h1 id="visualize-the-neural-networks-state-with-test-images">Visualize the Neural Network’s State with Test Images</h1>
<p>We can better understand what the weights of a neural network look like by plotting their feature maps. After successfully training the network, we can plot the output of its weight layers in response to a test stimulus image. From these feature maps, it is possible to see which characteristics of an image the network finds interesting. For a sign, the inner feature maps may react with high activation to the sign’s boundary outline or to the contrast of the sign’s painted symbol.</p>
<p>Here is the output of the first convolutional layer:<br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/visualize1.png" width="700" /> <br />
Here is the output of the second convolutional layer:<br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/visualize2.png" width="700" /></p>
<p>We can notice that the CNN learned to detect useful features on its own; in the first picture, for example, we can see some edges of the sign.</p>
<p>Now another example using a test picture with no sign on it:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/no_sign.png" width="100" /></p>
<p>In this case the CNN does not recognize any useful features. The activations of the first feature map appear to contain mostly noise:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-31-traffic-signs/visualize1_nosign.png" width="700" /></p>
<h1 id="final-considerations">Final considerations</h1>
<h2 id="premise">Premise:</h2>
<p>Only the CPU of my laptop was used to train the network, because I did not have a good GPU at my disposal. I chose not to use any online service like AWS or FloydHub, mostly because I was waiting for the arrival of a GTX 1080. Unfortunately, it did not arrive in time for this project. This required me to use a small network and to keep the number of epochs around 20.</p>
<h2 id="some-possible-improvements">Some possible improvements:</h2>
<ul>
<li>I would use Keras to define the network and its ImageDataGenerator function to generate augmented samples on the fly. Using more data could improve the performance of the model. In my case, I generated an augmented dataset once, saved it to disk, and reused it for every training run. It would be better to randomly generate the dataset before each training.</li>
<li>The confusion matrix gives us suggestions to improve the model (see section <code class="language-plaintext highlighter-rouge">Confusion matrix</code>). There are some classes with low precision or recall, and it would be useful to add more data for them. For example, I would generate new samples for class 19 (Dangerous curve to the left), since it has only 180 samples and the model often confuses it with class 23.</li>
<li>The accuracy on the training set is 97.5%, which means the model is probably underfitting a little. I tried making the network deeper (adding more layers) and increasing the number of filters, but it was too slow to train using only the CPU.</li>
<li>The model worked well on new images taken with my camera (100% accuracy). It would be useful to test the model on more complicated examples.</li>
</ul>
<h1>Finding Lane Lines on the Road (2017-03-10), by Giovanni Claudio</h1>
<h1 id="overview">Overview:</h1>
<p>The goal of this project is to make a pipeline that finds lane lines on the road using Python and OpenCV. See an example:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/solidWhiteRight.jpg" width="360" alt="Combined Image" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/laneLines_thirdPass.jpg" width="360" alt="Combined Image" /></p>
<p>The pipeline will be tested on some images and videos provided by Udacity. The following assumptions are made:</p>
<ul>
<li>The camera always has the same position with respect to the road</li>
<li>There is always a visible white or yellow line on the road</li>
<li>We don’t have any vehicle in front of us</li>
<li>We consider highway scenario with good weather conditions</li>
</ul>
<p><a href="https://github.com/jokla/CarND-LaneLines-P1">Here</a> you can find the project.</p>
<hr />
<h1 id="reflection">Reflection</h1>
<h2 id="1-pipeline-description">1. Pipeline description</h2>
<p>I will use the following picture to show you all the steps:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/original.png" width="360" alt="Combined Image" /></p>
<h3 id="color-selection">Color selection</h3>
<p>Firstly, I applied color filtering to suppress non-yellow and non-white colors. Pixels above the thresholds have been retained, while pixels below the thresholds have been blacked out. This is the result:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/mask_color.png" width="360" alt="Combined Image" /></p>
<p>I will keep this mask aside and use it later.</p>
<h3 id="convert-the-color-image-in-grayscale">Convert the color image in grayscale</h3>
<p>The original image is converted to grayscale, so that we have only one channel:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/gray.png" width="360" alt="Combined Image" /></p>
<h3 id="use-canny-for-edge-detection">Use Canny for edge detection</h3>
<p>Before running the <a href="http://docs.opencv.org/2.4/doc/tutorials/imgproc/imgtrans/canny_detector/canny_detector.html">Canny detector</a>, I applied a <a href="http://docs.opencv.org/2.4/modules/imgproc/doc/filtering.html?highlight=gaussianblur#gaussianblur">Gaussian smoothing</a>, which is essentially a way of suppressing noise and spurious gradients by averaging. The Canny detector finds the edges in the image. To improve the result, I also used the OpenCV functions <code class="language-plaintext highlighter-rouge">dilate</code> and <code class="language-plaintext highlighter-rouge">erode</code>.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/canny.png" width="360" alt="Combined Image" /></p>
<h3 id="merge-canny-and-color-selection">Merge Canny and Color Selection</h3>
<p>In some cases, the Canny edge detector fails to find the lines. For example, when there is not enough contrast between the asphalt and the line, as in the challenge video (see section <code class="language-plaintext highlighter-rouge">Optional challenge</code>). The color selection, on the other hand, doesn’t have this problem. For this reason, I decided to merge the result of the Canny detector and the color selection:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/merge.png" width="360" alt="Combined Image" /></p>
<h3 id="region-of-interest-mask">Region Of Interest Mask</h3>
<p>I defined a left and right trapezoidal Region Of Interest (ROI) based on the image size. Since the front-facing camera is mounted in a fixed position, we assume here that the lane lines will always appear in the same region of the image.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/roi.png" width="360" alt="Combined Image" /></p>
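<p>A trapezoidal ROI can be expressed directly as inequalities on the pixel coordinates. A NumPy sketch, with made-up proportions (the project derives its own trapezoids from the image size):</p>

```python
import numpy as np

def roi_mask(h, w):
    """Trapezoid: narrow near the image centre, full width at the bottom.
    The 0.6/0.45/0.55 proportions are illustrative assumptions."""
    ys, xs = np.mgrid[0:h, 0:w]
    top, bottom = 0.6 * h, float(h)                   # keep only the lower 40% of rows
    t = np.clip((ys - top) / (bottom - top), 0, 1)    # 0 at `top`, 1 at the bottom row
    left = w * (0.45 - 0.45 * t)                      # x-extent widens towards the bottom
    right = w * (0.55 + 0.45 * t)
    return ((ys >= top) & (xs >= left) & (xs <= right)).astype(np.uint8)

mask = roi_mask(100, 100)   # multiply (or AND) this with the edge image
```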
<h3 id="run-hough-transform-to-detect-lines">Run Hough transform to detect lines</h3>
<p>The Hough transform is used to detect lines in the images. At this step, I applied a slope filter to get rid of horizontal lines. This is the result: <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/hough.png" width="360" alt="Combined Image" /></p>
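<p>The slope filter applied to the Hough segments can be sketched in plain Python. Segments are in the <code>(x1, y1, x2, y2)</code> format that <code>cv2.HoughLinesP</code> returns; the minimum slope is an illustrative value:</p>

```python
def filter_by_slope(segments, min_abs_slope=0.3):
    """Drop near-horizontal segments and split the rest into left/right lane
    candidates by slope sign (image y grows downwards, so the left lane
    line has negative slope)."""
    left, right = [], []
    for x1, y1, x2, y2 in segments:
        if x2 == x1:
            continue                      # vertical segment: skipped for simplicity
        slope = (y2 - y1) / (x2 - x1)
        if abs(slope) < min_abs_slope:
            continue                      # near-horizontal: not a lane line
        (left if slope < 0 else right).append((x1, y1, x2, y2))
    return left, right

segs = [(10, 90, 40, 30), (60, 30, 90, 90), (10, 50, 90, 52)]
left, right = filter_by_slope(segs)       # the last, flat segment is discarded
```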
<h3 id="compute-lines">Compute lines</h3>
<p>Now I need to average/extrapolate the result of the Hough transform and draw the two lines onto the image. I used the function <a href="http://docs.opencv.org/2.4/modules/imgproc/doc/structural_analysis_and_shape_descriptors.html#fitline"><code class="language-plaintext highlighter-rouge">fitLine</code></a>, after having extracted the points from the Hough transform result with the OpenCV function <code class="language-plaintext highlighter-rouge">findNonZero</code>. I did this twice, once for the right line and once for the left line. As a result, I got the slopes of the two lines and could draw them onto the original picture:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/final.png" width="360" alt="Combined Image" /></p>
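<p>The averaging/extrapolation step can be sketched with a least-squares fit. Here <code>np.polyfit</code> stands in for OpenCV's <code>fitLine</code>; fitting <code>x</code> as a function of <code>y</code> keeps near-vertical lane lines well conditioned:</p>

```python
import numpy as np

def fit_and_extrapolate(points, y_bottom, y_top):
    """Fit x = m*y + c through the edge points of one lane line and return
    the segment end points at two chosen image rows."""
    xs = np.array([p[0] for p in points], dtype=float)
    ys = np.array([p[1] for p in points], dtype=float)
    m, c = np.polyfit(ys, xs, 1)
    return ((int(round(m * y_bottom + c)), y_bottom),
            (int(round(m * y_top + c)), y_top))

pts = [(10, 90), (20, 70), (30, 50)]      # points from one side of the lane
p1, p2 = fit_and_extrapolate(pts, y_bottom=100, y_top=40)
```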
<h1 id="results">Results:</h1>
<h2 id="pictures">Pictures</h2>
<p>Here are some results on test images provided by Udacity: <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/final.png" width="360" alt="Combined Image" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/result1.png" width="360" alt="Combined Image" /> <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/result2.png" width="360" alt="Combined Image" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/result3.png" width="360" alt="Combined Image" /> <br />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/result4.png" width="360" alt="Combined Image" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/result5.png" width="360" alt="Combined Image" /></p>
<p>You can find the original pictures and the results in the folder <code class="language-plaintext highlighter-rouge">test_images</code>.</p>
<h2 id="videos">Videos</h2>
<p>Here are some results on test videos provided by Udacity:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/white.gif" alt="" />
<img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/yellow.gif" alt="" /></p>
<p>You can find the video files here: <a href="https://github.com/jokla/CarND-LaneLines-P1/blob/master/yellow.mp4">video1</a>, <a href="https://github.com/jokla/CarND-LaneLines-P1/blob/master/white.mp4">video2</a>.</p>
<h3 id="optional-challenge">Optional challenge:</h3>
<p>While I got a satisfactory result on the first two videos provided by Udacity, this was not the case for the challenge video, which presents several additional difficulties:</p>
<ul>
<li>The color of the asphalt became lighter at a certain point. The Canny edge detector is not able to find the line using the grayscale image (where we lose information about the color)</li>
<li>The car is driving on a curving road</li>
<li>There are some shadows due to some trees</li>
</ul>
<p>To overcome these problems, I introduced the color mask and resized the ROI. This is the result, using only the color mask (without the Canny detection):</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/extra.gif" alt="" /></p>
<p>You can find the video file here: <a href="https://github.com/jokla/CarND-LaneLines-P1/blob/master/extra.mp4">video_challenge</a></p>
<p>The right line is a little jumpy, mainly because of the curve: the function <code class="language-plaintext highlighter-rouge">fitLine</code> is trying to fit a straight line on a curving lane. It would be useful to shrink the ROI in this case, but I preferred to keep the same ROI size used in the first two videos.</p>
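<p>One way to reduce this jumpiness would be to smooth the per-frame estimates over time. A minimal sketch with an exponential moving average (the weight <code>alpha</code> is an illustrative choice; the project does not implement this):</p>

```python
def smooth(estimates, alpha=0.2):
    """Exponential moving average over per-frame slope estimates: each new
    frame moves the running value only part of the way, so one noisy
    detection cannot make the drawn line jump."""
    out, current = [], None
    for e in estimates:
        current = e if current is None else alpha * e + (1 - alpha) * current
        out.append(current)
    return out

slopes = [0.50, 0.51, 0.49, 0.80, 0.50]   # one outlier frame
smoothed = smooth(slopes)                 # the 0.80 spike is strongly damped
```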
<p>If we analyze the steps using a snapshot from the challenge video, we can notice that the Canny detector is not very useful:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/original_challenge.png" width="360" alt="Combined Image" /> <img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/canny_challenge.png" width="360" alt="Combined Image" /></p>
<p>while the color mask is able to detect the lines:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/color_challenge.png" width="360" alt="Combined Image" /></p>
<p>Indeed, as you can see in the following picture, we lose valuable color information when we convert the image to grayscale. Moreover, the Canny operator finds a lot of edges when there are shadows on the road.</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/gray_challenge.png" width="360" alt="Combined Image" /></p>
<h3 id="testing-the-pipeline-on-a-youtube-video">Testing the pipeline on a YouTube Video:</h3>
<p>Just out of curiosity, I wanted to test the pipeline on a video extracted from YouTube (see the original video <a href="https://www.youtube.com/watch?v=jwBaGY67olI">here</a>).</p>
<p>I noticed that the color selection was not working properly in this case, so I had to tune the threshold values a little. This is the new result, using both color selection and Canny:</p>
<p><img src="https://raw.githubusercontent.com/jokla/jokla.github.io/master/images/post/2017-03-10-lane-detection/extra_test.gif" alt="" /></p>
<p>You can find the video file here: <a href="https://github.com/jokla/CarND-LaneLines-P1/blob/master/extra_test_result.mp4">video_extra</a></p>
<p>It would be wiser to convert the image to the HSV color space and apply the color selection there, instead of on the RGB image.</p>
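<p>The point of HSV can be illustrated with the standard library's <code>colorsys</code>: the same yellow paint in bright sun and in shadow has very different RGB values but almost the same hue, so a single hue threshold covers both cases (the RGB triples below are made-up examples):</p>

```python
import colorsys

bright = (230, 210, 40)    # hypothetical yellow line in full sun
shadow = (115, 105, 20)    # the same paint, half as bright

h1, s1, v1 = colorsys.rgb_to_hsv(*[c / 255 for c in bright])
h2, s2, v2 = colorsys.rgb_to_hsv(*[c / 255 for c in shadow])

# The value (brightness) channel halves, but the hue barely moves,
# which is why thresholding on hue is more robust than on raw RGB.
```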
<h2 id="2-potential-shortcomings-with-the-current-pipeline">2. Potential shortcomings with the current pipeline</h2>
<ul>
<li>This approach may not work properly:
<ul>
<li>if the camera is placed at a different position</li>
<li>if other vehicles in front are occluding the view</li>
<li>if one or more lines are missing</li>
<li>in different weather and lighting conditions (fog, rain, or at night)</li>
</ul>
</li>
</ul>
<h2 id="3-possible-improvements">3. Possible improvements</h2>
<p>Some possible improvements:</p>
<ul>
<li>Perform the color selection in the HSV space, instead of in RGB</li>
<li>Update the ROI mask dynamically</li>
<li>Perform a segmentation of the road</li>
<li>Use a better filter to smooth the current estimate using the previous ones</li>
<li>If a line is not detected, we could estimate the current slope using the previous estimates and/or the other line’s detection</li>
<li>Use a moving-edges tracker for the continuous lines</li>
</ul>Giovanni ClaudioFirst project of the Udacity Self Driving Car NanoDegreeInstall Unity3D in Ubuntu2017-02-05T00:00:00+00:002017-02-05T00:00:00+00:00http://jokla.me/self-driving-car/install-unity3d-ubuntu<p>I wanted to test on my laptop this nice project created with Unity3D by <a href="https://github.com/tawnkramer">tawnkramer</a>:</p>
<iframe width="640" height="360" src="https://www.youtube.com/embed/e0AFMilaeMI" frameborder="0" allowfullscreen=""></iframe>
<p>Unity3D is used to simulate a self-driving car. A neural network based on the Nvidia’s paper <a href="https://images.nvidia.com/content/tegra/automotive/images/2016/solutions/pdf/end-to-end-dl-using-px.pdf">“End to End Learning for Self-Driving Cars”</a> is trained to drive the car down a randomly generated road. You can find the source code <a href="https://github.com/tawnkramer/sdsandbox">here</a>.</p>
<h2 id="download-unity-for-linux">Download Unity for Linux</h2>
<p>Download the latest version <a href="https://forum.unity3d.com/threads/unity-on-linux-release-notes-and-known-issues.350256/">here</a>. I downloaded the Debian package: <a href="http://beta.unity3d.com/download/35e1927e3b6b/public_download.html">Unity 5.6.0xb3Linux</a>.</p>
<h2 id="installation">Installation</h2>
<p>You can install the .deb package via the Ubuntu Software Center; it is expected to work on Ubuntu 12.04 or newer.</p>
<h2 id="fix-black-screen-launching-the-application">Fix grey screen when launching the application</h2>
<p>Initially, Unity3D did not start properly; I was only getting a grey screen. <a href="https://forum.unity3d.com/threads/dark-grey-screen-fix.448936/">This solution</a>, posted by malyzeli, solved the problem.</p>Giovanni ClaudioTested in Ubuntu 14.04How to compute frame transformations with Tf (ROS)2016-08-08T00:00:00+00:002016-08-08T00:00:00+00:00http://jokla.me/robotics/lookup-transform-ros<p>Here is a script to compute the transformation between two frames published in /tf in ROS.<br />
You can <a href="https://gist.github.com/jokla/2630f8dbbd4391d91ab28b2f3f76801a/raw/fbf02a7ec13c4d7f61e9aea86719f866f18f539b/computeTF2frames.py">download</a> the python script, modify the name of the topics and run it with<br />
<code class="language-plaintext highlighter-rouge">python computeTF2frames.py</code></p>
<p>The result:</p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Translation: (0.06290424180516997, -0.004345408291908189, 1.1515618071559173)
Rotation: (-0.5041371308002005, 0.5088056423602352, -0.49116582568647843, 0.4957002151791443)
</code></pre></div></div>
<script src="https://gist.github.com/2630f8dbbd4391d91ab28b2f3f76801a.js"> </script>Giovanni Claudiousing the function lookupTransformIntrinsic camera calibration for Nao/Romeo/Pepper with Visp2016-08-04T00:00:00+00:002016-08-04T00:00:00+00:00http://jokla.me/robotics/camera-calibration-visp<p>We will show here how to estimate the <a href="http://ksimek.github.io/2013/08/13/intrinsic/">camera intrinsic parameters</a> for the robot Nao, Romeo or Pepper, using the <a href="http://visp-doc.inria.fr/doxygen/visp-2.8.0/tutorial-calibration.html">ViSP camera calibration tool</a>.</p>
<p>First of all, we need ViSP, <code class="language-plaintext highlighter-rouge">visp_naoqi</code> and the C++ SDK from SoftBank. You can follow <a href="http://jokla.me/robotics/visp_naoqi/">this guide</a>.</p>
<p>Once everything is working we can run the program to estimate the parameters:</p>
<ul>
<li>Go to the build folder of <code class="language-plaintext highlighter-rouge">visp_naoqi</code> via terminal</li>
<li>
<p>Run the program <code class="language-plaintext highlighter-rouge">camera_calibration</code>:</p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Usage: ./sdk/bin/camera_calibration [ --config <configuration file>.cfg] [--ip <robot address>] [--port <port robot>] [--cam camera_number] [--name camera_name] [--vga] [--help]
</code></pre></div> </div>
<p>Here the explanation of the options:</p>
<ul>
<li>[ <code class="language-plaintext highlighter-rouge">--config <configuration file>.cfg</code>] The path to a configuration file where we define the kind of pattern we are using (size of the grid and dimension of the circles/squares). You can find two examples here: <a href="https://github.com/lagadic/visp_naoqi/blob/master/tools/calibration/default-chessboard.cfg">default-chessboard.cfg</a> or <a href="https://github.com/lagadic/visp_naoqi/blob/master/tools/calibration/default-circles.cfg">default-circles.cfg</a></li>
<li>[<code class="language-plaintext highlighter-rouge">--ip <robot address></code>] Set the IP address of the robot.</li>
<li>[<code class="language-plaintext highlighter-rouge">--port <port robot></code>] Set the port of the robot (default: 9559).</li>
<li>[<code class="language-plaintext highlighter-rouge">--cam camera_number</code>] Choose the camera you want to use. For Pepper and Nao 0 = TopCamera, 1 = BottomCamera.</li>
<li>[<code class="language-plaintext highlighter-rouge">--name camera_name</code>] Set the name of the camera.</li>
<li>[<code class="language-plaintext highlighter-rouge">--vga</code>] Set the camera resolution to 640x480. Default resolution: 320x240<br />
Example:</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ ./sdk/bin/camera_calibration --config /udd/gclaudio/romeo/cpp/workspace/visp_naoqi/tools/calibration/default-circles.cfg --cam 0 --name cameraTopPepper --ip 131.254.10.126
</code></pre></div> </div>
</li>
</ul>Giovanni ClaudioIntrinsic camera calibration for Nao/Romeo/Pepper with Visp.Speech recognition using ROS and Pocketsphinx2016-03-28T00:00:00+00:002016-03-28T00:00:00+00:00http://jokla.me/robotics/speech-recognition-ros<p>How to install pocketsphinx on ROS Indigo (Ubuntu 14.04).</p>
<aside class="sidebar__right">
<nav class="toc">
<header><h4 class="nav__title"><i class="fa fa-file-text"></i> table of contents</h4></header>
<ul class="toc__menu" id="markdown-toc">
<li><a href="#install-dependecies" id="markdown-toc-install-dependecies">Install Dependencies</a></li>
<li><a href="#clone-and-build-pocketsphinx" id="markdown-toc-clone-and-build-pocketsphinx">Clone and Build Pocketsphinx</a></li>
<li><a href="#test-pocketsphinx" id="markdown-toc-test-pocketsphinx">Test Pocketsphinx</a></li>
<li><a href="#command-turtlebot-robot-in-gazebo-using-voice-commands" id="markdown-toc-command-turtlebot-robot-in-gazebo-using-voice-commands">Command Turtlebot robot in Gazebo using voice commands</a></li>
</ul>
</nav>
</aside>
<h2 id="install-dependecies">Install Dependencies</h2>
<ul>
<li><code class="language-plaintext highlighter-rouge">$ sudo apt-get install gstreamer0.10-pocketsphinx</code></li>
<li><code class="language-plaintext highlighter-rouge">$ sudo apt-get install python-gst0.10</code></li>
<li><code class="language-plaintext highlighter-rouge">$ sudo apt-get install gstreamer0.10-gconf</code></li>
</ul>
<h2 id="clone-and-build-pocketsphinx">Clone and Build Pocketsphinx</h2>
<ul>
<li>Clone the repository in your catkin src folder:
<ul>
<li><code class="language-plaintext highlighter-rouge">$ git clone https://github.com/mikeferguson/pocketsphinx.git</code></li>
</ul>
</li>
<li>Launch the catkin_make command from the catkin workspace folder</li>
</ul>
<h2 id="test-pocketsphinx">Test Pocketsphinx</h2>
<ul>
<li>Now we can start the speech recognizer:
<ul>
<li><code class="language-plaintext highlighter-rouge">$ roslaunch pocketsphinx turtlebot_voice_cmd.launch</code></li>
</ul>
</li>
<li>These are the basic commands that can be recognized (see the file voice_cmd.corpus in the demo folder):</li>
</ul>
<div class="language-text highlighter-rouge"><div class="highlight"><pre class="highlight"><code>forward
left
right
back
backward
stop
move forward
move right
move left
move back
move backward
halt
half speed
full speed
</code></pre></div></div>
<ul>
<li>Open another terminal to check if the words are recognized:</li>
<li><code class="language-plaintext highlighter-rouge">$ rostopic echo /recognizer/output</code></li>
</ul>
<div class="language-text highlighter-rouge"><div class="highlight"><pre class="highlight"><code>jokla@Dell-PC:~/catkin_ws/src$ rostopic echo /recognizer/output
data: back
---
data: speed
---
data: move right
---
data: move back
---
data: back
---
data: left
---
data: speed
---
data: left
---
data: move right
---
data: stop
---
data: move right
---
data: move left
---
data: full speed
</code></pre></div></div>
<ul>
<li>Each voice command corresponds to a twist command. For example, this is the twist corresponding to the command “back”:</li>
</ul>
<div class="language-shell highlighter-rouge"><div class="highlight"><pre class="highlight"><code>linear:
x: <span class="nt">-0</span>.4
y: 0.0
z: 0.0
angular:
x: 0.0
y: 0.0
  z: 0.0
</code></pre></div></div>
<h2 id="command-turtlebot-robot-in-gazebo-using-voice-commands">Command Turtlebot robot in Gazebo using voice commands</h2>
<ul>
<li>Install <a href="http://wiki.ros.org/turtlebot_simulator">turtlebot_simulator</a>
<ul>
<li><code class="language-plaintext highlighter-rouge">$ sudo apt-get install ros-indigo-turtlebot-simulator</code></li>
</ul>
</li>
<li>Open the launch file <code class="language-plaintext highlighter-rouge">turtlebot_voice_cmd</code> and remap the name of the topic cmd_vel to <code class="language-plaintext highlighter-rouge">/mobile_base/commands/velocity</code></li>
</ul>
<div class="language-xml highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nt"><node</span> <span class="na">name=</span><span class="s">"voice_cmd_vel"</span> <span class="na">pkg=</span><span class="s">"pocketsphinx"</span> <span class="na">type=</span><span class="s">"voice_cmd_vel.py"</span> <span class="na">output=</span><span class="s">"screen"</span><span class="nt">></span>
<span class="nt"><remap</span> <span class="na">from=</span><span class="s">"cmd_vel"</span> <span class="na">to=</span><span class="s">"/mobile_base/commands/velocity"</span><span class="nt">/></span>
<span class="nt"></node></span>
</code></pre></div></div>
<p>In this way the node pocketsphinx publishes the velocities in the topic <code class="language-plaintext highlighter-rouge">/mobile_base/commands/velocity</code> and gazebo subscribes to it.</p>
<ul>
<li>Launch simulation (click <a href="http://wiki.ros.org/turtlebot_gazebo/Tutorials/indigo/Gazebo%20Bringup%20Guide">here</a> for more info). Note that Gazebo may update its model database when it is started for the first time. This may take a few minutes.
<ul>
<li><code class="language-plaintext highlighter-rouge">$ roslaunch turtlebot_gazebo turtlebot_world.launch</code></li>
</ul>
</li>
<li>Now we can start the speech recognizer:
<ul>
<li><code class="language-plaintext highlighter-rouge">$ roslaunch pocketsphinx turtlebot_voice_cmd.launch</code></li>
</ul>
</li>
<li>You should be able to control Turtlebot using your voice.</li>
</ul>Giovanni ClaudioHow to install pocketsphinx on ROS Indigo (Ubuntu 14.04).Places to see in Brittany2014-12-29T00:00:00+00:002014-12-29T00:00:00+00:00http://jokla.me/travel/brittany-to-see<p>I have the fortune to live in one of the best regions of France: Brittany. During my weekend I love traveling around my current city, Rennes. Having said this, I want to share a map collecting the places that I want to see or I have already seen. I will try to improve this map adding comments and pictures.</p>
<iframe src="https://mapsengine.google.com/map/embed?mid=z_K4lDOSEk7c.kzlVNHXYev4g" width="640" height="480"></iframe>Giovanni ClaudioSome tips for holidays in Brittany (with map)Naoqi C++ SDK Installation2014-12-28T00:00:00+00:002014-12-28T00:00:00+00:00http://jokla.me/robotics/install-sdk-c-naoqi<aside class="sidebar__right">
<nav class="toc">
<header><h4 class="nav__title"><i class="fa fa-file-text"></i> table of contents</h4></header>
<ul class="toc__menu" id="markdown-toc">
<li><a href="#prerequisites" id="markdown-toc-prerequisites">Prerequisites</a> <ul>
<li><a href="#installation-ide" id="markdown-toc-installation-ide">Installation IDE</a></li>
<li><a href="#download-software" id="markdown-toc-download-software">Download software</a> <ul>
<li><a href="#c-sdk-and-cross-toolchain" id="markdown-toc-c-sdk-and-cross-toolchain">C++ SDK and Cross Toolchain</a></li>
</ul>
</li>
<li><a href="#creation-devtools-and-workspace-folders" id="markdown-toc-creation-devtools-and-workspace-folders">Creation Devtools and workspace folders</a> <ul>
<li><a href="#qibuild" id="markdown-toc-qibuild">Qibuild</a></li>
</ul>
</li>
</ul>
</li>
<li><a href="#using-qibuild-with-aldebaran-c-sdks" id="markdown-toc-using-qibuild-with-aldebaran-c-sdks">Using qibuild with Aldebaran C++ SDKs</a> <ul>
<li><a href="#optional-test" id="markdown-toc-optional-test">Optional Test:</a></li>
</ul>
</li>
</ul>
</nav>
</aside>
<h2 id="prerequisites">Prerequisites</h2>
<h3 id="installation-ide">Installation IDE</h3>
<p>QT Creator is the IDE recommended by SoftBank Robotics.</p>
<ul>
<li>Download the installer available <a href="http://qt-project.org/downloads#qt-creator">here</a>. In my case the file is named <code class="language-plaintext highlighter-rouge">qt-opensource-linux-x64-1.6.0-5-online.run</code>.</li>
<li>Go in the folder where you downloaded the installer of qt-creator and give execute permission with:<br />
<code class="language-plaintext highlighter-rouge">$ chmod a+x qt-opensource-linux-x64-1.6.0-5-online.run</code></li>
<li>Run the installer:<br />
<code class="language-plaintext highlighter-rouge">$ ./qt-opensource-linux-x64-1.6.0-5-online.run</code></li>
</ul>
<h3 id="download-software">Download software</h3>
<h4 id="c-sdk-and-cross-toolchain">C++ SDK and Cross Toolchain</h4>
<p>Download the following packages <a href="https://community.aldebaran-robotics.com/resources/">here</a> or <a href="https://developer.softbankrobotics.com/us-en/downloads/pepper">here</a> for Pepper:</p>
<ul>
<li>C++ SDK 2.3 Linux 64 (or newer version)</li>
</ul>
<h3 id="creation-devtools-and-workspace-folders">Creation Devtools and workspace folders</h3>
<ul>
<li>Let’s create now some folders useful for the development with the SDK:<br />
<code class="language-plaintext highlighter-rouge">$ mkdir -p ~/romeo/{devtools,workspace} </code></li>
</ul>
<p>NB: This is just a suggestion, you can manage these folders as you prefer.</p>
<ul>
<li>Now we can extract the C++ SDK and Cross Toolchain in the devtools folder. Go via terminal in the folder where you downloaded the tools and run:<br />
<code class="language-plaintext highlighter-rouge">$ tar -zxvf naoqi-sdk-2.3.0.14-linux64.tar.gz -C ~/romeo/devtools/</code></li>
</ul>
<h4 id="qibuild">Qibuild</h4>
<ul>
<li>Open a terminal and install Qibuild with <a href="https://pip.pypa.io/en/latest/installing.html#install-pip">pip</a>:
<code class="language-plaintext highlighter-rouge">$ pip install qibuild</code></li>
<li>If you don’t have pip installed you can install it with:
<code class="language-plaintext highlighter-rouge">$ sudo apt-get install python-pip</code></li>
<li>Now we add the installation location of Qibuild to the PATH. Open the bashrc file: <code class="language-plaintext highlighter-rouge">$ gedit ~/.bashrc</code> and at the end of the file add:<br />
<code class="language-plaintext highlighter-rouge">export PATH=${PATH}:${HOME}/.local/bin</code></li>
<li>Open a new terminal and check if Qibuild is correctly installed:<br />
<code class="language-plaintext highlighter-rouge">$ qibuild --version</code></li>
<li>Now we have to create a qibuild “worktree”. This path will be the root from which qibuild searches for the sources of your projects. We can use the folder we created before: <code class="language-plaintext highlighter-rouge">~/romeo/workspace</code>.<br />
<code class="language-plaintext highlighter-rouge">$ cd ~/romeo/workspace</code>
and type:<br />
<code class="language-plaintext highlighter-rouge">$ qibuild init</code></li>
<li>Now we can run:
<code class="language-plaintext highlighter-rouge">$ qibuild config --wizard</code> <br />
A file will be generated in ~/.config/qi/qibuild.xml; it is shared by all the worktrees you will create. You will be asked to choose a CMake generator (select Unix Makefiles) and an IDE (choose QtCreator, or another one if you use a different IDE).</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>:: Please choose a generator:
> 1 (Unix Makefiles)
:: Please choose an IDE
> 2 (QtCreator)
:: Do you want to use qtcreator from /usr/bin/qtcreator?
> Y (Yes)
:: Found a worktree in /udd/fspindle/soft/romeo/workspace_gantry
:: Do you want to configure settings for this worktree? (y/N)
> y
:: Do you want to use a unique build dir? (mandatory when using Eclipse) (y/N)
> N
</code></pre></div></div>
<ul>
<li>If you see a message like “CMake not found”, you probably have to install CMake:</li>
<li><code class="language-plaintext highlighter-rouge">sudo apt-get update && sudo apt-get install cmake </code></li>
<li>We can create, configure and build a new project called “foo”:
<code class="language-plaintext highlighter-rouge">$ qisrc create foo</code> <br />
New project initialized in /home/jokla/romeo/workspace/foo
<code class="language-plaintext highlighter-rouge">$ qibuild configure foo</code></li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Current build worktree: /home/jokla/romeo/workspace
Build type: Debug
* (1/1) Configuring foo
-- The C compiler identification is GNU 4.8.2
-- The CXX compiler identification is GNU 4.8.2
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Using qibuild 3.6.2
-- Binary: foo
-- Binary: test_foo
-- Configuring done
-- Generating done
-- Build files have been written to: /home/jokla/romeo/workspace/foo/build-sys-linux-x86_64
</code></pre></div></div>
<p><code class="language-plaintext highlighter-rouge">$ qibuild make foo</code></p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Current build worktree: /home/jokla/romeo/workspace
Build type: Debug
* (1/1) Building foo
Scanning dependencies of target foo
[ 50%] Building CXX object CMakeFiles/foo.dir/main.cpp.o
Linking CXX executable sdk/bin/foo
[ 50%] Built target foo
Scanning dependencies of target test_foo
[100%] Building CXX object CMakeFiles/test_foo.dir/test.cpp.o
Linking CXX executable sdk/bin/test_foo
[100%] Built target test_foo
</code></pre></div></div>
<ul>
<li>We can run the executable of the project “foo”:<br />
<code class="language-plaintext highlighter-rouge">$ ~/romeo/workspace/foo/build-sys-linux-x86_64/sdk/bin/foo</code><br />
You should see:<br />
<code class="language-plaintext highlighter-rouge">Hello, world</code></li>
</ul>
<p>References: <a href="https://community.aldebaran-robotics.com/doc/1-14/dev/cpp/tutos/using_qibuild.html#cpp-tutos-using-qibuild">link1</a>, <a href="https://community.aldebaran-robotics.com/doc/qibuild/beginner/qibuild/aldebaran.html">link2</a></p>
<h2 id="using-qibuild-with-aldebaran-c-sdks">Using qibuild with Aldebaran C++ SDKs</h2>
<ul>
<li>Now we need to create a toolchain (change with your path to the file toolchain.xml you want to use. You will find it in the naoqi-sdk folder):</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ qitoolchain create toolchain_romeo /home/jokla/romeo/devtools/naoqi-sdk-2.3.0.14-linux64/toolchain.xml --default
</code></pre></div></div>
<p>NB: Instead of <code class="language-plaintext highlighter-rouge">toolchain_romeo</code> you can choose any name you want. You can also create several different toolchains.</p>
<ul>
<li>If you have a new version of qibuild the procedure is slightly different:</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ qitoolchain create toolchain_romeo /local/soft/naoqi-sdk/naoqi-sdk-2.3.0.14-linux64/toolchain.xml
$ qibuild add-config toolchain_romeo -t toolchain_romeo --default
</code></pre></div></div>
<h3 id="optional-test">Optional Test:</h3>
<ul>
<li>Open a terminal and type:</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ cd ~/romeo/devtools/naoqi-sdk-2.3.0.14-linux64/doc/dev/cpp/examples
$ qibuild init --interactive
</code></pre></div></div>
<ul>
<li>Now we can configure and build the examples:</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ cd core/helloworld/
$ qibuild configure -c toolchain_romeo
$ qibuild make -c toolchain_romeo
</code></pre></div></div>
<ul>
<li>You can also build in release mode:</li>
</ul>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ qibuild configure --release <project_name>
$ qibuild make --release <project_name>
</code></pre></div></div>Giovanni ClaudioHow to install the SDK C++ for Pepper, Romeo or Nao.Matlab ROS Bridge2014-12-05T00:00:00+00:002014-12-05T00:00:00+00:00http://jokla.me/software/matlab_ros_bridge<p>This software consists of a set of Matlab C++ S-functions that can be used to:</p>
<ul>
<li>synchronize Simulink with the system clock, thus obtaining a soft real-time execution;</li>
<li>interface Simulink blocks with other ROS nodes using ROS messages.</li>
</ul>
<p>This project is based on a work started by Martin Riedel and Riccardo Spica at the Max Planck Institute for Biological Cybernetics in Tuebingen (Germany).
This fork is currently supported by <a href="mailto:riccardo.spica@irisa.fr">Riccardo Spica</a> and <a href="mailto:giovanni.claudio@irisa.fr">Giovanni Claudio</a> at Inria in Rennes (France).</p>
<p>The software is released under the BSD license. See the LICENSE file in this repository for more information.</p>Giovanni ClaudioA set of Matlab C++ S-functions that can be used to interface Simulink blocks with ROS.Romeo and ROS2014-10-29T00:00:00+00:002014-10-29T00:00:00+00:00http://jokla.me/robotics/ros-romeo<aside class="sidebar__right">
<nav class="toc">
<header><h4 class="nav__title"><i class="fa fa-file-text"></i> table of contents</h4></header>
<ul class="toc__menu" id="markdown-toc">
<li><a href="#list-of-packages" id="markdown-toc-list-of-packages">List of packages:</a> <ul>
<li><a href="#step-1-controller-romeo-and--joint-state-publisher" id="markdown-toc-step-1-controller-romeo-and--joint-state-publisher">Step 1: Controller Romeo and joint state publisher:</a></li>
<li><a href="#step-2-use-moveit" id="markdown-toc-step-2-use-moveit">Step 2: Use Moveit</a></li>
<li><a href="#calibration-camera" id="markdown-toc-calibration-camera">Calibration camera</a></li>
</ul>
</li>
</ul>
</nav>
</aside>
<h1 id="list-of-packages">List of packages:</h1>
<ul>
<li><a href="https://github.com/ros-aldebaran/romeo_robot">ros-aldebaran/romeo_robot</a></li>
<li><a href="https://github.com/ros-nao/nao_sensors">ros-nao/nao_sensors</a></li>
<li><a href="https://github.com/ros-aldebaran/romeo_moveit_config">ros-aldebaran/romeo_moveit_config</a></li>
</ul>
<h2 id="step-1-controller-romeo-and--joint-state-publisher">Step 1: Romeo controller and joint state publisher</h2>
<ul>
<li>Install the package <a href="http://wiki.ros.org/ros_control">ros_control</a>:
<code class="language-plaintext highlighter-rouge">sudo apt-get install ros-indigo-ros-control ros-indigo-ros-controllers</code></li>
<li>Set AL_DIR to the path of the NAOqi C++ SDK on your computer:
<ul>
<li><code class="language-plaintext highlighter-rouge">gedit ~/.bashrc</code></li>
<li>Add the following line setting the correct path to your naoqi-sdk-c++:
<code class="language-plaintext highlighter-rouge">export AL_DIR=/local/soft/naoqi/naoqi-sdk-2.1.0.19-linux64</code></li>
</ul>
</li>
<li>Clone the repository <a href="https://github.com/ros-aldebaran/romeo_robot">ros-aldebaran/romeo_robot</a> in your <code class="language-plaintext highlighter-rouge">catkin_ws</code></li>
<li>In a terminal, go to your <code class="language-plaintext highlighter-rouge">catkin_ws</code> and build the packages with <code class="language-plaintext highlighter-rouge">catkin_make</code></li>
<li>Source your workspace:
<code class="language-plaintext highlighter-rouge">source devel/setup.bash</code></li>
<li>Now you can run:
<code class="language-plaintext highlighter-rouge">roslaunch romeo_dcm_bringup romeo_dcm_bringup_remote.launch</code></li>
</ul>
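<p>The Step 1 instructions above can be condensed into a single shell session. This is a sketch, not a verified script: it assumes ROS Indigo, a catkin workspace at <code class="language-plaintext highlighter-rouge">~/catkin_ws</code>, and the example SDK path from this post; adjust all paths to your own install.</p>

```shell
# Install ros_control and the standard controllers (ROS Indigo)
sudo apt-get install ros-indigo-ros-control ros-indigo-ros-controllers

# Point AL_DIR at the NAOqi C++ SDK (example path; adjust to your install)
echo 'export AL_DIR=/local/soft/naoqi/naoqi-sdk-2.1.0.19-linux64' >> ~/.bashrc
source ~/.bashrc

# Clone, build, and source the workspace, then start the DCM bringup
cd ~/catkin_ws
git clone https://github.com/ros-aldebaran/romeo_robot src/romeo_robot
catkin_make
source devel/setup.bash
roslaunch romeo_dcm_bringup romeo_dcm_bringup_remote.launch
```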
<h2 id="step-2-use-moveit">Step 2: Use Moveit</h2>
<ul>
<li>Complete Step 1 first</li>
<li>Install Moveit in Indigo:
<ul>
<li>Open the Synaptic package manager and search for <code class="language-plaintext highlighter-rouge">moveit</code></li>
<li>Install the packages you find (I don’t know exactly which packages are required, so I installed all of them except those containing <code class="language-plaintext highlighter-rouge">config</code>)</li>
</ul>
</li>
<li>Clone the repository <a href="https://github.com/ros-aldebaran/romeo_moveit_config">ros-aldebaran/romeo_moveit_config</a> into your <code class="language-plaintext highlighter-rouge">catkin_ws</code></li>
<li>In a terminal, go to your <code class="language-plaintext highlighter-rouge">catkin_ws</code> and build the packages with <code class="language-plaintext highlighter-rouge">catkin_make</code></li>
<li>Source your workspace:
<code class="language-plaintext highlighter-rouge">source devel/setup.bash</code></li>
<li>Run the <code class="language-plaintext highlighter-rouge">dcm_bringup</code> launch file:
<code class="language-plaintext highlighter-rouge">roslaunch romeo_dcm_bringup romeo_dcm_bringup_remote.launch</code></li>
<li>Wait until <code class="language-plaintext highlighter-rouge">romeo_dcm_bringup</code> node is ready, then run MoveIt:
<code class="language-plaintext highlighter-rouge">roslaunch romeo_moveit_config moveit_planner_romeo.launch</code></li>
</ul>
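<p>Put together, the Step 2 session looks roughly like this. Again a sketch under the same assumptions as Step 1 (ROS Indigo, workspace at <code class="language-plaintext highlighter-rouge">~/catkin_ws</code>, MoveIt already installed via Synaptic); the two launch files must run in separate terminals, each with the workspace sourced.</p>

```shell
# Clone, build, and source the MoveIt config for Romeo
cd ~/catkin_ws
git clone https://github.com/ros-aldebaran/romeo_moveit_config src/romeo_moveit_config
catkin_make
source devel/setup.bash

# Terminal 1: start the DCM bringup and wait until it is ready
roslaunch romeo_dcm_bringup romeo_dcm_bringup_remote.launch

# Terminal 2 (source devel/setup.bash there too): start the MoveIt planner
roslaunch romeo_moveit_config moveit_planner_romeo.launch
```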
<h2 id="calibration-camera">Calibration camera</h2>
<p>Calibrate the camera:
<code class="language-plaintext highlighter-rouge">rosrun camera_calibration cameracalibrator.py --size 9x6 --square 0.025 --no-service-check image:=/nao_camera/image_raw camera:=/nao_camera</code></p>
<p>See the <a href="http://wiki.ros.org/camera_calibration/Tutorials/MonocularCalibration">monocular calibration tutorial</a> for details.</p>
<p>To convert the calibration file from INI to YAML:</p>
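<p>For example, with the <code class="language-plaintext highlighter-rouge">convert</code> tool shipped with camera_calibration_parsers (file names here are placeholders, not from this post):</p>

```shell
# Convert an INI calibration file produced by cameracalibrator.py to YAML
rosrun camera_calibration_parsers convert camera.ini camera.yaml
```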
<p>See <a href="http://wiki.ros.org/camera_calibration_parsers">camera_calibration_parsers</a>.</p>Giovanni ClaudioHow to control Romeo via ROS.