Deep Learning Lectures

class: center, middle

# Introduction au Deep Learning

## Réseaux de Neurones Convolutionnels

###Rémy Courdier (UR) avec des slides de Evann Courdier (EPFL)

Dernière mise à jour :

.affiliations[
  ![UR](images/init/logoUR.png)
  ![EPFL](images/init/logoEPFL.png)
]

---

Plan du cours

### En théorie...

- .grey[Présentation Générale Machine Learning et Deep Learning]
- .grey[Aspects theoriques du Machine learning et Deep Laerning]
- Introduction aux réseaux neuronaux convolutionnels

### En pratique...
.grey[
- TP : Classification d’images satellitaires
- TP : Object detection / counting 
]

---

## Pourquoi des réseaux neuronaux convolutionnels (CNN) ?

### Nombre de paramètres

Si on utilisait une image en entrée d'un réseau classique, il faudrait apprendre un nombre colossal de paramètres !

Par exemple, une couche d'un réseau qui prend en entrée une image RGB $256 \times 256$ image, et produit une image de même taille nécessite:

$$ (256 \times 256 \times 3)^2 \simeq 4e+10 $$

paramètres.

---

## Pourquoi des CNN ?

### Cohérence Spatiale

Les images possèdent une cohérence spatiale, et donc les systèmes qui utilisent des images doivent être 'invariant par translation'.

Un réseau convolutionnel l'est car il applique la même transformation linéaire à chaque endroit de l'image.

---

## Convolution 1d

.center[
<img src="images/init/conv1d_1.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_2.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_3.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_4.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_5.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_6.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_7.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_8.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 1d

.center[
<img src="images/init/conv1d_9.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---

## Convolution d'une image

- Image de dimensions $5 \times 5$
- Noyau de dimensions $3 \times 3$

.center[
 <img src="images/init/numerical_no_padding_no_strides.gif" style="width: 80%;" />
]

---

## Convolution d'une image (Noir & Blanc)

.center[
 <img src="images/init/conv2d_BW.gif" style="width: 100%;" />
]

---
## Convolution d'une image (Couleur)

.center[
 <img src="images/init/conv2d_rgb.gif" style="width: 100%;" />
]

---
## Convolution 2d

.center[
<img src="images/init/conv2d_1.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_2.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_3.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_4.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_5.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_6.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_7.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_8.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_9.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_10.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_11.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Convolution 2d

.center[
<img src="images/init/conv2d_12.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Convolution 2d

.center[
<img src="images/init/conv2d_13.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---

## Noyau de Convolution

- Ils sont appris (ils font partie des paramètres)

- Exemple de filtres "3D" (3 canaux RGB) appris:

.center[
 <img src="images/init/conv_2d_bank.png" style="width: 600px;" />
 ]

---

## Sous Echantillonnage (Pooling / Subsampling)

- Réduction de dimension spatiale

- Plusieurs type: average pooling, max pooling, ...

.center[
 <img src="images/init/pooling_1.png" style="width: 600px;" />
]

---

## Sous Echantillonnage (Pooling / Subsampling)

- Réduction de dimension spatiale

- Plusieurs type: average pooling, max pooling, ...

.center[
 <img src="images/init/pooling.png" style="width: 600px;" />
]

???

Conserve l'invariance par translation

---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_1.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_2.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_3.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_4.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_5.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_6.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 1d

.center[
<img src="images/init/maxpool1d_7.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_1.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_2.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_3.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_4.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_5.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_6.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]
---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_7.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_8.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_9.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_10.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_11.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_12.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_13.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_14.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Max-Pooling 2d

.center[
<img src="images/init/maxpool2d_15.png" style="width: 600px;" /> 
]

.credit[Slide credit: F. Fleuret]

---
## Couche convolutionnelle

Une couche de réseau convolutionnel est composée d'une convolution, d'une activation et d'un pooling.

.center[
<img src="images/init/conv_layers.png" style="width: 600px;" />
 ]

---

## Réseau convolutionnel

- Superposition de plusieurs couches convolutionnelles qui extraient les features
- Les couches plus profondes calculent des features plus globales, plus invariantes
- La dernière couche est une couche de classification (non convolutionnelle)

.center[
<img src="images/init/cnn.png" style="width: 90%;" /> 
]

---

## D'autres architectures: GoogleNet

.center[
<img src="images/init/googlenet.png" style="width: 100%;" /> 
]

---

## D'autres architectures: Resnet

.center[
<img src="images/init/resnet.png" style="height: 550px;" /> 
]

---

## Comparaison des modèles

.center[
<img src="images/init/comparison_deep.png" style="width: 110%;" /> 
]

---
## Concours de classification d'images

.center[
###"Deeper is better"

<img src="images/init/deeper.png" style="width: 500px;" /> 
]

.center[[Go to Ethique](ethique.html)]