We have two open PhD positions:
Contact me for more information!
Update: this position is no longer available
There is increasing interest in applications enabling Free Navigation Video Services [1], where users can change their viewpoint on a scene while receiving a video. These services aim to provide the user with the so-called Plenoptic function of the scene [2], defined as:
P(x, y, z, θ, φ, λ, t)
It gives the light intensity at each position (x, y, z), for any incidence angle (θ, φ), for any wavelength λ, and at any time t. This doctoral project focuses on three key problems related to the use of the Plenoptic function: its acquisition, synthesis, and visualization.
Current acquisition tools cannot capture the whole Plenoptic function; rather, they provide a sampling of it. For example, in Super-MultiView video [3], the plane z = z0 is fixed, and only the forward scene is acquired, i.e. the polar angle θ is restricted to the interval (-π/2, π/2). Moreover, the plane is sampled only at the position of each camera.
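To make this sampling concrete, here is a minimal Python sketch of how a pixel of one camera in a horizontal rig maps to a sample of the Plenoptic function; the pinhole model and the parameters (focal length f, camera spacing d) are illustrative assumptions, not specifications of the project.

```python
import numpy as np

def pixel_to_plenoptic_sample(cam_index, u, v, f=1.0, d=0.1, z0=0.0):
    """Map pixel (u, v) of camera `cam_index` in a linear rig to the
    Plenoptic sample it records. Assumptions (illustrative only):
    identical pinhole cameras with focal length f, spaced d apart
    along the x axis on the plane z = z0, all looking towards +z;
    (u, v) are image-plane coordinates centred on the optical axis."""
    x = cam_index * d          # the rig samples the x axis at the camera positions
    y = 0.0                    # a single horizontal rig keeps y fixed
    theta = np.arctan2(u, f)   # azimuthal angle, always in (-pi/2, pi/2)
    phi = np.arctan2(v, f)     # elevation angle
    # Wavelength and time are sampled by the colour filters and the frame
    # clock; only the geometric coordinates are returned here.
    return (x, y, z0, theta, phi)

# The central pixel of camera 3 samples the ray leaving (0.3, 0, z0)
# along the optical axis (theta = phi = 0).
print(pixel_to_plenoptic_sample(3, u=0.0, v=0.0))
```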
In this project we are interested in the interpolation of the Plenoptic function, i.e. in the synthesis of virtual viewpoints that were not acquired by real cameras. We also want to explore the case of irregular sampling positions of P.
Access to the Plenoptic function would enable new ways of creating and consuming visual content. For example, the Fyuse application [4] allows the viewing angle to be changed during playback, while the Lytro system [5] allows post-acquisition refocusing.
Several scientific fields are involved in this approach: view synthesis, virtual cinema, visual attention and perception, and image quality and aesthetics. These items interact with one another: view synthesis is a prerequisite for virtual cinema and may benefit from visual attention and perception information; the whole process affects the quality and the aesthetics of the resulting image.
Image synthesis plays a key role in the system we want to implement. The problem can be seen as the interpolation of the Plenoptic function from a set of samples [12]. This reconstruction is based on the scene geometry and often uses post-processing to alleviate synthesis artifacts.
Image synthesis and rendering have long been studied by the Computer Vision and Compression communities, even outside the context of Plenoptic function interpolation. The first methods used only the images themselves: they belong to the Image-Based Rendering (IBR) family [13]. Disparity estimation and occlusion detection are typical tools for improving the synthesis in this case [14], and may prove useful in this doctoral project.
When depth information is also available, we have the Depth Image-Based Rendering (DIBR) family [15]. Even though DIBR has been known since the early 2000s, the quality of the synthesis is not yet fully satisfactory [16]. Nevertheless, some promising methods have been proposed recently [17]; they combine temporal and inter-view redundancy to improve the synthesis.
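To give the flavour of the DIBR principle, the sketch below performs forward warping in the simplified case of rectified horizontal cameras, where the 3D warp reduces to a horizontal shift by the disparity d = f·b/Z. The function and its parameters (focal length f, baseline b) are hypothetical, and real systems add hole filling, blending, and the temporal/inter-view combination mentioned above.

```python
import numpy as np

def dibr_horizontal_warp(src, depth, f, b):
    """Warp `src` (H x W x 3) to a virtual view displaced by baseline b,
    using a per-pixel depth map (H x W). Rectified horizontal setup
    assumed, so each pixel simply shifts by disparity = f * b / Z.
    Illustrative sketch: no hole filling or blending."""
    h, w = depth.shape
    out = np.zeros_like(src)
    zbuf = np.full((h, w), np.inf)   # keep the closest surface per target pixel
    disparity = f * b / np.maximum(depth, 1e-6)
    for y in range(h):
        for x in range(w):
            xt = int(round(x + disparity[y, x]))
            if 0 <= xt < w and depth[y, x] < zbuf[y, xt]:
                zbuf[y, xt] = depth[y, x]
                out[y, xt] = src[y, x]
    return out  # never-written pixels stay black: the disocclusion holes

# Toy usage: a constant-depth image warps by a uniform 1-pixel shift.
img = np.random.rand(4, 8, 3)
warped = dibr_horizontal_warp(img, depth=np.full((4, 8), 2.0), f=100.0, b=0.02)
```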
Another difficulty may come from camera positioning [18]. A preliminary calibration and synchronization phase is needed in order to obtain a high-quality synthesis [19][20][21]. To this end, feature-matching tools such as SIFT [22] and SURF [23] could be employed; they appear necessary to achieve 3D scene understanding [1][18].
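As an example of such a pipeline, the sketch below estimates the relative pose between two cameras from SIFT matches with OpenCV (assuming a cv2 build that includes SIFT, and a known intrinsic matrix K); it illustrates the standard recipe, not the specific calibration procedure of the project.

```python
import cv2
import numpy as np

def relative_pose(img1, img2, K):
    """Estimate the rotation R and translation direction t between two
    cameras from SIFT correspondences (grayscale images, intrinsics K)."""
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)

    # Lowe's ratio test on 2-nearest-neighbour matches.
    matcher = cv2.BFMatcher()
    good = [m for m, n in matcher.knnMatch(des1, des2, k=2)
            if m.distance < 0.75 * n.distance]

    pts1 = np.float32([kp1[m.queryIdx].pt for m in good])
    pts2 = np.float32([kp2[m.trainIdx].pt for m in good])

    # Essential matrix with RANSAC, then decomposition into (R, t).
    E, _ = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC,
                                prob=0.999, threshold=1.0)
    _, R, t, _ = cv2.recoverPose(E, pts1, pts2, K)
    return R, t
```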
This doctoral project will start with a deep and thorough study of the state of the art in the relevant domains: image synthesis, camera calibration, 3D geometry, feature matching, and visual attention. From a practical point of view, the PhD candidate may use the facilities at b<>com to test the acquisition of the Plenoptic function and to perform camera calibration and synchronization.
Then, the PhD candidate will implement and test different synthesis methods, starting from the state of the art and then proposing more complex and effective solutions. Human vision principles should be integrated into the new approaches.
At the same time, the impact of the synthesis methods on practical applications such as visualization, free navigation, virtual cinema, …, will be taken into account. The final goal of the doctoral project is the mastery of the complete system, from acquisition to visualization.
Rémi Cozot, Maître de Conférences, Habilité à Diriger des Recherches, IRT b<>com, IRISA/Université de Rennes 1 – cozot@irisa.fr
Marco Cagnazzo, Maître de Conférences, Habilité à Diriger des Recherches, IRT b<>com, Telecom-ParisTech/Institut Mines-Télécom – cagnazzo@telecom-paristech.fr
Airplane screens display very specific video content, in which text and graphics are superimposed on natural images or on a uniform background.
Compressing this kind of data requires adapted techniques, since the most important information (text, graphics) is usually degraded by traditional transform-based video compression.
We want to investigate the use of classification, segmentation, and inpainting to identify the most relevant information and encode it with appropriate methods.
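As a rough illustration of the classification step, the sketch below flags blocks as text/graphics when they contain only a few distinct colours, so they could be routed to a palette or lossless coder while the rest goes to a transform-based coder. The block size and threshold are made-up values, and the thesis would investigate far more robust tools.

```python
import numpy as np

def classify_blocks(frame, block=16, max_colors=8):
    """Per-block text/graphics detector for screen content (sketch).
    A block with few distinct colours is palette-like (text, graphics);
    a block with many colours is treated as natural image."""
    h, w = frame.shape[:2]
    mask = np.zeros((h // block, w // block), dtype=bool)
    for by in range(h // block):
        for bx in range(w // block):
            tile = frame[by*block:(by+1)*block, bx*block:(bx+1)*block]
            colors = np.unique(tile.reshape(-1, tile.shape[-1]), axis=0)
            mask[by, bx] = len(colors) <= max_colors  # True: text/graphics
    return mask
```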
The PhD student will work at both Telecom-ParisTech and Zodiac Aerospace.
APPLY HERE:
http://www.adum.fr/as/ed/voirproposition.pl?site=PSaclay&matricule_prop=9954
HEVC can be used to encode new video formats, such as 3D video, super-multiview video, or high dynamic range video.
A new PhD thesis on holoscopic video is starting. I will co-supervise Antoine Dricot on holoscopic video compression, together with co-directors Joël Jung, Béatrice Pesquet-Popescu, and Frédéric Dufaux.
A three-year contract leading to a PhD degree.
The topic is the interactive streaming of multiview video.
Multiview video is composed of several video sequences, each corresponding to a different point of view. Interactive access to this video requires switching from one view to another. This is problematic from the point of view of predictive coding: predicting an image from one belonging to another view is complex (all inter-view dependencies must be taken into account), while coding the views independently is inefficient. Possible solutions are based on distributed video coding.
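To see why inter-view dependencies make switching costly, the toy sketch below computes the set of frames that must be transmitted for the decoder to display a requested frame, given a prediction dependency graph; the three-view structure in the example is hypothetical.

```python
def frames_needed(deps, target):
    """Return every frame that must be sent to decode `target`, where
    deps[frame] lists the reference frames of `frame`. With all-intra
    coding the answer is just {target}; with inter-view prediction the
    ancestor set grows, which is exactly the switching problem."""
    needed, stack = set(), [target]
    while stack:
        f = stack.pop()
        if f not in needed:
            needed.add(f)
            stack.extend(deps.get(f, []))
    return needed

# Hypothetical 3-view structure: frame ('v2', t) references the previous
# frame of view 2 and the co-located frame of view 0 (inter-view prediction).
deps = {('v2', 3): [('v2', 2), ('v0', 3)],
        ('v2', 2): [('v2', 1), ('v0', 2)],
        ('v2', 1): [('v0', 1)],
        ('v0', 3): [('v0', 2)],
        ('v0', 2): [('v0', 1)],
        ('v0', 1): []}
print(frames_needed(deps, ('v2', 3)))  # switching to view 2 drags in all of view 0
```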
Links: Paper on IMVS + DVC.
See also papers by G. Cheung.