Workshop and Challenge on Learned Image Compression (CLIC)

Introduction

Our workshop aims to gather publications which will advance the field of image compression with and without neural networks. Even with the long history of signal-processing oriented compression, taking new approaches to image processing have great potential, due to the proliferation of high-resolution cell-phone images and special hardware (e.g., GPUs). The potential in this area has already been demonstrated using recurrent neural networks, convolutional neural networks, and adversarial learning, many of these matching the best image-compression standards when measured on perceptual metrics. As such, we are interested in the various techniques associated with this class of methods. Broadly speaking, we would like to encourage the development of novel encoder/decoder architectures, novel ways to control information flow between the encoder and the decoder, and learn how to quantize (or learn to quantize) better.

Important Dates

Date Description
December 22nd, 2017 Challenge announcement and the training part of the dataset released
January 15th, 2018 The validation part of the dataset released, online validation server is made available
April 15th, 2018 The test set is released
April 22nd, 2018 The competition closes and participants are expected to have submitted their decoder and compressed images
April 26th, 2018 Deadline for paper submission
May 29th, 2018 Release of paper reviews and challenge results

Speakers

Ramin Zabih

Google

Ramin Zabih is a research scientist at Google, on leave from his position as a professor of Computer Science at Cornell Tech in NYC. His research focuses on computer vision, primarily the development and application of discrete optimization techniques. He served as a program chair for the biggest vision conference (CVPR) in 2007, as a general chair for CVPR in 2013; in addition, he will be a general chair for CVPR again in 2020, and for the main European vision conference (ECCV) in 2018. He served as editor in chief of the IEEE Transactions on Pattern Analysis and Machine Intelligence from 2009-2012, and chaired the PAMI-TC from 2013-2015. He received a best paper award at ECCV 2002, and test of time awards at ECCV 2012 and ICCV 2013. In 2016 he was awarded the PAMI Mark Everingham prize for his service as founder and president of the Computer Vision Foundation (CVF). He is a fellow of the ACM and the IEEE.Read More

Oren Rippel

WaveOne

Oren Rippel is co-founder and the CTO of WaveOne, a startup working on compression using machine learning. He completed his Ph.D. at MIT with advisor Ryan Adams, and was a research fellow in the Harvard Intelligent Probabilistic Systems group. Oren previously also worked at Facebook AI Research.

Jim Bankoski

Google

Jim Bankoski is a Director at Google responsible for Google's video, audio, and image compression efforts. He's the former CTO of On2 Technologies and a technical contributor to all of On2 / Google's video codecs from Tm2x through VP9, including video codecs widely used in Flash, Skype and now WebM.

Jens Ohm

RWTH Aachen

Jens-Rainer Ohm holds the chair position of the Institute of Communication Engineering at RWTH Aachen University, Germany since 2000. His research and teaching activities cover the areas of motion-compensated and 3-D image/video processing and analysis, multimedia signal compression, transmission and content descr iption, audio signal analysis, as well as fundamental topics of signal processing and digital communication systems. Since 1998, he participates in the work of the Moving Picture Experts Group (MPEG). He has been chairing/co-chairing various standardization activities in video coding, namely the MPEG Video Subgroup since 2002, the Joint Video Team (JVT) of MPEG and ITU-T SG 16 VCEG from 2005 to 2009, and currently, the Joint Collaborative Team on Video Coding (JCT-VC), as well as the Joint Video Exploration Team (JVET). Prof. Ohm has authored textbooks on multimedia signal processing, analysis and coding, on communication engineering and signal transmission, as well as numerous papers in the fields mentioned above.Read More

Ebrahimi Touradj

EPFL

Prof. Touradj EBRAHIMI (EPFL) received his M.Sc. and Ph.D., both in Electrical Engineering, from the Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, in 1989 and 1992 respectively. In 1993, he was a research engineer at the Corporate Research Laboratories of Sony Corporation in Tokyo, where he conducted research on advanced video compression techniques for storage applications. In 1994, he served as a research consultant at AT&T Bell Laboratories working on very low bitrate video coding. He is currently Professor at EPFL heading its Multimedia Signal Processing Group. He was also adjunct Professor with the Center of Quantifiable Quality of Service at Norwegian University of Science and Technology (NTNU) from 2008 to 2012. He is currently the Convenor of JPEG (Joint Photographic Experts Group) standardization committee. Prof. Ebrahimi has been the recipient of various distinctions and awards, such as the IEEE and Swiss national ASE award, the SNF-PROFILE grant for advanced researchers, Four ISO-Certificates for key contributions to MPEG-4 and JPEG 2000, and the best paper award of IEEE Trans. on Consumer Electronics . He became a Fellow of the international society for optical engineering (SPIE) in 2003. Prof. Ebrahimi has initiated more than two-dozen National, European and International cooperation projects with leading companies and research institutes around the world. He is also the head of the Swiss delegation to MPEG, JPEG and SC29, and acts as the Chairman of Advisory Group on Management in SC29. He is a co-founder of Genista SA, a high-tech start-up company in the field of multimedia quality metrics. In 2002, he founded Emitall SA, start-up active in the area of media security and surveillance. In 2005, he founded EMITALL Surveillance SA, a start-up active in the field of privacy and protection. He is or has been associate Editor with various IEEE, SPIE, and EURASIP journals, such as IEEE Signal Processing Magazine, IEEE Trans. on Image Processing, IEEE Trans. on Multimedia, EURASIP Image Communication Journal, EURASIP Journal of Applied Signal Processing, SPIE Optical Engineering Magazine. Prof. Ebrahimi is a member of Scientific Advisory Board of various start-up and established companies in the general field of Information Technology. He has served as Scientific Expert and Evaluator for Research Funding Agencies such as those of European Commission, The Greek Ministry of Development, The Austrian National Foundation for Scientific Research, The Portuguese Science Foundation, as well as a number of Venture Capital Companies active in the field of Information Technologies and Communication Systems. His research interests include still, moving, and 3D image processing and coding, visual information security (rights protection, watermarking, authentication, data integrity, steganography), new media, and human computer interfaces (smart vision, brain computer interface).He is the author or the co-author of more than 200 research publications, and holds 14 patents. Prof. Ebrahimi is a member of IEEE, SPIE, ACM and IS&T.Read More

Workshop Schedule

Time Description
TBA Introduction of the dataset & challenges in creating and rating
TBA Invited Speaker (Ramin Zabih)
TBA Invited Speaker (Oren Rippel)
TBA Invited Speaker (Jim Bankoski)
TBA Break & Poster Session
TBA Invited Speaker (Jens-Rainer Ohm)
TBA Fastest Entry
TBA 3rd Place Entry
TBA 2nd Place Entry
TBA 1st Place Entry
TBA Panel Discussion

Sponsors