19 February 2014 Content identification: binary content fingerprinting versus binary content encoding
Author Affiliations +
In this work, we address the problem of content identification. We consider content identification as a special case of multiclass classification. The conventional approach towards identification is based on content fingerprinting where a short binary content description known as a fingerprint is extracted from the content. We propose an alternative solution based on elements of machine learning theory and digital communications. Similar to binary content fingerprinting, binary content representation is generated based on a set of trained binary classifiers. We consider several training/encoding strategies and demonstrate that the proposed system can achieve the upper theoretical performance limits of content identification. The experimental results were carried out both on a synthetic dataset with different parameters and the FAMOS dataset of microstructures from consumer packages.
© (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Sohrab Ferdowsi, Sohrab Ferdowsi, Svyatoslav Voloshynovskiy, Svyatoslav Voloshynovskiy, Dimche Kostadinov, Dimche Kostadinov, "Content identification: binary content fingerprinting versus binary content encoding", Proc. SPIE 9028, Media Watermarking, Security, and Forensics 2014, 90280P (19 February 2014); doi: 10.1117/12.2039482; https://doi.org/10.1117/12.2039482

Back to Top