Abstract
Building a visual summary from an egocentric photostream captured by a lifelogging wearable camera is of high interest for different applications (e.g. memory reinforcement). In this paper, we propose a new summarization method based on keyframes selection that uses visual features extracted by means of a convolutional neural network. Our method applies an unsupervised clustering for dividing the photostreams into events, and finally extracts the most relevant keyframe for each event. We assess the results by applying a blind-taste test on a group of 20 people who assessed the quality of the summaries.
Original language | English |
---|---|
Title of host publication | 2015 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2015 |
Subtitle of host publication | Turin, Italy, June 29-July 3, 2015 |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Print) | 978-1-4799-7079-7 |
DOIs | |
Publication status | Published - 2015 |
Externally published | Yes |