
Recognizing emotions in photographs

This post is based on Àgata Lapedriza's talk at the Deep Learning Summit in London.

Àgata Lapedriza is a professor at the Open University of Catalonia and a visiting researcher at MIT. Her research focuses on recognizing emotions in photos and videos, a topic that immediately caught my interest since I did similar work on emotions during my internship (before joining Continuum).


Emotions

Research on recognizing emotions may seem unusual to some, but it isn't. Several applications would improve if emotions could be recognized accurately. Educational applications, for example, could suggest tips or exercises based on the user's emotion (e.g., enthusiasm or confusion). In the healthcare sector, such systems could recognize patients' emotions and trigger the appropriate actions.

Most methods attempt to accomplish this by analyzing facial expressions in photographs. Other studies focus on the shoulders or the posture of the body. Unfortunately, these methods often fall short. What emotion does the man in the photo on the right have? Recognizing emotions in other people is not always easy, even for us. But what if more information were available?

[Image: face]

The pictures above show how context can change the perceived emotion. All four faces are identical, but the extra information in the photo makes it easy for a person to recognize the emotion.

To determine this context, a dataset of pictures annotated with the type of place (e.g., a classroom) and various attributes was collected. Lapedriza also used a CNN (Convolutional Neural Network) to determine the place and attributes.
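To give an idea of what such a scene-classification step looks like in code, here is a minimal sketch. It uses a torchvision ResNet-18 with ImageNet weights as a stand-in for the actual Places CNN (those weights are not assumed here), and the file name classroom.jpg is purely hypothetical.

# Minimal sketch: classifying the scene/context of a photo with a pre-trained CNN.
# A torchvision ResNet-18 with ImageNet weights stands in for the Places CNN;
# this illustrates the pipeline, not the authors' exact setup.
import torch
from torchvision import models, transforms
from PIL import Image

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.eval()

image = Image.open("classroom.jpg").convert("RGB")  # hypothetical input photo
batch = preprocess(image).unsqueeze(0)              # shape: (1, 3, 224, 224)

with torch.no_grad():
    probs = torch.softmax(model(batch), dim=1)

top_prob, top_class = probs.topk(5)                 # five most likely categories
print(top_class.squeeze().tolist(), top_prob.squeeze().tolist())

Swapping in a network actually trained on a scene dataset would make the predicted categories place types (classroom, beach, office) instead of object classes.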

[Image: emotions]

Places

Armed with a method for determining emotions from facial expressions and with the Places CNN, they could begin to improve the existing systems. There was only one problem: labeled data.

[Image: demo]

To recognize the emotion of individuals in photographs using context, they needed annotated examples. They obtained them through the following steps:

1. Scrape photos from search engines and existing research datasets
2. Create a tool for annotating the photos
3. Use crowdsourcing to annotate as many photos as possible

More info on the Places Demo can be found here.

VAD Emotional State Model

[Images: screenshots of the annotation tool (test1, test2)]

The tool shown above lets the user choose between 26 emotions for the indicated person in the photo. The user can also provide additional metadata, based on the "VAD Emotional State Model". This model describes emotions through three numeric metrics: Valence (how pleasant the situation is), Arousal (how excited the person is) and Dominance (whether the person is in control or not). These three metrics provide more information about the emotions. Gender and age range are also tracked. With all this information, the EMOTIC dataset was built: 23,571 annotated photos showing 34,320 people.
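To make the shape of such an annotation concrete, here is a minimal sketch of what one annotated person could look like as a data structure. The field names, the example category labels and the numeric values are assumptions for illustration; the dataset's documentation defines the exact schema.

# Minimal sketch of a single EMOTIC-style annotation record (assumed field names).
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class PersonAnnotation:
    bbox: Tuple[int, int, int, int]   # (x1, y1, x2, y2) of the annotated person
    categories: List[str]             # subset of the 26 discrete emotion labels
    valence: float                    # how pleasant the situation is
    arousal: float                    # how excited the person is
    dominance: float                  # how much in control the person is
    gender: str                       # e.g. "male" / "female"
    age_range: str                    # e.g. "adult", "teenager"

example = PersonAnnotation(
    bbox=(120, 40, 310, 460),
    categories=["Engagement", "Anticipation"],
    valence=7.0, arousal=5.0, dominance=6.0,
    gender="female", age_range="adult",
)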

This dataset is available for free download.

The next step was to combine the Places and EMOTIC datasets to predict both the emotion categories and the VAD metrics.

[Image: architecture]

This system uses two CNNs: one pre-trained on the ImageNet dataset (a collection of photos of objects) and a second pre-trained on their own Places dataset. The first CNN takes as input a smaller crop of the photo showing the person whose emotion is to be determined. The second CNN gets the entire photo, so it can detect features of the context. The outputs of these CNNs are then combined to take information from both branches and make a prediction.
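The sketch below shows the two-branch idea in code: one CNN for the cropped person, one for the whole image, with their features concatenated and fed to two small heads that predict 26 emotion scores and the 3 VAD values. The backbones, layer sizes and pre-trained weights used here are assumptions, not the authors' exact architecture.

# Two-branch model sketch: body crop + full image, fused for prediction.
import torch
import torch.nn as nn
from torchvision import models

class EmotionInContextNet(nn.Module):
    def __init__(self, num_categories: int = 26):
        super().__init__()
        # Body branch: stands in for the ImageNet-pretrained CNN.
        body = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
        self.body_features = nn.Sequential(*list(body.children())[:-1])
        # Context branch: stands in for the Places-pretrained CNN
        # (ImageNet weights used here only because Places weights are not bundled).
        context = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
        self.context_features = nn.Sequential(*list(context.children())[:-1])
        fused_dim = 512 + 512
        self.category_head = nn.Linear(fused_dim, num_categories)  # 26 discrete emotions
        self.vad_head = nn.Linear(fused_dim, 3)                    # valence, arousal, dominance

    def forward(self, body_crop: torch.Tensor, full_image: torch.Tensor):
        b = self.body_features(body_crop).flatten(1)       # (N, 512)
        c = self.context_features(full_image).flatten(1)   # (N, 512)
        fused = torch.cat([b, c], dim=1)                    # combine both branches
        return self.category_head(fused), self.vad_head(fused)

# Example forward pass with random tensors shaped like preprocessed images.
model = EmotionInContextNet()
categories, vad = model(torch.randn(2, 3, 224, 224), torch.randn(2, 3, 224, 224))
print(categories.shape, vad.shape)  # torch.Size([2, 26]) torch.Size([2, 3])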

The figure below shows that, for all emotions except "Esteem", the combination (B + I) of the person network (B) and the context network (I) scores better than either network on its own.

[Image: results]
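As a rough idea of how such a per-category comparison could be computed, here is a small sketch. Average precision is used as an assumed metric (the post does not name one), and the predictions are random placeholders, not real results.

# Sketch: comparing body-only (B), context-only (I) and combined (B + I) scores.
import numpy as np
from sklearn.metrics import average_precision_score

rng = np.random.default_rng(0)
n_samples, n_categories = 200, 26
y_true = rng.integers(0, 2, size=(n_samples, n_categories))   # multi-label ground truth

predictions = {
    "B (person only)": rng.random((n_samples, n_categories)),
    "I (context only)": rng.random((n_samples, n_categories)),
    "B + I (combined)": rng.random((n_samples, n_categories)),
}

for name, scores in predictions.items():
    per_category_ap = [average_precision_score(y_true[:, k], scores[:, k])
                       for k in range(n_categories)]
    print(f"{name}: mean AP = {np.mean(per_category_ap):.3f}")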

After the presentation, the big question was of course, "What about video?" And yes, Lapedriza and her team are already working on this!

Also be sure to read the original paper; it is easy to understand and very interesting. Do you want to know more about machine learning or AI in general? Then keep an eye on our website for our events, or join our tribe!