Boost your search with semantic technology

29
EBU Production Technology Seminar 2011 Karel Braeckman Vrt-medialab 1 Vrt-Medialab

description

MediaLoep combines documents readily available within the broadcasting company (subtitles, news preparation, ...) with semantic web technology to create a powerfull media search application. Presented at EBU Production Technology Seminar 2011

Transcript of Boost your search with semantic technology

Page 1: Boost your search with semantic technology

EBU Production Technology Seminar 2011

Karel BraeckmanVrt-medialab

1Vrt-Medialab

Page 2: Boost your search with semantic technology

VRT is the Flemish Public Broadcaster

3 TV-channels, 5 radio channels

VRT-medialab is the research department

creation, distribution and management of media content

2Vrt-Medialab

Page 3: Boost your search with semantic technology

Lots of audio and video material illustrating our cultural heritage.

Also includes new material (news clips, …)

Used by programme-researchers & journalists

3VRT-medialab

Page 4: Boost your search with semantic technology

Vrt-Medialab 4

The problem of media search MediaLoep project

Re-using production metadata Linking to the semantic web

Page 5: Boost your search with semantic technology

Vrt-Medialab 5

The problem of media search MediaLoep project

Re-using production metadata Linking to the semantic web

Page 6: Boost your search with semantic technology

Not self-descriptive → we need metadata

Video / Audio are continuous media with a time-dimension

Series: FlikkenKeywords: violence, robberyDescription: Robbery on shop. Attacker hits shop owner with gun.

6VRT-medialab

Page 7: Boost your search with semantic technology

Not self-descriptive Video / Audio are continuous media with a

time-dimension → we prefer time-coded metadata

00’00”>01’43”Robbery on shop

01’43”>04’20”Police agent looks worried

35’00”>36’33”Observation by police

7VRT-medialab

Page 8: Boost your search with semantic technology

35’00”>36’33”Observation by police

8VRT-medialab

Page 9: Boost your search with semantic technology

Basis9Vrt-Medialab

Page 10: Boost your search with semantic technology

Ardome10Vrt-Medialab

Page 11: Boost your search with semantic technology

Not enough detailed annotations available

◦ “X spits on the ground after Y makes a goal”◦ The entire dialogue so we can search for quotes◦ Labels, locations, links, maps, photographs, …

as the creation of these annotations is very time consuming.

Vrt-Medialab 11

Page 12: Boost your search with semantic technology

Vrt-Medialab 12

The problem of media search MediaLoep project

Re-using production metadata Linking to the semantic web

Page 13: Boost your search with semantic technology

Vrt-Medialab 13

Page 14: Boost your search with semantic technology

Vrt-Medialab 14

The problem of media search MediaLoep project

Re-using production metadata Linking to the semantic web

Page 15: Boost your search with semantic technology

News Rundown with auto-cue texts, overlay labels, …

EPG data contains a summary of the programme, the broadcast dates, …

A drama script contains dialogues and actions, …

Subtitles ~ transcript of spoken text

15Vrt-Medialab

Page 16: Boost your search with semantic technology

Vrt-Medialab 16

Information added by an archivistkeywords

textual description

other fields

Page 17: Boost your search with semantic technology

Vrt-Medialab 17

Information added by the news preparation:

overlay captions

autocue text

links to other items inthis news broadcast

Page 18: Boost your search with semantic technology

Vrt-Medialab 18

Information added by the subtitles:

time-coded transcriptof the dialogue

Page 19: Boost your search with semantic technology

Vrt-Medialab 19

Page 20: Boost your search with semantic technology

Vrt-Medialab 20

The problem of media search MediaLoep project

Re-using production metadata Linking to the semantic web

Page 21: Boost your search with semantic technology

Archivists add thesaurus keywords to clips

By linking these keywords to a thesaurus, we can make the search system smarter

Vrt-Medialab 21

GenevaObama, BarackEurope…

Geneva → coordinates on a map?Obama, Barack → a picture?…

Page 22: Boost your search with semantic technology

Vrt-Medialab 22

Geneva country Switzerland

15.86 km2area

Public knowledge bases provide information about resources using ‘triples’.

Page 23: Boost your search with semantic technology

Vrt-Medialab 23

Geneva country Switzerland

15.86 km2area

Geneva

latitude 46° 12' 0" N

sameAs

Links to the same resource in other knowledge bases can be created.

GeoNames

Page 24: Boost your search with semantic technology

Vrt-Medialab 24

A network of linked knowledge is created.

Page 25: Boost your search with semantic technology

Vrt-Medialab 25

We linked MediaLoep to DBpedia, which is in turn linked to many other knowledge bases.

MediaLoep

Page 26: Boost your search with semantic technology

Vrt-Medialab 26

AALMOEZENIERAALST

AALTERDE WEVER, BART

GENEVA…

VRT-Thesaurus DBpedia / Wikipedia MediaLoep

Page 27: Boost your search with semantic technology

Vrt-Medialab 27

Information added by the semantic web:

Page 28: Boost your search with semantic technology

Vrt-Medialab 28

Page 29: Boost your search with semantic technology

Improved search by combining existing information.

Enhanced results visualization and semantic query suggestions by coupling to the semantic web.

Vrt-Medialab 29