Beamtenh Errschaft

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Monday, 15 October 2012

New Book on the Frontiers of Multimedia Information Retrieval

Posted on 01:13 by Unknown
This is the new book of my colleague Horst Eidenberger. I collaborated with him in several workshops of our multimedia metadata group. It is on the frontiers of machine intelligence for multimedia understanding.


Frontiers of Media Understanding: The Common Methods of Audio Retrieval, Biosignal Processing, Content-Based Image Retrieval, Face Recognition, Music Classification, Speech Recognition, Text Retrieval and Video Surveillance


Author
Horst Eidenberger
Vienna University of Technology
http://www.ims.tuwien.ac.at/hme/


Abstract

Media understanding is the science/art of identifying semantic structures in digital media objects such as audio, biosignals, images, text and videos. This volume ends the work started in "Fundamental Media Understanding" and continued in "Professional Media Understanding" (atpress, 2011/12). It investigates the scientific frontiers of multimedia information retrieval. Soft frontier areas such as the influence of media theory and psychophysical research are considered as well as core topics such as semantic template matching, Kalman filtering, the limits of learning, dynamic aspects of categorization, human-like similarity perception and developing a neural view on the machine learning problem. In contrast to related publications, this book does not focus on one type of media but considers all the above-named as well as a few others. The author endeavors to identify similarities between the methods employed in audio retrieval, image understanding, text summarization and many other research domains. It turns out that a number of significant parallels do exist. Structuring the methods along common criteria and discussing their similarities and differences breaks the ground for a new research discipline: true computational understanding of multimedia content.

Link
http://www.amazon.com/Frontiers-Media-Understanding-Horst-Eidenberger/dp/3848210924/ref=sr_1_1?ie=UTF8&qid=1350041726&sr=8-1&keywords=horst+eidenberger


Table of Contents

1 Reflection of Professional Methods
1.1 Conclusions from Advanced Methods
1.2 Building Blocks of Categorization
1.3 Which Methods When?
1.4 Overview Over Scientific Frontiers

2 Media Philosophies
2.1 The Image in Philosophy
2.2 Media Theories
2.3 Semiotics
2.4 Media and Information

3 Perception and Psychophysics
3.1 Human Perception and Cognition
3.2 Perceptual and Cognitive Errors
3.3 Psychophysical Theory
3.4 Psychoacoustics and Psychophysics of Vision

4 Description by Templates
4.1 Convolution Everywhere
4.2 Templates for One-Dimensional Media
4.3 Static Visual Templates
4.4 Dynamic Template Adaptation Models

5 Semantic Descriptions and Applications
5.1 The Semantic Scale
5.2 Semantic Feature Transformations
5.3 Semantics in Audio, Biosignals and Text
5.4 Visual Semantic Applications

6 Convergent Filtering
6.1 Models of Convergence
6.2 Vector Quantization
6.3 The Kalman Filter
6.4 Associative Memories

7 Frontiers of Learning Machines
7.1 Analysis of Categorization Methods
7.2 Limits of Learning
7.3 Dynamical Systems Theory
7.4 Oscillating Classifiers

8 Human-Like Similarity Perception
8.1 Similarity as Measurement
8.2 Similarity as Counting
8.3 Dual Process Models
8.4 Similarity as Alignment and Transformation

9 Neural Media Understanding
9.1 Neural Foundations
9.2 Artificial Neural Networks
9.3 Neural Description and Filtering
9.4 Neural Networks for Categorization

10 Finale and Future
10.1 Summary
10.2 Essential Findings
10.3 Critical Review
10.4 Outlook: To Do List

Appendix A Mathematical Notation
Appendix B Similarity Models


Description of Chapters

Chapter 1 lists the major findings of the second part, names major potentials of the professional methods, develops a set of categorization building blocks, sketches best combinations of media understanding methods and provides an overview over the third part.

Chapter 2 discusses the relationship of perception and reality, theories of media content and media usage, the semiotic analysis of arbitrary symbol systems and potentials for merging of media theory, semiotics and information theory for the benefit of better media understanding.

Chapter 3 lists fundamental aspects of human perception, shows where perceptual and cognitive insufficiencies of the human brain lie, gives an introduction into the psychophysical model and discusses psychophysical aspects of hearing and vision.

Chapter 4 revisits the fundamental convolution problem, links it to human similarity measurement, lists templates for audio, biosignals and stock data, and introduces static and dynamic models for visual media representation.

Chapter 5 introduces the semantic scale, describes the usage of low-level descriptions for semantic enhancement and semantic applications in the audio and the visual domain.

Chapter 6 develops a model of convergence for iterative filtering processes, discusses learning vector quantization, the Kalman filter for scalar quantization and quantization by associative memories such as the Hopfield network and the Boltzmann machine.

Chapter 7 reviews communalities of categorization methods, presents a system of learning bounds, introduces fundamental methods of dynamical systems and applies these methods on dynamic classifiers.

Chapter 8 explains distance-based similarity, the improvements reached through the usage of predicate-based models, their integration in dual process models and the new perspectives gained from structural alignment and transformational similarity.

Chapter 9 analyzes the building blocks of human cognition, explains how these are imitated in artificial neural networks and discusses practical networks for description, filtering and categorization, including the spike response mode, radial basis function networks and cascade correlation.

Chapter 10 summarizes the findings of the book, emphasizes the most important points, estimates the practical applicability of some important ideas and sketches a vision of future media understanding research.
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in 2013, book, multimedia, multimedia semantics | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • 1st International Workshop on Multicloud Applications and Federated Clouds (Multi-Cloud 2013)
    Multi-Cloud 2013 1st International workshop on multicloud applications and federated clouds Co-located with ICPE 2013, 22 April 2013. Pragu...
  • Two Post-doc Positions at Politecnico di Milano on Model-driven design and QoS management for Cloud-based Applications
    Two Post-doc Positions at Politecnico di Milano Dipartimento di Elettronica e Informazione Dependable Evolvable Pervasive Software Engineeri...
  • WWW 2014 Workshop - Connecting Online & Offline Life (COOL2014)
    You are cordially invited to submit a contribution to "Connecting Online & Offline Life" (COOL2014), a one-day workshop of the...
  • Professor/Director, Serious Games Institute, Coventry University, UK
    Professor/Director, Serious Games Institute Application closing date     17/01/2014 Faculty / School or Service     Faculty of Engineering a...
  • PhD Fellowship in the area of Big Data Analytics, NTNU, Trondheim, Norway
    The Department of Computer and Information Science have opening 1 PhD scholarship in the area of BigData Analytics. The scholarship is for t...
  • Community and trust-aware fake media detection
    Today, the online first version of our journal paper  Community and trust-aware fake media detection  appeared: Khaled Ahmed Nagi Rashed , D...
  • Special Issue on Information Technology and Innovation - MIS Quarterly
    MIS Quarterly Call for Papers: Special Issue on Information Technology and Innovation Submission deadline: September 30, 2014 Guest Editors:...
  • Numerous Erasmus Mundus Joint Doctorate Fellowships in Business Intelligence and Big Data Analytics
    Numerous Erasmus Mundus Joint Doctorate Fellowships in Business Intelligence and Big Data Analytics ERASMUS MUNDUS JOINT DOCTORATE in INFORM...
  • ACM International Conference on Recommender Systems (RecSys 2014)
    CALL FOR CONTRIBUTIONS ACM International Conference on Recommender Systems (RecSys) 2014 Oct 6-10, 2014, Foster City, Silicon Valley, Califo...
  • Thematic issue on Playful Interactions and Serious Games - JAISE
    Call for papers Journal of Ambient Intelligence and Smart Environments: Thematic Issue on Playful Interactions and Serious Games  http://jai...

Categories

  • 2009
  • 2012
  • 2013
  • 2014
  • 3D web
  • ACIS
  • ACM
  • adaptation
  • AERCS
  • agent technology
  • algorithmic market design
  • ambient intelligence
  • annotation
  • annual report
  • article
  • artifical intelligence
  • ASONAM
  • assessment
  • assistant professor
  • associate professor
  • audio processing
  • augmented reality
  • autumn school
  • award
  • basic research
  • BASNA
  • behavior computing
  • best paper
  • big data
  • blog
  • book
  • bpms2
  • business intelligence
  • business management
  • business process management
  • CAISE
  • call for chapters
  • call for demos
  • call for panels
  • call for papers
  • call for participation
  • call for posters
  • call for sponsors
  • call for tutorials
  • call for workshops
  • ceur
  • challenge
  • CIKM
  • cloud computing
  • clustering
  • collaboration
  • collaborative filtering
  • collaborative modeling
  • collective intelligence
  • colloquium
  • communities of pratice
  • community
  • community analytics
  • community detection
  • community information systems
  • community learning analytics
  • complex networks
  • complexity
  • computational social science
  • computer science
  • conceptual modeling
  • conference
  • content-based multimedia indexing
  • context-aware computing
  • creativity
  • crisis management
  • CRIWG
  • cross-community mining
  • cross-media mining
  • crowdsourcing
  • cscl
  • cscw
  • cultural heritage management
  • cultural sciences
  • cyber-physical systems
  • dashboard
  • data integration
  • data management
  • data mining
  • data quality
  • data streams
  • database
  • deadline extension
  • deception
  • deep web
  • demo
  • developer camp
  • digital ecosystems
  • digital library
  • digital preservation
  • digital video
  • DireWolf
  • distributed user interfaces
  • doctoral consortium
  • doctoral research
  • dynamic network analysis
  • e-health
  • e-humanities
  • e-science
  • eatel
  • ec-tel
  • ectel
  • educational games
  • elsevier
  • embedded services
  • emergence
  • entertainment
  • entrepreneurship
  • eTwinning
  • eu
  • evidence
  • expert identification
  • Facebook
  • flyer
  • fp7
  • full professor
  • GALA
  • game analytics
  • game-based learning
  • games
  • gamification
  • geographic information systems
  • gigapixel
  • GPU
  • graph mining
  • honory doctorate
  • House of Quality
  • HTML5
  • human computer interaction
  • hypermedia
  • i*
  • i5cloud
  • ICALT
  • ICWL
  • ieee
  • ijtel
  • IKNOW
  • image analysis
  • IMS LD
  • influence
  • informal learning
  • information extraction
  • information integration
  • information retrieval
  • information systems
  • information visualization
  • innovation
  • intelligent systems
  • interactive services
  • interactive tv
  • interfaces
  • internet
  • Internet of Things
  • interoperability
  • job opening
  • journal
  • jtel
  • KDD
  • keynote
  • knowledge discovery
  • knowledge management
  • LAK
  • learning analytics
  • Learning Frontiers
  • learning layers
  • linked data
  • LNCS
  • location-based services
  • machine learning
  • master thesis
  • media fragments
  • mediabase
  • metadata
  • metamodelling
  • method engineering
  • METIS
  • MIS Quartely
  • mobile
  • mobile applications
  • mobile cloud computing
  • mobile data management
  • mobile learning
  • mobile media
  • mobile services
  • mobile social networks
  • mobile web information systems
  • mooc
  • moodle
  • mpeg-7
  • MTAP
  • multi-agent
  • multimedia
  • multimedia experience
  • multimedia indexing
  • multimedia processing
  • multimedia semantics
  • multimedia streaming
  • newsletter
  • non-linear storytelling
  • online first
  • open access
  • open innovation
  • open source
  • operational transformations
  • opinion mining
  • organizational learning
  • overlapping community detection
  • paper
  • paper.li
  • participatory design
  • peer-to-peer networks
  • personal learning environments
  • personalization
  • plagiarism
  • postdoc
  • poster
  • presentation
  • press release
  • privacy
  • proceedings
  • process design
  • program
  • prolearn
  • PROLEARN Academy
  • public displays
  • quality of experience
  • ranking
  • real-time
  • recommender systems
  • repositories
  • reputation
  • requirements bazaar
  • requirements engineering
  • research
  • restful
  • ROLE
  • RWTH Aachen University
  • science
  • screencast
  • security
  • self-directed learning
  • semantic computing
  • semantic video annotation
  • semantic web
  • sensor networks
  • sentiment analysis
  • serious games
  • service integration
  • service networks
  • service oriented architecture
  • SeViAnno
  • shared editing
  • SIGMOD
  • slideshare
  • smart cities
  • SNAM
  • social capital
  • social computing
  • social contagion
  • social data management
  • social multimedia
  • social network analysis
  • social networks
  • social networks modeling
  • social science
  • social search
  • social software
  • social web
  • software engineering
  • spatial data infrastructure
  • special issue
  • special track
  • Springer
  • stcsn
  • stream mining
  • summer school
  • table of contents
  • teaching
  • technology enhanced learning
  • TEL Roadmaps
  • TELLNET
  • TELMAP
  • text mining
  • TIST
  • toit
  • topic mining
  • transactions on cloud computing
  • transactions on learning technologies
  • trust
  • twitter
  • ubiquitous computing
  • ubiquitous media
  • UMIC
  • usability
  • user experience
  • user modeling
  • user-generated content
  • video
  • video browser
  • virtual campfire
  • virtual worlds
  • visual analytics
  • visualization
  • VLDB
  • web 2.0
  • web analytics
  • web archiving
  • web engineering
  • web information systems
  • web intelligence
  • web mining
  • web of things
  • web quality
  • web science
  • web search
  • Web services
  • widgets
  • wikipedia
  • winter school
  • wisdom of crowds
  • WISE
  • workflow management
  • workshop
  • WWW
  • WWWJ
  • XMPP
  • youtube

Blog Archive

  • ►  2013 (216)
    • ►  December (8)
    • ►  November (27)
    • ►  October (16)
    • ►  September (15)
    • ►  August (1)
    • ►  July (19)
    • ►  June (22)
    • ►  May (18)
    • ►  April (22)
    • ►  March (17)
    • ►  February (27)
    • ►  January (24)
  • ▼  2012 (284)
    • ►  December (37)
    • ►  November (34)
    • ▼  October (29)
      • Special Issue on Facebook as an Educational Tool -...
      • International Workshop on Big-Data Analytics for t...
      • Deadline Extension: The 7th FTRA International Con...
      • Deadline Extension: Learning Analytics & Knowledge...
      • Joint Call For Papers - Conferences / Journal Spec...
      • Special Issue on Multimedia Data Management in Mob...
      • ACM Web Science Conference (WebSci 2013)
      • Workshop Data Management in the Cloud (DMC 2013)
      • IEEE CollaborateCom 2012 Contributions from the AC...
      • The 4th International Workshop on Graph Data Manag...
      • VS-Games 2012 – GaLA Conference
      • Special Issue on Software Tools and Technologies f...
      • Joint Call For Papers - Conferences / Journal Spec...
      • New Book on the Frontiers of Multimedia Informatio...
      • Post-Doctoral Fellow Position at the MADMUC Lab, U...
      • 24th ACM Conference on Hypertext and Hypermedia 2013
      • 13th International Conference on Web Engineering
      • Special Issue on Intelligent Big Data Processing -...
      • PhD Openings in Requirements Engineering at Haroko...
      • Postdoc position in Information Visualization in M...
      • Track on Serious Games for Crisis Management at IS...
      • WWW 2013 - Call for Workshops
      • Final Tellnet event at Media & Learning 2012 Confe...
      • Faculty Positions at North Carolina State University
      • EC-TEL 2012 Contributions from the ACIS Group at R...
      • The 13th IEEE International Conference on Advanced...
      • Special Issue on The Role of Information Systems i...
      • Special Issue on Models and Protocols for Digital ...
      • 21st International Conference on User Modeling, Ad...
    • ►  September (38)
    • ►  August (30)
    • ►  July (17)
    • ►  June (39)
    • ►  May (47)
    • ►  April (13)
Powered by Blogger.

About Me

Unknown
View my complete profile