Latent Factor Models for Relational Arrays and Network Data

This video was recorded at the 24th Annual Conference on Neural Information Processing Systems (NIPS), Vancouver 2010.

Network and relational data structures have increasingly played a role in the understanding of complex biological, social and other relational systems. Statistical models of such systems can describe global relational features, characterize local network structure, and provide predictions for missing or future relational data. Latent variable models are a popular tool for describing network and relational patterns. Many of these models are based on well-known matrix decomposition methods, and thus have a rich mathematical framework upon which to build. Additionally, the parameters in these models are easy to interpret: roughly speaking, a latent variable model posits that the relationship between two nodes is a function of observed and unobserved (latent) characteristics, potentially in addition to contextual factors.

In this tutorial I give an introduction to latent variable models for relational and network data. I first provide a mathematical justification for a general latent factor model based on exchangeability considerations. I then describe and illustrate several latent variable models through the statistical analysis of several network datasets, and compare these models in terms of what network features they can, and cannot, represent. A particularly flexible class of models is the "latent factor" models, based on singular value and eigen-decompositions of a relational matrix. These models generalize in a natural way to more complicated relational data described by multiway arrays, such as a network measured over time or several relational variables measured on a common node set. I will close the tutorial by showing how tools from multiway data analysis (such as the higher-order SVD and PARAFAC decomposition) can be used to build statistical models of multiway networks and relational data.
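
As a rough illustration of the latent factor idea in the abstract (not taken from the tutorial itself), the sketch below approximates a small relational matrix by its leading singular triplet, so that each entry y_ij is modeled as d * u_i * v_j. The toy matrix Y, the function names leading_factor and rank_one_fit, and the choice of plain power iteration are all assumptions made for this example. A real analysis would use more factors and a proper statistical model with noise and covariates.

```python
import math

def leading_factor(Y, iters=200):
    """Power iteration for the leading singular triplet (d, u, v) of Y.

    Assumes Y is a non-empty list of equal-length rows and that the
    uniform starting vector is not orthogonal to the leading right
    singular vector (true for non-negative matrices such as this toy).
    """
    n_rows, n_cols = len(Y), len(Y[0])
    v = [1.0 / math.sqrt(n_cols)] * n_cols
    d, u = 0.0, [0.0] * n_rows
    for _ in range(iters):
        # u <- Y v, rescaled to unit length
        u = [sum(Y[i][j] * v[j] for j in range(n_cols)) for i in range(n_rows)]
        norm_u = math.sqrt(sum(x * x for x in u))
        u = [x / norm_u for x in u]
        # v <- Y^T u; its length converges to the leading singular value
        v = [sum(Y[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        d = math.sqrt(sum(x * x for x in v))
        v = [x / d for x in v]
    return d, u, v

def rank_one_fit(Y):
    """Rank-one approximation: entry (i, j) is d * u[i] * v[j]."""
    d, u, v = leading_factor(Y)
    return [[d * ui * vj for vj in v] for ui in u]

# Hypothetical toy sociomatrix: Y[i][j] = 1 if node i has a tie to node j.
Y = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
fit = rank_one_fit(Y)
```

Here u[i] and v[j] act as one-dimensional latent row and column characteristics of the nodes. Extending the sketch to several factors, or to a three-way array as in PARAFAC, follows the same pattern of multiplying node-specific factors together.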

Keywords: videolectures, ocwc, oec

Disciplines:

Science and Technology / Computer Science / Programming & Programming Languages

More about this material

Material Type: Presentation
Date Added to MERLOT: February 10, 2015
Date Modified in MERLOT: February 10, 2015
Author: Peter Hoff, Department of Statistics, University of Washington
Submitter: The Open Education Consortium
Primary Audience: College General Ed, College Lower Division, College Upper Division
Technical Format: Video

Mobile Compatibility: Not specified at this time
Language: English
Cost Involved: No
Source Code Available: No
Creative Commons: This work is licensed under an Attribution-NonCommercial-NoDerivs 3.0 United States license