Project Overview
Our project proposes to develop novel techniques for organizing and visualizing organizational links, using recently developed methods that are at the intersection of state-of-the-art research in text mining and social networks. The algorithms and software will allow users to quickly visualize the topical content of a document collection, and see connections between the documents' authors. With our work, users will be able to easily examine the scope of Calit2 research. Furthermore, our models will allow Calit2 researchers to find other Calit2 researchers with similar interests, and automatically be alerted to relevant fun opportunities. In the broader sense, extracting links and relationships from text data, and using this inferred information as the basis for interactive querying and visualization, is a concept that has broad applications in academia, government, and industry
It should be noted that all data and interfaces demonstrated are in "alpha" iteration and will be enhanced as the project continues.
Project Team Members:
Topic Modeling
The topic model uses statistical learning algorithms to
automatically discover the topics that describe a document
collection. A "topic" is a probability distribution over words,
with high probabilities assigned to words that are associated with
that topic. Given a set of topics, we can use the probability model
to automatically characterize the nature of a researcher's work
and find other researchers with similar interests.
Link to topic browser
Data Funding Visualization
The Calit2 project team has assembled comprehensive data on
research funding at UCSD from the public and private sector since
the initiation of the Institute. Using data visualization tools
developed by the University of Maryland for creating "tree maps"
the team has created a dynamic environment for exploring research
funding interactively.
Link to tree map applet