(complex network analysis)
Cities, Scaling & Sustainability- Santa Fe Institute
Network and Society-MIT  Senseable City Lab
NICO
-Northwestern U
Barabasi Lab-Northeastern U
Christakis Lab- Harvard U Medical
ComplexSystems-U Michigan
Center for Social Complexity-George Mason University
COSI- Toulouse, France

(geog/visualization) 
ColorBrewer
   Brewer et al
The GeoVista Center -Penn State
GeoDA -  Univ. Illinois  
Prefuse - Java Toolkit
Flow Mapper - Tobler, UCSB
Many Eyes - IBM Visualization Tools
Simile - MIT CSAIL
GeoWeb- Data complied by MIT

(programming)
The Really Big Index - Java
Processing   open source java viz environment
T & T Society
- Open Source Loving
R Project - Statistical Computing
Eclipse
- IDE for Java (c, c++)
BasicProgramming.pdf -  Clio's Programming Lecture (El Lam-o)

(datamining)
WEKA
- Univ. Waikato Software
KDNuggets - Software, Data
RapidI - My Favorite Software
Spatial Data Mining - Univ. of Munich
mySVM - SVM: Univ. of Dortmund


Clio Andris

THIS PAGE REDIRECTS FROM MIT.EDU/CLIO/WWW


What we're thinking about over here in Building 9, 77 Mass Ave.

How do we integrate personal relationships into geography? What would happen if there was better transit to see your friends and family? Have you moved somewhere because someone you like already lived there or was moving there? Are you friends with someone when you are 'in town', but not otherwise? Should you move somewhere where everyone has contacts elsewhere, or has contacts nearby? Has your relationship with someone improved because where you live and work has become convenient for meeting up? How has this changed your life? How can we measure this? Should we measure this? Tell me what you think.

Read more [Technical Report : 'Informed Relationships'] (download pdf) (view as a google doc)



I am a Postdoctoral Researcher studying the convergence of Geospatial Networks and Social Complexity. My studies are a cross section of Geography and Computer Science.
 
  • Postdoctoral Associate: MIT Dept or Urban Studies and Planning (Urban Information Systems Group) mit uis
  • PhD 2011, Massachusetts Institute of Technology
  • Contact: clio@mit.edu or +1 202.630.0085

RESEARCH INTERESTS


RESEARCH OVERVIEW

In Plain English:
Every time you visit a family member, call to order a pizza, email a friend, walk to work, or take a vacation, you’re folding together places that don’t touch on a map. You’re sewing the deep patterns of work & play in the world.

Normally, when we measure distance, we measure in terms of miles, or minutes. For example, Boston is about 500 miles from Washington D.C., or, it takes 8 hours to drive. We call straight line distance (like on a map) "Euclidean Distance", because it connects two points usually at a diagonal. We call travel time (how long it takes to get somewhere) the "Cost Distance", where cost could be time, money or even wear-and-tear on a car.

These two measures are very handy for a lot of people, like airlines, or food distributors. But we now may need a third measure: With new data and math software coming at us from every direction, we have more information about how people communicate, on the telephone or email, or how they commute from home to work, and where they migrate. With this information, we can measure how much two places "talk" to each other, by summing the phone call minutes, or how much two places "trade people", by how many people have moved from one place or another in the past year.

High numbers show that two places are connected well, and low numbers show that they are not connected well. We could imagine that two neighboring towns have strong connections, and two distant towns have virtually none. But from our research, close distance can't always predict a strong connection. A Bostonian is more likely to know someone/or have visited San Francisco than San Antonio, Tx., even though they have the same population size, and San Antonio is closer to Boston. Now we can calculate "how much more likely" to get a feeling for the "social distance" between two places. This social distance can be important for measuring and predicting spreading of a lot of things: ideas, diseases, consumer behavior, voting preferences, or lifestyle choices like the adoption of a recycling program. I like this stuff a lot, and there's a lot more to say!  


In GIS/Mathy Language:
My Dissertation incorporates Complex Network Analysis (CNA), a method of Computational Social Science (CSS), into geographic models of flow data, which have been largely untouched since the 1970's. CNA is a robust new field that sprung from graph theory, and has been used to characterize and measure complex systems in physics, social networks and biology. I measure things like clustering coefficients, diameters, weighted/unweighted degree measures, betweenness centrality, modularity, and find cliques, bridges and cycles, in terms of cities or places as nodes, and the people & information flow between them as edges. Another area of interest is modelling spatio-temporal network propagation for predicting the location and intensity of a transmission in an emergency or for longitudinal prediction. Relating to quality of life, a network propagation model could point out hard-to-reach nodes which may help policy analysts target bottlenecks leading to neighborhood isolation, high travel times,and  poor access to professional, health and civic opportunities

                                                                              (map: phone calls around London)

From another perspective, social networks themselves can be coupled with spatial analysis tools for visual data mining and pattern recognition. Linking and Brushing techniques can HIGHLIGHT user-selected nodes (depending on a certain feature--all nodes in a module, nodes with high closeness centrality, female nodes, nodes with the most neighbors) ALONG WITH each selected node's corresponding location on a map. Of course, entities on the map could be selected to reveal participants in the social network. More dynamically auto-updated coupling views include regression plots and parallel coordinate plots (for showing trends and signals for nodal features--age, income, clustering coefficient, centrality, etc.), as already embedded in current open-source geographic environments. This system is important because most social network models only focus (if at all) on the straight line distance as the geographic separation between two people. This measure is not enough to show spatial clustering, autocorrelation, hot and cold spots and anisotropic spreading and diffusion patterns.

Characterizing the entire network of places and their connectivity measures as a system allows us to see dynamics from many neighbors-away, so we can model the causes and effects of growth or shrinkage to all entities connected to somewhere, not just the place itself.  These types of tools are inherently spatial, give us the benefit of a multi-scale analysis, sub-partitioning and fusing layers, and should be integrated with interdisciplinary GIS systems for the research community.

Layered Network Connectivity Modules can show different levels of Social & Economic Topological Inconsistency in Geographic Space.

 

EXAMPLE APPLICATIONS

  • figuring out which areas to target first for a malaria outbreak
  • ranking countries for admittance into the E.U. based on 500+ factors
  • settling a new land boundary in a war-torn nation
  • re-routing a subway to avoid dangerous gas pipe
  • appeasing political opposition in district gerrymandering
  • indexing people and regions most susceptible to economic downturn