Matt Rodriguez


I am a lead software developer at Neustar in San Francisco. We have developed a new Web Performance Management (WPM) platform, a SaaS solution for monitoring and load testing websites. The product includes synthetic monitoring and real user monitoring.

Synthetic monitoring checks a website at regularly scheduled intervals. We use Selenium to automate a series of tests of our customers' websites. We provide a scripting interface and a browser plugin so that our customers can easily tailor their Selenium scripts to perform actions on their websites. After a script has run, we capture the performance information in a HAR file. Synthetic monitoring provides a repeatable experiment for testing the performance and availability of a website.

Real user measurement (RUM) measures the performance experienced by each user as they visit the website. Our customer puts a small snippet of JavaScript on their webpage. The JavaScript captures the navigation timings from the browser and sends the data back to the WPM platform.

I am the lead software engineer for the WPM Alerting Application. This application lets our customers configure how they want to be alerted if there is a problem with their website: by email, SMS, or a voice phone call. The application provides a collection of JavaScript APIs that allow them to script an Intelligent Alerting policy tailored to their needs.
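To give a rough idea of the kind of performance information a HAR file holds in the synthetic monitoring described above, here is a minimal Python sketch that summarizes one. It is not the WPM platform's code. The function name `summarize_har` and the sample data are made up for illustration; the field names (`log`, `pages`, `pageTimings.onLoad`, `entries`, `pageref`, `time`) follow the HAR 1.2 format.

```python
def summarize_har(har):
    """Return load time and request totals per page from a parsed HAR dict."""
    log = har["log"]
    summary = {}
    for page in log.get("pages", []):
        summary[page["id"]] = {
            # onLoad is the time in milliseconds until the page's load event
            "on_load_ms": page.get("pageTimings", {}).get("onLoad"),
            "requests": 0,
            "total_request_ms": 0.0,
        }
    for entry in log.get("entries", []):
        page = summary.get(entry.get("pageref"))
        if page is None:
            continue  # entry is not tied to a known page
        page["requests"] += 1
        page["total_request_ms"] += entry["time"]
    return summary


# Hypothetical, hand-written HAR data (not a real capture).
sample_har = {
    "log": {
        "version": "1.2",
        "pages": [{"id": "page_1", "pageTimings": {"onLoad": 1250}}],
        "entries": [
            {"pageref": "page_1", "time": 300,
             "request": {"url": "http://example.com/"}},
            {"pageref": "page_1", "time": 120,
             "request": {"url": "http://example.com/app.js"}},
        ],
    }
}

print(summarize_har(sample_har))
```

A real HAR file would be loaded with `json.load` before being passed to the function.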


I worked as a programmer analyst at SDSC in the geospatial group headed by Ilya Zaslavsky. The group provides production web services and GIS applications for CUAHSI. CUAHSI is a consortium of universities, based at the University of Texas, that is organizing hydrologic datasets to enable hydrologic science. I primarily did DBA work for them, using Microsoft's SQL Server and Analysis Services. I also worked as a software developer for the Ocean Observatories Initiative project. We are developing an AMQP messaging system to deliver large amounts of oceanographic data. On this project I get to work on Amazon's EC2 and use the distributed key-value store Cassandra. One implementation of the messaging system uses Twisted, a Python network application framework.

I've worked with Chesapeake Bay Environmental Organization developing web applications that allow them to analyze and visualize their data. The first web site allows the user to search through two datasets and filter by spatial and temporal constraints. The application runs a matching algorithm that matches data points from one data set to the other. It generates a scatterplot of the data and calculates the correlation coefficient. The web application is here.
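The matching and correlation steps can be sketched roughly as follows. This is a simplified illustration and not the application's actual algorithm: it assumes each data point is a `(time, value)` pair and pairs points by nearest time within a tolerance, ignoring the spatial constraint. The function names and sample values are hypothetical.

```python
import math


def match_by_time(a, b, tolerance):
    """Pair each (time, value) in a with the nearest-in-time point in b."""
    pairs = []
    for t_a, v_a in a:
        best = min(b, key=lambda p: abs(p[0] - t_a), default=None)
        # Keep the pair only if the nearest point is close enough in time.
        if best is not None and abs(best[0] - t_a) <= tolerance:
            pairs.append((v_a, best[1]))
    return pairs


def pearson(pairs):
    """Pearson correlation coefficient of a list of (x, y) pairs."""
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two matched pairs")
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / math.sqrt(var_x * var_y)


# Made-up sample data: (time, value) pairs from two data sets.
observed = [(0.0, 1.0), (1.0, 2.0), (2.0, 3.0), (10.0, 9.0)]
modeled = [(0.1, 2.1), (1.1, 4.0), (1.9, 6.2)]

pairs = match_by_time(observed, modeled, tolerance=0.5)
r = pearson(pairs)
print(len(pairs), round(r, 3))
```

The observation at time 10.0 has no modeled point within the tolerance, so it is dropped before the correlation is computed. The matched pairs are what a scatterplot would display.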

I made a SQL Server report which calculates the hypoxic volume and the percentage of the volume that is hypoxic. The report executes a stored procedure. The data set is a model data set which contains over 25 billion data points. I aggregated the data and built indexes to make the procedure execute in a tolerable amount of time. The report can be found here. The report requires a username and password to run.


I have a Master's in Computer Science and Engineering from UCSD. My academic interests focused on machine learning and data mining. My Master's thesis is focused on recommender systems. The problem typically reduces to estimating a matrix whose rows represent users and whose columns represent items. Most of the matrix elements are unknown. I am exploring different techniques for estimating the recommender system matrix. The latest algorithm is the PDLF algorithm by Deepak Agarwal and Srujana Merugu. Here's a link to a KDD '07 video presentation.
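As a concrete illustration of the matrix estimation problem, here is a minimal sketch of one common approach: a low-rank latent factor model fit by stochastic gradient descent on the known entries only. This is a generic technique chosen for illustration. It is not the PDLF algorithm and not the method from my thesis. The function names, hyperparameters, and ratings are all hypothetical.

```python
import math
import random


def factorize(ratings, n_users, n_items, rank=2, steps=2000,
              lr=0.01, reg=0.02, seed=0):
    """Fit user factors U and item factors V so that U[u] . V[i] ~ rating."""
    rng = random.Random(seed)
    U = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_users)]
    V = [[rng.uniform(0.0, 0.5) for _ in range(rank)] for _ in range(n_items)]
    for _ in range(steps):
        for u, i, r in ratings:  # only the known entries are visited
            err = r - sum(U[u][k] * V[i][k] for k in range(rank))
            for k in range(rank):
                u_k, v_k = U[u][k], V[i][k]
                # Gradient step on squared error with L2 regularization.
                U[u][k] += lr * (err * v_k - reg * u_k)
                V[i][k] += lr * (err * u_k - reg * v_k)
    return U, V


def predict(U, V, u, i):
    """Estimated matrix entry for user u and item i."""
    return sum(a * b for a, b in zip(U[u], V[i]))


def rmse(ratings, U, V):
    """Root mean squared error over the given (user, item, rating) triples."""
    total = sum((r - predict(U, V, u, i)) ** 2 for u, i, r in ratings)
    return math.sqrt(total / len(ratings))


# Made-up known entries: (user, item, rating). All other entries are unknown.
ratings = [(0, 0, 5), (0, 1, 3), (1, 0, 4), (1, 2, 1),
           (2, 1, 2), (2, 2, 5), (3, 0, 5), (3, 2, 1)]

U0, V0 = factorize(ratings, n_users=4, n_items=3, steps=0)  # untrained
U, V = factorize(ratings, n_users=4, n_items=3)
initial_rmse = rmse(ratings, U0, V0)
trained_rmse = rmse(ratings, U, V)
print(initial_rmse, trained_rmse)
print(predict(U, V, 1, 1))  # estimate for an entry that was never observed
```

Once the factors are fit, `predict` gives an estimate for any user and item pair, including the unknown entries, which is the quantity a recommender needs.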

My Master's thesis can be found here.


Previously I worked at LBL for the Berkeley Water Center. We took a variety of hydrology and carbon flux data sets, loaded them into an RDBMS, and built data cubes. This allowed us to present the hydrologic data to scientists who could perform further analysis on the data. Data cubes use the OLAP data model, which performs very well for queries that aggregate measurements across dimensions such as time and location.
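The kind of query a data cube answers well can be sketched in a few lines of Python. This is only an illustration of the roll-up idea, with made-up dimension names (`site`, `year`, `month`), a made-up measure (`flow`), and made-up values. It is not the schema we used, and a real OLAP engine precomputes and indexes these aggregates instead of scanning the facts each time.

```python
from collections import defaultdict


def roll_up(facts, dimensions, measure):
    """Sum `measure` over all facts, grouped by the given dimension names."""
    totals = defaultdict(float)
    for fact in facts:
        key = tuple(fact[d] for d in dimensions)
        totals[key] += fact[measure]
    return dict(totals)


# Hypothetical fact rows: one measurement per site and month.
facts = [
    {"site": "A", "year": 2005, "month": 1, "flow": 10.0},
    {"site": "A", "year": 2005, "month": 2, "flow": 12.0},
    {"site": "B", "year": 2005, "month": 1, "flow": 7.0},
    {"site": "B", "year": 2006, "month": 1, "flow": 9.0},
]

by_site = roll_up(facts, ["site"], "flow")
by_site_year = roll_up(facts, ["site", "year"], "flow")
print(by_site)
print(by_site_year)
```

Grouping by fewer dimensions rolls the data up to a coarser summary, and adding a dimension drills down into it.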


Presentation of the Gravity Recommender System: slides for Charles Elkan's 291 course in Spring 2008.

Presentation of PDLF paper: slides for Lawrence Saul's 291 course in Winter 2009.


Here's my Resume