Other articles


  1. 2014-09-19 daily

    Today Sophie Clayton and I hacked on Myria for SeaFlow once again. We found another few opportunities for language and usability improvements, and made little progress because of an issue introduced when fixing other bugs earlier this week.

    In the Myria research meeting, we had both Johannes Gehrke from Microsoft ...

    read more

    There are comments.

  2. 2014-09-16 daily

    I also did not get much time to do real work today. There were three major activities:

    1. UW Data Science Incubator applications are due Thursday! They have started rolling in, so I have started looking at them and have started a few clarifying discussions with some of the authors. Getting ...

    read more

    There are comments.

  3. 2014-09-15 daily

    Next week, I’ll see if the incrementalization actually helps us scale.

    Only had a tiny bit of time today; I worked more on the least common ancestor query. Here is what new work contributed to better scaling:

    • Incrementalizing the code (duh) did in fact let me scale it farther ...

    read more

    There are comments.

  4. 2014-09-11 daily

    Today I spent all day with Sandra Anderson’s citation graph lineage queries. Though I can compute “all-pairs reachability” for the first 10000 papers in the dataset… I can only currently compute “least-common ancestor” for the first 500 papers. There are some severe algorithmic scalability challenges here that we are ...

    read more

    There are comments.

  5. 2014-09-10 daily

    In between meetings, I spent most of today continuing yesterday’s work on the citation use case. Further query rewrites and testing exposed an interesting bug in the optimizer due to a mismatch between logical algebra representation and the actual system implementation behavior — the optimizer assumed the system could perform ...

    read more

    There are comments.

  6. 2014-09-09 daily

    Today I picked up some of the work that Sandra Anderson did in her summer internship, namely trying to find common citations (transitively) between pairs of papers in Jevin West‘s data sets.

    Once again I identified a number of nice optimization opportunities:

    • some query rewrites that result in better ...
    read more

    There are comments.

  7. 2014-08-20 daily

    Today I hacked more on the blog organization and layout; fighting with GitHub CNAMEs was harder than I expected it to be. Eventually I settled on creating a sub-project for the blog as hosting it in my personal dhalperi/dhalperi.github.io repository affected the URLs for other projects like ...

    read more

    There are comments.

blogroll

social