Monthly Archives: October 2008

Experience with Infiniband 0

Anyone out there using Infiniband in production? I’m curious if this stuff is coming down in price when compared to 10gE. The switch costs for 10gE are just insane so I imagine Infiniband is in a similar situation. Another major advantage of Infiniband is the reduction in latency. Of course the Linux stack will almost [...]

Cornell Researchers Launch Memetracker Powered by Spinn3r 0

We have a number of other pending announcements of researchers building cool applications with Spinn3r but this one was just too awesome to hold back. Researchers at Cornell have developed a new memetracker (cleverly named MemeTracker) powered by Spinn3r. Jure Leskovec, Lars Backstrom and Jon Kleinberg (author of the HITS algorithm, among other things) built [...]

Spinn3r Sponsors 2009 International Conference for Weblogs and Social Data Challenge 1

Spinn3r is sponsoring the International Conference for Weblogs and Social Media this year with a snapshot of our index. The data set was designed for use by researchers to build cool and interesting applications with the data. Good research topics might include… link analysis social network extraction tracing the evolution of news blog search and [...]

Google, Bigtable, Compression, Zippy and BMDiff 9

A few months ago, when I was heads down finalizing the distributed database in Spinn3r, I was exceedingly curious about what other DBs are using for compression. GZip seems to be the obvious choice but its compression speed isn’t very good when compared to LZO. Your disks are almost certainly going to be bottlenecked on [...]

Linux and SSD @ 5k Random 4k Writes 0

Apparently, Linus has a new Intel drive he’s happy with: In contrast, the Intel SSD does about 8,500 4kB random writes per second. Yeah, that’s over eight thousand IOps on random write accesses with a relevant block size, rather than some silly and unrealistic contiguous write test. That’s what I call solid-state media. They are [...]

Apple Brick 0

If Apple ships this I certainly plan on buying one:

Google Showing Feedburner Redirect URLs in Search Results 2

Google is showing Feedburner redirect URLs in their search results. They’re using the link: http://feeds.latimes.com/~r/topoftheticket/~3/408704356/palin-couric.html which is a Feedburner redirect URL which they use in RSS feeds to help in tracking. Google owns Feedburner so it’s a bit embarrassing that they’re making such an obvious mistake. This might be distorting the stats for the LA [...]

MySQL InnoDB Isolated Patches for 5.0.37, 5.0.67 and Percona’s 5.0.68 Branch. 3

We’re migrating from MySQL 4.1.x to 5.0.x at work and one of the key features we need is the ability to freeze InnoDB and prevent it from writing to disk. We do this to aid in syncing masters and slaves and performing backups. Basically we freeze a master, copy the data to a new slave, [...]