WordPress - highlyscalable.wordpress.com - Highly Scalable Blog
Latest News:
In-Stream Big Data Processing 20 Aug 2013 | 09:59 pm
The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the Big Data community quite a long time ago. It became clear that real-time query processing and in-stream p...
Distributed Algorithms in NoSQL Databases 18 Sep 2012 | 09:45 pm
Scalability is one of the main drivers of the NoSQL movement. As such, it encompasses distributed system coordination, failover, resource management and many other capabilities. It sounds like a big u...
Speeding Up Hadoop Builds Using Distributed Unit Tests 14 Aug 2012 | 11:46 pm
We recently worked with one of the Hadoop vendors on the continuous integration system for Hadoop core and other Hadoop-related projects like Pig, Hive, HBase. One of the challenges we faced was very ...
Fast Intersection of Sorted Lists Using SSE Instructions 5 Jun 2012 | 08:58 pm
Intersection of sorted lists is a cornerstone operation in many applications including search engines and databases because indexes are often implemented using different types of sorted structures. At...
Probabilistic Data Structures for Web Analytics and Data Mining 2 May 2012 | 02:11 am
Statistical analysis and mining of huge multi-terabyte data sets is a common task nowadays, especially in areas like web analytics and Internet advertising. Analysis of such large data sets often ...
Hierarchical Navigation and Faceted Search on Top of Oracle Coherence 3 Apr 2012 | 02:06 am
Some time ago I participated in the design of a backend for a large online retailer. From the business logic point of view, this was a pretty typical eCommerce service for hierarchical and facet...
NoSQL Data Modeling Techniques 2 Mar 2012 | 02:54 am
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because speci...