strings

A software development and technology blog by Deepak Kandepet

Hacking Lucene – The Index format

Lucene is a high-performance, scalable, full-featured, open-source text search engine written in Java. Since I am a search engineer by profession, I wanted to learn more about Lucene and its internals. This article covers the index format of Lucene 3.4, specifically the Lucene inverted index. Lucene Inverted Index Some Definitions Index: An Index is [Read the Rest...]

How big of a haystack do you need to hide?

Every fact you learn about a person reduces the “entropy” of their identity. For example, if I know your gender, I can eliminate about 50% of the population: there were about 155.6 million females and 151.4 million males in the United States in 2009. If I know your birthday, I can eliminate a much larger [Read the Rest...]
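
To make the entropy idea concrete, here is a minimal Python sketch of the arithmetic. It measures a fact in bits: the base-2 log of how much it shrinks the pool of candidates. The helper name `bits_revealed` is made up for illustration, and the birthday figure assumes birthdays are spread evenly over 365 days, which the excerpt does not claim and which is only roughly true.

```python
import math


def bits_revealed(matching: float, total: float) -> float:
    """Bits of identifying information gained from a fact that narrows
    `total` candidates down to `matching` candidates."""
    return math.log2(total / matching)


# 2009 U.S. figures quoted in the excerpt, in millions.
females = 155.6
males = 151.4
population = females + males

# Learning someone's gender removes about half the population,
# which is worth roughly one bit either way.
bits_female = bits_revealed(females, population)
bits_male = bits_revealed(males, population)

# Assumption (not from the excerpt): birthdays are uniform over 365 days,
# so a known birthday keeps only 1/365 of the candidates.
bits_birthday = math.log2(365)

# Bits needed to single out one person from the whole population.
bits_to_identify = math.log2(population * 1_000_000)

print(f"gender (female): {bits_female:.2f} bits")
print(f"gender (male):   {bits_male:.2f} bits")
print(f"birthday:        {bits_birthday:.2f} bits")
print(f"unique identity: {bits_to_identify:.2f} bits")
```

Bits from independent facts add up, so the size of the haystack you can hide in shrinks by half with every extra bit someone learns about you.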

God’s number is 20, mine is 67.

Without using magic or trickery, God would need at most 20 moves to solve any scrambled 3 x 3 x 3 Rubik’s cube. The best I can do is 67 moves. I got my first Rubik’s cube when I was 11 years old. After years of trying to solve it, my greatest accomplishment was solving [Read the Rest...]