architecture - How does realtime text analysis happen?


I have been reading about this topic but have not been able to find the right information. I have used services such as Twitter streaming, DataSift, and other data push services, and I want to understand how such services work given the following:

  1. The number of tweets received per second is high.
  2. A number of people connect and specify the keywords they want to receive tweets for; these can be complex boolean queries.
  3. Tweets are checked against these boolean queries in real time, and the matching data is streamed to the right connection.

Any pointers towards the kind of architecture being used would be helpful.

I think you first need to understand the magnitude and the resources of the services you are trying to understand, such as Twitter. Performance can be handled by taking care of both of the fronts mentioned below.

  1. Imagine a multi-tiered ecosystem of load balancers, multiple web servers, and large data centres with distributed caching.

  2. Irrespective of the complexity of the algorithm, performance can be managed by making the algorithms multi-threaded.
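
To make the second point concrete, here is a minimal sketch of how tweets could be checked against subscribers' boolean queries and the work spread over a thread pool. It is an illustration under assumptions, not how Twitter or DataSift actually do it: the `Router` class, the nested-tuple query format, and the whitespace tokenizer are all hypothetical. The idea is to index each query by its keywords so that a tweet is only evaluated against queries that mention one of its words, plus the few queries that can match with none of their keywords present (for example a bare NOT).

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical query format (lowercase keywords assumed):
# ("term", word), ("and", q1, q2, ...), ("or", q1, q2, ...), ("not", q)

def matches(query, words):
    """Evaluate a boolean query against the set of words in one tweet."""
    op = query[0]
    if op == "term":
        return query[1] in words
    if op == "and":
        return all(matches(q, words) for q in query[1:])
    if op == "or":
        return any(matches(q, words) for q in query[1:])
    if op == "not":
        return not matches(query[1], words)
    raise ValueError(f"unknown operator: {op}")

def terms(query):
    """Collect every keyword mentioned anywhere in a query."""
    if query[0] == "term":
        return {query[1]}
    return set().union(*(terms(q) for q in query[1:]))

class Router:
    def __init__(self):
        self.queries = {}              # subscriber id -> query
        self.index = defaultdict(set)  # keyword -> subscriber ids
        self.always = set()            # queries that match even with no keyword present

    def subscribe(self, sub_id, query):
        self.queries[sub_id] = query
        for term in terms(query):
            self.index[term].add(sub_id)
        # A query such as ("not", ("term", "spam")) matches tweets that
        # contain none of its keywords, so the index alone would miss it.
        if matches(query, set()):
            self.always.add(sub_id)

    def route(self, tweet):
        """Return the sorted ids of subscribers whose query matches the tweet."""
        words = set(tweet.lower().split())  # naive tokenizer, an assumption
        candidates = set(self.always)
        for word in words:
            candidates |= self.index.get(word, set())
        return sorted(s for s in candidates if matches(self.queries[s], words))

def route_many(router, tweets, workers=4):
    """Match a batch of tweets concurrently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(router.route, tweets))
```

For example, after `router.subscribe("alice", ("and", ("term", "python"), ("not", ("term", "snake"))))`, calling `router.route("I love Python")` includes `"alice"` while `router.route("python snake")` does not. Note that in CPython, threads do not speed up CPU-bound matching because of the global interpreter lock, so a real service would more likely partition the tweets or the subscriptions across processes or machines; the thread pool here only shows the shape of the idea.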

If you want to understand this further, try reading about MapReduce, Hadoop, etc. to get an idea.
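
As a rough illustration of the MapReduce programming model, the sketch below counts words across a batch of tweets in a single process. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for this example and are not part of the Hadoop API. In a real cluster the map and reduce steps would run on many machines over stored batches of data, and the framework would do the shuffle.

```python
from collections import defaultdict

def map_phase(tweet):
    """Map step: emit a (word, 1) pair for every word in one tweet."""
    for word in tweet.lower().split():
        yield word, 1

def shuffle(pairs):
    """Group the emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)

def word_count(tweets):
    """Run map, shuffle, and reduce over a batch of tweets."""
    pairs = (pair for tweet in tweets for pair in map_phase(tweet))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Because each tweet is mapped independently and each key is reduced independently, both phases can be split across workers without coordination, which is the property that makes the model scale.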

