Determining number of files

Question asked by chriscurtin on Aug 23, 2012
Latest reply on Aug 27, 2012 by Ted Dunning

I was asked recently how many files we are now storing in our cluster. NameNode-based Hadoop implementations show the total number of blocks and files in the web UI. I know MapR works differently (which is why we've moved to it), but I can't see where to find out how many files we have.

Running 'find . | wc -l' against the NFS mount isn't ideal :)
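For reference, the brute-force walk would look roughly like the sketch below. It is only an illustration of the approach I'd like to avoid, not a MapR feature: the function name count_files and the mount path in the usage line are placeholders I made up. One difference from the find one-liner is that it reports files and directories separately, whereas 'find .' counts both together.

```python
import os
import sys


def count_files(root):
    """Walk the tree under root and return (files, directories).

    'files' counts every non-directory entry; 'directories' counts every
    subdirectory (root itself is not counted). Symlinked directories are
    counted but not followed, which is the os.walk default.
    """
    files = 0
    dirs = 0
    for _dirpath, dirnames, filenames in os.walk(root):
        dirs += len(dirnames)
        files += len(filenames)
    return files, dirs


if __name__ == "__main__":
    # Hypothetical usage: python count_files.py /mapr/my.cluster.com
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    files, dirs = count_files(root)
    print(f"{files} files, {dirs} directories under {root}")
```

With tens of millions of files this still touches every directory over NFS, so it would be slow and put load on the cluster, which is why I'm hoping there is a better way.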

(The reason for the question: we had issues at around 10MM files with the Apache and Cloudera versions, and product and operations want to know where we are now and how things are performing. We haven't had to delete any data since going to MapR, so we're definitely above 10MM; I just don't know by how much.)