amazon web services - Allow more than one hadoop/EMR tasks to fail before shutting down -


i'm trying use hadoop on amazon elastic mapreduce have thousands of map tasks perform. i'm ok if small percentage of tasks fail, however, amazon shuts down job , lose of results when first mapper fails. there setting can use increase number of failed jobs allowed? thanks.

here's answer hadoop:

is there property define failed mapper threshold

to use setting described above in emr, at:

http://docs.aws.amazon.com/elasticmapreduce/latest/developerguide/emr-plan-bootstrap.html#predefinedbootstrapactions_configurehadoop

specifically, create xml file (config.xml in example) setting want change , apply bootstrap action:

./elastic-mapreduce --create \ --bootstrap-action s3://elasticmapreduce/bootstrap-actions/configure-hadoop \ --args "-m,s3://myawsbucket/config.xml"


Comments

Popular posts from this blog

c++ - CryptStringToBinary API behavior -

c++ - Correct method for redrawing a layered window -

java.util.scanner - How to read and add only numbers to array from a text file -