Qhov twg yog qhov zoo dua los kawm spark lossis Hadoop?
Qhov twg yog qhov zoo dua los kawm spark lossis Hadoop?

Video: Qhov twg yog qhov zoo dua los kawm spark lossis Hadoop?

Video: Qhov twg yog qhov zoo dua los kawm spark lossis Hadoop?
Video: Data Science with Python! Analyzing File Types from Avro to Stata 2024, Kaum ib hlis
Anonim

Tsis yog, nws tsis yog yuav tsum tau kawm Hadoop ua ntej kawm Spark tab sis kev paub yooj yim ntawm Hadoop thiab HDFS yuav ntxiv qhov zoo rau koj txoj kev kawm Teeb . Teeb yog ib qho kev siv technology tshiab thiab yog kev lag luam buzz. Learning Teeb yuav muaj txiaj ntsig zoo rau koj txoj haujlwm Teeb cov kws tshaj lij tau nyiam dua hauv kev lag luam.

Kuj kom paub yog, qhov twg zoo dua Hadoop lossis spark?

Hadoop yog tsim los lis batch processingefficiently whereas Teeb yog tsim los tswj cov ntaub ntawv ntawm lub sijhawm tiag tiag. Hadoop yog ib tug high latency computingframework, uas tsis muaj kev sib tham sib hom whereas Teeb yog ib qho qis latency xam thiab tuaj yeem ua cov ntaub ntawv sib tham sib.

Ib sab saum toj no, puas zoo dua li MapReduce? Qhov tseem ceeb sib txawv ntawm MapReduce vs Apache Spark MapReduce yog nruj me ntsis disk-raws li Apache Teeb siv lub cim xeeb thiab tuaj yeem siv lub disk rau kev ua haujlwm. Teeb muaj peev xwm ua tiav cov haujlwm ua haujlwm ntawm 10 mus rau 100 zaus sai dua tshaj tus MapQhia Txawm hais tias ob lub thetools yog siv los ua cov ntaub ntawv loj.

Tom qab ntawd, lo lus nug yog, nws puas tsim nyog kawm Hadoop rau lub txim?

Tsis yog, koj tsis ua yuav tsum kawm Hadoop rau kawmSpark . Teeb yog ib qhov project ywj siab. Tab sis tom qab YARNand Hadoop 2.0, Teeb tau nrov vim Teeb tuaj yeem khiav saum HDFS nrog rau lwm yam Hadoop cov khoom. Hadoop yog lub moj khaum uas koj sauMapReduce txoj haujlwm los ntawm kev txais cov chav kawm Java.

Puas yog Apache spark tsim nyog kawm?

1) Kawm Apache Spark kom muaj Kev Nkag Mus Ntxiv rau Cov Ntaub Ntawv Loj Cov Ntaub Ntawv Cov kws tshawb fawb tau nthuav tawm kev txaus siab ua haujlwm nrog Teeb vim nws muaj peev xwm khaws cov ntaub ntawv nyob hauv lub cim xeeb uas pab ua kom lub tshuab ceev kev kawm workloads tsis zoo liHadoop MapReduce.

Pom zoo: