Category Archives: Hive

DataDotz BigData Weekly


Using Apache Spark for large-scale language model training

Facebook has written about its experience converting its n-gram language model training pipeline from Apache Hive to Apache Spark. The post describes the Hive-based solution, the Spark-based solution, and the scalability challenges…

Moving from Hive 0.12 to Hive 0.14: Installation & ACID

YARN has allowed new engines to emerge on Hadoop. The most popular integration point with Hadoop has always been SQL. There are various SQL-on-Hadoop options, but Apache Hive is still the de facto standard.

Last month, the Apache Hive community released Apache Hive 0.14, which includes the results of the first phase of the initiative.
