You’ve been through a lot of big data learning, but you’re still one step away from success, and that’s getting an Offer as a big data engineer.

I spent countless days and nights working by the computer, typing codes for countless times and correcting projects for countless times, just to get an Offer with high salary and high treatment that I am satisfied with. But this harvest not only requires you to learn skilled big data technology, but also need to prepare carefully before the interview, to understand the development of the enterprise you want to apply for, the technical requirements of their positions and so on. In addition, it is necessary to read some big data interview questions, to increase their experience.

Although xiaobian can not help you investigate the development of your ideal enterprise, but the big data often meet the questions have been ready for you, need as soon as possible income in the bag!

1. What are the features of the Scala language and what is functional programming? What are the advantages

2. What are scala associated objects for

3. How do you make concurrent programming in Scala and what are the advantages of your understanding of the Actor model

4. How does Spark process structured data and unstructured conversation data?

5. What are the main methods for Spark performance optimization?

6. What do you think are Spark’s strengths and weaknesses in the current situation of big data?

7. Have you conducted independent research and design on the algorithm?

8. Briefly describe some data mining algorithms and content you know

9. How do I use Spark to clean data

10. Tell me about spark’s app, in-store advertising, and scalper detection

11. How many partitions does Spark read data from? How many partitions does HDFS have in blocks?

12. Differences between Mogodb and hbase

13. Problems encountered during development

Optimization of the HIVE

15. Linux boot sequence

16. Do compiled Scala programs still need the Scala environment when running

17.Write a java program to implement Stack in java.

18. Differences between Linkedlist and ArrayList

19. Functions of Combiner in Hadoop

20. Design a grouping row weight counting algorithm with Mr

21. Use MapReduce to find two people who have public friends

22. HDFS storage mechanism

23. The principle of graphs

24. Hadoop operation principle

Hadoop namenode is down. What can I do

26.Hbase features and how do you design rowkey and columnFamily and how do you create a table

27. Differences between Redis, traditional database,hbase,hive (very detailed)

28. Talk about your understanding of Hadoop and what components it includes

29. Go into detail about your streaming real-time computing project deployment and the results collected

30. Real-time streaming computing framework, how many people, how long, details, including flume, Kafka,storm components, which part you are responsible for, if you need to build you can finish it?

Here I would like to recommend a big data official account: Big Data Development Learning Institute, which will share some videos recorded by senior big data engineers and architects: Linux, Hadoop core cluster building, HDFS, Mapreduce, YARN, off-line computing Flume, Hive, real-time computing, big data ETL, big data application and data mining principles have become the necessary knowledge system for large and data developers. I can also get free learning resources, which I benefit a lot from now