In the field of OLAP data analysis, Count distinct is a very common requirement, and it can be divided into approximate and exact deduplication according...
Xiaomi is now more of a big data and artificial intelligence company than a phone company. With the rapid development of various businesses of Xiaomi,...
Copyright Notice: This set of technical column is the author (Qin Kaixin) usually work summary and sublimation, through extracting cases from real business environment to...
To calculate indicators, relying solely on early statistical software would be inefficient. And the accuracy goes down a lot, so Kylin was born. What are...
Since 2016, Kyligence has held many Apache Kylin certification training from time to time, which has been well received by students. Apache Kylin certification training...
Today, E-MapReduce provides a monthly package service (60% cheaper than on-demand). Users can customize software installation and configuration, create Hbase clusters, create clusters, and submit...
Open source products are quick to iterate, but also prone to pitfalls. Sometimes you encounter unexpected problems that need to be solved by studying the...
Apache Kylin is an open source distributed analysis engine that provides SQL query interfaces on Top of Hadoop/Spark and multidimensional analysis (OLAP) capabilities to support...
On October 26, byte beating special technique salon | large data architecture End of bytes to beat headquarters in Shanghai. We invited Guo Jun, head...
Dimension tables should not be Hive views, since views need to be materialized each time, resulting in additional time overhead. Ensure the mapping between dimension...
Tableau is the most widely used self-service visualization tool for big data analysis in OLAP field. This paper introduces how to use Kylin to improve...
ELKB refers to Elasticsearch, Logstash, Kibana, and Beats. Kylin logs are collected by FileBeat, distributed to Logstash for filtering, and finally written into ES. Using...
Apache Kylin v2.5.1 is now available! Welcome to download and use. Apache Kylin is an open source distributed analysis engine that provides SQL query interface...
Apache Kylin v2.4.1 is now available! Welcome to download and use. Apache Kylin is an open source distributed analysis engine that provides SQL query interface...
With the expansion and development of Lianjia's business lines, as well as the construction of data ecology, the scale of data grows rapidly. Since the...
With the arrival of big data wave, enterprises are trying to build big data analysis platform in order to conduct comprehensive, rapid and effective intelligent...
Abstract: In the first and middle chapters of Apache Kylin, we introduce the theoretical knowledge before Kylin learning, the birth of Kylin, and the problems...
Today with the mobile Internet, Internet of things, big data, AI, such as the rapid development of technology, data has already become the most important...
Sales business is characterized by large scale, multiple fields and dense demand. Meituan to store food and beverage giant sales system (hereinafter referred to as...