Abstract:

Dynamic description of Aliyun e-MapReduce

The following is the information about the upcoming release of e-MapReduce:

Version 1.5.2

  • Add predefined configurations, such as trial/entry/compute/memory, etc
  • Add annual and monthly automatic renewal function

1.6.0 version

  • Interactive query (Support Hive and Spark)

information

  • CIO era Institute president Yao Le: big data industry application strategy

    About the development of big data, CIO Era Institute president Yao Le believes that there are three trends worthy of our attention: first, the authentication of data resources; Second, deep integration with cloud computing; Third, deep integration with artificial intelligence.Copy the code
  • Privacy and big Data behind the US election

    The U.S. presidential election has always been an activity focusing on public participation. To understand the needs of the public and satisfy their preferences is the basis for winning the White House. Today's candidates have long recognised that data technology is the way to go.Copy the code
  • Data Realising Unicorns -10 Business Models

    Data has become an important factor of production and a transformative force in every industry and various business functions. The accumulation, cooperation, sorting, mining and utilization of data are the basic qualities necessary for modern enterprises. Without it, your enterprises will be unable to face the competition in the era of big data. This article summarizes and shares 10 business models for many of the problems that are troubling business decision makers.Copy the code
  • After an enterprise deploys cloud computing, what’s next?

    The Mobile Information Research Center predicts that domestic companies will continue to increase their budgets after they realize expected profits from cloud computing.Copy the code
  • Understand the nature of distributed systems: high throughput, high availability and scalability

    Distributed system is almost the most basic method to solve the problem of Internet business carrying capacity, so as a server programmer, it becomes extremely important to master distributed system technology. This paper explains how to build a distributed system with high reliability, high availability, high performance and extensibility from the perspectives of throughput, concurrency, delay and load.Copy the code
  • (Technology) HBase high availability principles and practices

    This article describes several common HBase high availability problems, how to solve these problems, and the implementation principles of the HBase high availability feature.Copy the code
  • Scalable Stream Processing: A Survey of Storm, Samza, Spark and Flink

    In this paper, the characteristics of Storm, Trident, Samza, Spark Streaming, Flink (Streaming) are compared comprehensively, and Apex, Heron, MillWheel, Beam, Features of streaming processors such as IBM Infosphere Streams.Copy the code
  • (Technology) Spark’s practice in anti-cheating clustering scenarios

    For bulk spammer content and behavior, clustering is an effective method to replace manual strategies. This paper tries to use clustering to discover and mine spammers. anti-spamCopy the code