By: Jeremy Pedersen
Sorry folks, have to keep it short this week!
Let's take a look at some of the questions I have been asked in recent Alibaba Cloud training sessions. That's right "Friday Q&A" is back! We'll focus on Big Data today.
Unfortunately no. DataV dashboards are designed to display data in real-time, while MaxCompute is designed for offline, batch processing. This means that MaxCompute queries take anywhere from tens of seconds to tens of minutes to run (depending on the size of the data processing job), while DataV expects data sources to respond in under a second.
However, you can connect MaxCompute to Hologres, which is a much faster OLAP database system designed to work with MaxCompute. DataV can read data directly from Hologres without any issues.
Yes. Unlike DataV, QuickBI is not designed for real-time data display: it's a traditional BI reporting tool. Further, it has a built-in cache which allows it to work well with batch processing tools like MaxCompute.
Yes. MaxCompute can directly run Spark code. See this documentation.
MaxCompute's SQL dialect is mostly compatible with Hive, so sometimes, yes. If you're curious how different SQL dialects (Hive, MySQL, Oracle) map onto MaxCompute's SQL language, take a look here.
Actually, yes! As it turns out, you can translate Airflow jobs into DataWorks workflows. See this post on the developer forums. Unfortunately the article is in Chinese, so you might have to use Google Translate!
Yes. See this document.
Yes. MaxCompute allows you to treat files in OSS as an "external table". See here.
That's it for this week! See you next time.
JDP - April 23, 2021
JDP - July 9, 2021
JDP - May 20, 2022
JDP - June 11, 2021
JDP - April 1, 2021
JDP - May 20, 2021
More Posts by JDP