My Cloud Data Lake (2): ClickHouse
In part 1 “My Cloud Data Lake (1): dbt + dremio”, I mainly introduced using DBT + Dremio to import & process data out of PostgreSQL, and…
Apr 29, 2022

My Cloud Data Lake (1): dbt + dremio
In my daily work, I have a lot of data I want to explore. I want to find an economical way to store and analyze it.
Apr 1, 2022

How to parse Spark SQL faster
In my daily work, we built a data analysis product based heavily on Apache Spark. A common task is analyzing Spark SQL.
Mar 30, 2022

To-business software: design your system for content contributors
Designing software for business is complicated. It involves many roles in the target company. Among them, content contributors are most…
Apr 1, 2021

How to get Medium stats data and make your own visualizations
Medium stats are useful data for me, so I want to fetch them daily with scripts and build my own visualizations to get more insight.
Mar 4, 2021

SQL Parser vs SQL Generator: What beats jOOQ may not be another SQL Generator, but a String…
How to apply compiler theory to build a SQL Parser & SQL Generator, and why StringTemplate4 may be a good jOOQ alternative.
Mar 2, 2021

Why plain text configuration file can be a good user interface for business software?
Plain text configuration can describe all kinds of applications, and it brings many benefits.
Feb 27, 2021

Published in Analytics Vidhya
Apache Spark : for those who starred Spark in Github, what else projects were starred?
After using my own tool “universe-lite” to query the GitHub API, I found which other projects are popular among those who starred Apache Spark.
Feb 26, 2021

Be Warned: MinIO (popular open source object storage) may change its License from Apache to AGPL!
AWS S3 is a really great product. Amazon built a cloud computing empire on S3. S3 is so cheap while still having very high availability…
Dec 2, 2020