• Bide Bullpen
  • Posts
  • December Lead Off Holiday Edition šŸŽ„āš¾

December Lead Off Holiday Edition šŸŽ„āš¾

Wait, is it year end already?

Forgot someoneā€™s name? Use ā€œonamae (pronounced onama-eh) wa nan des ka?ā€ in Japanese which translates to - whatā€™s your name?

šŸ’« Build Smarter Data Pipelines with Orem Data

Ready to optimize your BI infrastructure or kickstart your organization's data journey? Yuki Kakegawa at Orem Data offers personalized consulting to craft the perfect BI stack for your needs.

With expertise in Power BI, Python, SQL, and cloud platforms including Snowflake & Microsoft Fabric, Yuki delivers tailored solutions to streamline your data strategy.

šŸ‘‰ Email Yuki today at [email protected] to get started!

ā³Todayā€™s reading time: 5 minutes

Hereā€™s whatā€™s brewing in the data world:

1ļøāƒ£ Omni Analytics just launched Create Mode for embedded analyticsā€”making it easier to build directly within apps.
2ļøāƒ£ Sigma Computing rolled out a public beta for input tables on Amazon Redshift. Big win for Redshift fans!
3ļøāƒ£ Thoughtspot introduced the next iteration of Identity and Access Management with its IAM v2 release.
4ļøāƒ£dbt Core v1.9 is here, bringing support for microbatch incremental strategies and updates to snapshot configs.
5ļøāƒ£AWS has dropped managed S3 tables based on the popular open source format - Apache Iceberg!

Business Intelligence šŸ’”šŸ“Š

Omni

What is it: Omni Analytics has launched Create Mode for its embedded analytics dashboard. This new feature lets users (with the right permissions, of course) build their own analyses and share them with peers within their organization. The idea? Empower users with self-service analytics and reduce the flood of data requests that usually bog down data teams.

Source: Omni Analytics

Omni also shared upcoming features in its 2024 recap blog, and one stand out highlight is their focus on creating a more Excel-like interface. The goal is to make end users feel at home with the tool, lowering the learning curve and boosting adoption.

Why is it relevant: Tools like Omni Analytics and Sigma Computing are bringing an alternative take to the BI space by focusing on ease of adoption. Both use excel formula to SQL translation, and offer (or plan to offer) Excel-like frontends. For smaller businesses that donā€™t want to heavily invest in specialized BI resources, Omni could be a strong contenderā€”just like Sigma.

Check everything out here.


Sigma Computing

What is it: Sigma computing has introduced writeback capabilities to AWS Redshift Data warehouse for input tables. ā€œInput tablesā€ in Sigma, give users flexibility to create custom tables through data entry, CSV uploads or modified tables built on top of existing ones. With Sigmaā€™s spreadsheet-like interface, users can build these tables inside the tool, and push them directly to the data warehouse for broader use across their organization. This feature could be helpful for scenarios including budgeting analysis, custom groupings that shift annually, and team-specific rules.

Source: Sigma Computing

Sigma did a winter product launch recently and has announced features including

  • Ask Sigma which has a step by step explanation of AI responses generated,

  • Custom level subtotals, & hierarchies for improved reporting, and

  • Export bursts for push-based report subscriptions

You can read all about the product launch here.

Why is it relevant: Sigma is carving its niche by focusing on simplicity and familiarity with its spreadsheet-like interface. While it has some catching up to do compared to BI giants, its straightforward approach makes it an appealing option, especially for teams hesitant to adopt more complex BI tools.

Thoughtspot

What is it: Thoughtspot is rolling out the next iteration of Identity Access Management by layering Okta on its cloud service. This will allow organizations to setup Thoughtspot cloud apps more securely, by using MFA (multi-factor authentication), SSO (single sign on) support, and self-service management.

Source: Thoughtspot

Why is it relevant: Safety is a big deal in the data world, and enhancements like this are always a welcome addition. Features like this which are meant to improve security, help moves the data industry forward, and improve trust across the board for BI and Data tools.

Read all about it here.

Data Engineering šŸ› ļøšŸ“Š

dbt

What is it: dbt Core, the open-source version of dbt, has introduced several long-awaited features in version 1.9 to enhance user experience and reduce the need for manual workarounds. One of the key additions is microbatch materialization for large datasets. This allows users to partition a large data table and process only a smaller portion at a time, saving on resources and time. There's also a retry option for failed batches, ensuring smoother operations when dealing with large-scale data.

Snapshots (used to store point-in-time versions of data for tracking changes) have been simplified in this update. The use of YAML for configuration makes setup more intuitive and organized. Additionally, users can now rename meta fields (which was previously impossible), and target-aware schemas are now an option.

There are other changes as well, including state:modified behaviors to generate less false positives by better accounting for configuration differences across environments. This update should make dbt better at falsely flagging any deliberate changes.

Read the entire release here.

AWS

What is it: One of the major announcements from AWS re:Invent was the launch of S3 Tables and S3 Metadata. With S3 Tables, AWS now offers a managed implementation of the Iceberg table format, which is increasingly popular for its performance when querying large datasetsā€”think Netflix-scale data processing. On the other hand, S3 Metadata provides a managed solution for handling metadata, to manage and query data as it scales over time.

Read more about the feature here.

Why it matters: Wrapping a managed layer around an open source service improves accessibility of open source format, since they can be difficult for organizations to implement. With a managed layer, complexity is abstracted. However, that may come at a price, including vendor lock-ins. It is a tradeoff that needs to be considered when balancing ease of implementation versus freedom that comes with open source.

Data events action calendar šŸ“…

  • Snowflake RAG ā€˜nā€™ ROLL Hackathon is in Toronto on December 17th. Details here.

  • Databricks Data Intelligence with Databricks on Google Cloud virtual workshop is on January 15, 2025. Details here.

  • Google Cloud Next is happening in Las Vegas from April 9-11, 2025. Details here.

Enjoying reading the latest in the data world? Please subscribe and spread the word!

Feedback? Email us at: [email protected]

Reply

or to participate.