data

Massively Scalable Processing & Massively Parallel Processing

Massively Scalable Processing & Massively Parallel Processing

Massively Scalable Processing Real-time processing systems designed to efficiently process large volumes of data in a distributed, massively scalable manner are known as massively scalable processing. Cloud-native solutions and distributed computing frameworks such as Hadoop and Spark are examples of such systems. Features of MSP Horizontal scalability Increasing the number of nodes (machines) to spread processing and storage over several systems is known as horizontal scalability. Parallelism Dividing work into manageable portions that are handled concurrently by several nodes. Fault tolerance Systems can gracefully bounce back from node outages or hardware malfunctions. Scalability Distributed data storage allows for scalability of…
Read More
Scraping Data with DevTools and HAR Files

Scraping Data with DevTools and HAR Files

Data scraping is a game-changer for anyone looking to extract meaningful information from websites. With tools like Chrome DevTools and HAR files, you can uncover hidden APIs and capture valuable data streams effortlessly. In this post, I’ll share how I used these tools to scrape product data from Blinkit, a grocery delivery platform, and show you how you can do it too. Why I Chose Data Scraping for My Grocery App While building a grocery delivery app, I faced a major challenge—lack of real data. Creating my own dataset from scratch would have been extremely time-consuming and offered no real…
Read More
Generative AI and Climate Change Are on a Collision Course

Generative AI and Climate Change Are on a Collision Course

In 2025, AI and climate change, two of the biggest societal disruptors we're facing, will collide.The summer of 2024 broke the record for Earth’s hottest day since data collection began, sparking widespread media coverage and public debate. This also happens to be the year that both Microsoft and Google, two of the leading big tech companies investing heavily in AI research and development, missed their climate targets. While this also made headlines and spurred indignation, AI’s environmental impacts are still far from being common knowledge.In reality, AI’s current “bigger is better” paradigm—epitomized by tech companies’ pursuit of ever bigger, more…
Read More
Big Tech Will Scour the Globe in Its Search for Cheap Energy

Big Tech Will Scour the Globe in Its Search for Cheap Energy

On the southern tip of Malaysia lies the state of Johor, renowned for its beaches and mountainous jungle. But Johor has a new boom industry: data centers to power generative AI, with Microsoft committing more than $2 billion on just such a data center. For the tech giants, electricity has become the new oil. A state-of-the-art AI data center might need 90 MW, enough to power tens of thousands of American homes. With AI applications proliferating, from chatbots to AI agents, needs are growing. One industry consortium is planning for data centers requiring 10 GW (more than a hundred times…
Read More
ToxicPanda: A New Malware Threat to Android Users and Their Bank Accounts

ToxicPanda: A New Malware Threat to Android Users and Their Bank Accounts

In a concerning development for Android users worldwide, cybersecurity researchers have identified a new malware known as ToxicPanda that poses a significant threat to mobile devices and banking security. This sophisticated trojan is spreading rapidly, disguised as trusted apps like Google Chrome and various banking applications, putting sensitive user data and financial accounts at risk. What is ToxicPanda? ToxicPanda is a financial-focused trojan malware that has already compromised over 1,500 devices across Europe and Latin America, according to the Threat Intelligence team at cybersecurity firm Cleafy. The malware is derived from an older malware family known as TgToxic, but with…
Read More
Uber’s latest gig economy play is offering out coders for hire

Uber’s latest gig economy play is offering out coders for hire

reports that Uber has started farming out independent contractors for data-labeling services as well as coders-for-hire.Uber describes its new Scaled Solutions division as a platform of “analysts, testers and independent data operators,” according to . Scaled Solutions began life as an internal team, handling “large-scale annotation tasks” for Uber’s other services. Now it’s been expanded to provide coders and data labelers for outside companies like Pokémon Go developer Niantic Inc. and the self-driving trucking software firm Aurora Innovations. Aurora acquired Uber’s self-driving unit in 2020 and Uber is one of Aurora’s investors.Uber started recruiting workers this month for its Scaled…
Read More
Bring Data to Life: How to Install Grafana on Ubuntu 22.04

Bring Data to Life: How to Install Grafana on Ubuntu 22.04

Monitoring and visualizing real-time data has become crucial for businesses to make data-driven decisions. Several tools exist to achieve this goal, including paid ones. Grafana is one such tool; however, it's free and open-source. It stands out from other analytics tools with its flexibility to connect to a wide range of data sources and its highly customizable, interactive dashboards. Unlike many paid tools, Grafana's open-source model provides powerful visualization at no cost, making it a scalable choice for teams of all sizes. In this guide, we'll cover the step-by-step installation process of Grafana using the official Grafana APT repository. Let's…
Read More
OLAP (Online Analytical Processing)

OLAP (Online Analytical Processing)

OLAP (Online Analytical Processing) is a technology that enables analysts to extract and query data interactively from multidimensional data warehouses. It provides a way to analyze complex datasets for decision-making, typically in business intelligence (BI) applications. Definition of OLAP OLAP is a system for organizing large business databases and supporting complex analysis. Unlike OLTP (Online Transaction Processing), which focuses on fast, real-time transactional operations, OLAP emphasizes analytical operations such as summarizing, aggregating, and comparing data across multiple dimensions. Core Concept of OLAP At its core, OLAP uses a multidimensional data model, often referred to as a "cube." This cube allows…
Read More
The Benefits of MedOne and VMware: A Cloud Solution Built for the Future

The Benefits of MedOne and VMware: A Cloud Solution Built for the Future

By partnering with VMware, MedOne is able to offer businesses advanced virtualization and cloud management capabilities that help them better manage their IT resources and ensure business continuity. Here are the key benefits of the MedOne and VMware collaboration: 1.Seamless Virtualization with VMware vSphereOne of the biggest advantages of the MedOne and VMware partnership is the ability to offer VMware vSphere, one of the most trusted virtualization platforms in the industry. With vSphere, businesses can consolidate their physical servers into a more efficient virtualized environment, reducing hardware costs and simplifying IT management. MedOne’s cloud infrastructure enhances this by offering secure…
Read More
Is there an Alternative to Debezium + Kafka?

Is there an Alternative to Debezium + Kafka?

I asked this question on Reddit a while back and received lots of valuable answers. Therefore, I've looked into each answer and documented the results in this article. TL;DR No, Debezium dominates the market at the moment, despite some drawbacks. Background Explanation Why would we want to find an alternative to Debezium? The main reason is we encountered a challenging scenario. This is a typical scenario for Debezium, where any modifications to the data source are captured and fed into Kafka for downstream processing. The advantage of this architecture is simple and efficient, ensuring all downstream processes are as real-time…
Read More
No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.