
EDGE AI POD
Discover the cutting-edge world of energy-efficient machine learning, edge AI, hardware accelerators, software algorithms, and real-world use cases with this podcast feed covering all things from the world's largest EDGE AI community.
It features shows like EDGE AI TALKS and EDGE AI BLUEPRINTS, as well as EDGE AI FOUNDATION event talks on a range of research, product, and business topics.
Join us to stay informed and inspired!
Garbage In, Garbage Out - High-Quality Datasets for Edge ML Research
The EDGE AI FOUNDATION's Datasets & Benchmarks Working Group highlights the rapid progress in neural networks, particularly in cloud-based applications like image recognition and NLP, which benefited greatly from large, high-quality datasets. However, the constrained nature of edge AI devices necessitates smaller, more efficient models, yet a lack of suitable datasets hinders progress and realistic evaluation in this area. To address this, the Foundation aims to create and maintain a repository of production-grade, diverse, and well-annotated datasets for tiny and edge ML use cases, enabling fair comparisons and the advancement of the field. They emphasize community involvement in contributing datasets, providing feedback, and establishing best practices for optimization. Ultimately, this initiative seeks to level the playing field for edge AI research by providing the necessary resources for accurate benchmarking and innovation.
Learn more about the EDGE AI FOUNDATION - edgeaifoundation.org
Welcome to the Deep Dive. Today we're tackling a really interesting problem. How do we get a clear picture of how well artificial intelligence is actually performing when it's running on those tiny power-sipping devices, you know, edge devices?
Speaker 2:Right, the microcontrollers, the battery-powered gadgets. Exactly.
Speaker 1:Things where AI meets the well the physical world. Getting that right seems like kind of a big deal.
Speaker 2:Oh, it is a big deal and as you'll see, the path to reliably evaluating and, frankly, pushing the boundaries of edge AI. It isn't quite as straightforward as you might think. There are some unique challenges we really need to unpack here.
Speaker 1:Absolutely, and that's exactly our mission today right To dive deep into how a new initiative is confronting a core issue in this field.
Speaker 2:Which is.
Speaker 1:Which is the current lack of standardized, high-quality data for edge AI research. I mean, think about it. If everyone's testing their AI on totally different data sets.
Speaker 2:How can you compare?
Speaker 1:Exactly. It's nearly impossible to get a clear sense of which approaches are truly advancing the field.
Speaker 2:Okay. So to help us get a handle on this, we're looking at a really insightful paper. It's called Leveling the Playing Field for Edge AI Research Through High-Quality Datasets.
Speaker 1:Right.
Speaker 2:And this comes from the Edge AI Foundation Datasets and Benchmarks Working Group. It's a real collaboration of experts. You've got folks from NXP, Innatera, Harvard
Speaker 1:University, Imperial College, Imagimob.
Speaker 2:Renesas, the Technical University of Denmark. It's quite the list.
Speaker 1:It is and you might have seen the Edge AI Foundation's logo that sort of interconnected network design.
Speaker 2:Yeah, I think so.
Speaker 1:Their tagline is connecting AI to the real world, which really kind of sums up what we're talking about today, doesn't it?
Speaker 2:It absolutely does, it nails it.
Speaker 1:Okay, so when you think about the, well, the incredible progress we've seen in AI recently. Massive progress, right. A lot of that has been fueled by these enormous data sets, like ImageNet for image recognition.
Speaker 2:That'll really change the game.
Speaker 1:Or just vast amounts of text data for natural language processing. These big shared data sets, they've provided, like, a common ground for researchers to train, test, and compare models.
Speaker 2:And the impact is undeniable. The paper we're looking at even has this graph, you know, showing how AI performance in things like image recognition, reading comprehension, it hasn't just improved dramatically.
Speaker 1:It surpassed humans in some cases.
Speaker 2:In some specific tasks. Yeah, yeah. Which just goes to show the power of having those large standardized data sets. They really drive things forward.
Speaker 1:And this is the key thing, the edge AI world. It presents a very different picture, totally different, because these tiny devices are used for such a huge range of applications: monitoring crops, wearable health trackers, smart sensors.
Speaker 2:You name it.
Speaker 1:The landscape is just so much more fragmented. We don't have that same unified data infrastructure that's been so crucial for cloud AI.
Speaker 2:It's a good analogy you used before comparing a scooter to a semi truck. Different purposes, wildly different requirements.
Speaker 1:Yeah.
Speaker 2:And this inherent diversity. It's made even trickier by the fundamental nature of edge devices themselves. Constraints, you mean? Exactly. They have to operate under incredibly tight constraints. Very limited power, often running on tiny batteries, right. Strict thermal limits, they can't overheat in these tiny packages. And tiny memory footprints, both for storing the actual AI model and for the, you know, the working RAM they use during operation, right.
Speaker 1:So, while the trend in cloud AI has been towards bigger, more complex models, often meaning better accuracy. Yeah, bigger is often better in the cloud. But that sheer size becomes a massive roadblock when you're trying to cram AI onto a microcontroller with, like, kilobytes of RAM. The resources just aren't there.
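To make that memory roadblock concrete, here is a rough back-of-envelope sketch. The 250,000-parameter figure is an arbitrary illustration we picked, not a number from the paper:

```python
# Rough model storage footprint: parameter count x bytes per weight.
# The 250,000-parameter count is an arbitrary illustration, not from the paper.

def model_size_bytes(n_params: int, bits_per_weight: int) -> int:
    """Storage needed for the weights alone (ignores activations and runtime RAM)."""
    return n_params * bits_per_weight // 8

n_params = 250_000
fp32_kib = model_size_bytes(n_params, 32) / 1024  # ~977 KiB as 32-bit floats
int8_kib = model_size_bytes(n_params, 8) / 1024   # ~244 KiB after 8-bit quantization

print(f"fp32: {fp32_kib:.0f} KiB, int8: {int8_kib:.0f} KiB")
```

Even aggressively quantized, a "small" model like this can exceed the total memory of many microcontrollers, which is why edge models have to be designed around the budget from the start.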
Speaker 2:You can't run those huge networks.
Speaker 1:Okay, so big data fueled cloud progress, but the edge is diverse resource constrained. This leads us to the core issue what specific problem is this edge AI foundation initiative really trying to solve here?
Speaker 2:So the paper really highlights a key challenge. Because we lack these standardized ways to evaluate performance using realistic data, many research publications in the tinyML and edge ML fields tend to present an overly optimistic picture.
Speaker 1:Optimistic how.
Speaker 2:About how much models can actually be compressed and optimized without significantly losing performance.
Speaker 1:Okay, hang on. So you're saying the really impressive efficiency gains, these tiny model sizes we sometimes read about in papers.
Speaker 2:Yeah, the ones designed for these tiny devices.
Speaker 1:They might look great on paper, but they don't always translate to real world success when you try to put them into actual products.
Speaker 2:That's precisely it. The authors emphasize that these models are often evaluated on what they call toy examples. Toy examples? Yeah, very small, simplified data sets. They just don't capture the complexity, the variability, the sheer messiness of real-world data. So for our listeners, especially if you're involved in developing or deploying edge AI solutions, it's really crucial to be aware of this: research claims based just on these limited data sets might not accurately reflect how those models will perform in your specific use case.
Speaker 1:Right, Because that overestimation it's not just academic, is it? It could lead to real problems.
Speaker 2:Oh, absolutely Significant delays, wasted resources. When companies try to deploy these models in actual products and then discover whoops, they don't actually meet the necessary accuracy or efficiency targets.
Speaker 1:That's a really critical point. It's like testing that fuel efficient engine prototype only in perfect lab conditions. You need the highway test, the bad weather test.
Speaker 2:Exactly, you need the messy reality.
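As a toy illustration of that lab-versus-highway gap (entirely synthetic, our own example, not from the paper): a simple nearest-centroid classifier that looks near-perfect on clean test data degrades sharply once realistic noise is added to the test set.

```python
import numpy as np

# Entirely synthetic illustration: a model evaluated only on clean "toy" data
# overstates how it will perform on messier, real-world-like inputs.
rng = np.random.default_rng(0)

# Two well-separated classes in 2D; fit a nearest-centroid classifier.
train = {c: rng.normal(2.0 * c, 0.3, size=(100, 2)) for c in (0, 1)}
centroids = {c: x.mean(axis=0) for c, x in train.items()}

def accuracy(noise_std: float) -> float:
    """Accuracy on a fresh test set corrupted with Gaussian noise."""
    correct, total = 0, 0
    for c in (0, 1):
        x = rng.normal(2.0 * c, 0.3, size=(200, 2))
        x = x + rng.normal(0.0, noise_std, size=x.shape)  # "real-world" messiness
        for sample in x:
            pred = min(centroids, key=lambda k: np.linalg.norm(sample - centroids[k]))
            correct += int(pred == c)
            total += 1
    return correct / total

clean_acc = accuracy(0.0)  # the "toy example" number
noisy_acc = accuracy(2.0)  # the "highway test" number
print(f"clean: {clean_acc:.2f}, noisy: {noisy_acc:.2f}")
```

The point is not this particular classifier, it is that the clean-data number alone would badly overestimate deployed performance.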
Speaker 1:So, if these toy examples are giving us a potentially skewed view, what's the Edge AI Foundation's working group proposing? What's the actual solution?
Speaker 2:Okay. So their core solution is to establish a publicly accessible repository, a collection of realistically sized data sets, and these are specifically curated for tiny and edge ML applications. The fundamental idea is to move the whole research community away from those overly simplistic examples.
Speaker 1:Towards data that actually represents the challenges you face in real world deployments.
Speaker 2:Precisely. Data that looks like the real thing.
Speaker 1:Okay, so this repository, that's the key to creating that level playing field they talk about in the paper's title.
Speaker 2:That's the idea.
Speaker 1:How does having these common high-quality data sets help exactly?
Speaker 2:Well, by providing this shared foundation of data, researchers, developers, everyone can compare their various techniques much more effectively. Techniques like, well, whether they're focused on shrinking the model size, minimizing power consumption, boosting accuracy, improving precision, whatever their goal is. They can compare them in a much more meaningful, objective way, because everyone will be evaluating their innovations against the same benchmarks, using data that reflects real-world complexities.
Speaker 1:Okay, that sounds like a well, a really crucial step Moving towards more reliable progress in the field. What are some of the specific goals the working group has set out for this initiative?
Speaker 2:They have several key objectives laid out. Firstly, obviously, provide these realistically scaled data sets and, importantly, these won't be static. The idea is they'll be actively curated and expanded by the broader community.
Speaker 1:Dynamic.
Speaker 2:Secondly, they want to foster open research, research into the most effective ways to actually do the AI inference on these data sets, taking into account critical factors like power, memory, accuracy, precision all the edge constraints.
Speaker 1:Makes sense.
Speaker 2:And finally, they intend to facilitate the development and, crucially, the sharing of diverse optimization techniques within the community.
Speaker 1:So it's not just about giving people the data. It's about building a whole collaborative ecosystem around it to advance the tech.
Speaker 2:That's exactly it, and it's important to understand the working group's role here. They're quite clear about this: they're not positioning themselves as, like, the judges of submission quality or the managers of some big centralized benchmarking system.
Speaker 1:OK.
Speaker 2:Their role is more about being an enabler. They're providing the essential resources, the data sets that will empower the community itself to conduct more informed, more transparent comparisons between different approaches.
Speaker 1:Got it Empowering the community's collective knowledge, not imposing from the top down. Now the paper talks about the importance of carefully considering use cases and selecting data sets. Given how diverse edge AI is, how are they actually approaching that? It seems tricky.
Speaker 2:It is tricky. That heterogeneity is a huge challenge, but they propose a pretty smart approach, actually, which is focusing on the fundamental technical requirements that different use cases impose.
Speaker 1:Yeah.
Speaker 2:Rather than trying to define specific benchmarks for every single possible application out there.
Speaker 1:Okay, explain that a bit more.
Speaker 2:So, for instance, instead of creating a specific benchmark just for, say, smart doorbells, they would analyze the underlying technical demands that a smart doorbell places on an AI system, things like real-time object detection.
Speaker 1:Under different lighting maybe.
Speaker 2:Exactly Varying lighting conditions. With a limited power budget, they break it down to those core requirements.
Speaker 1:Okay, so it's more granular, focusing on the technical building blocks, not the specific end product. What kinds of technical requirements are they looking at then?
Speaker 2:They've identified several key categories. One really crucial distinction is between real-time processing and batched processing.
Speaker 1:Okay.
Speaker 2:For many edge applications, like, say, anomaly detection in industrial equipment, you need that immediate analysis. So real-time single sample operation is a top priority.
Speaker 1:Can't wait for a batch to process if a machine is about to fail.
Speaker 2:Precisely. Another vital factor is the energy budget. A tiny sensor running on a coin cell battery. It operates under vastly different power constraints compared to, say, a wall powered smart camera.
Speaker 1:Absolutely. You can't drain a watch battery in an hour with a power hungry AI model.
Speaker 2:No way. And that links to another key consideration, always on operation.
Speaker 1:Right Things that are constantly listening or watching.
Speaker 2:Yeah, health monitors, security sensors. They need to continuously monitor their environment, and that has huge implications for power consumption, sustained performance, even the long-term reliability, the durability of the hardware itself. They also consider the basic nature of the task. Is it classification, like identifying a specific sound, or prediction, or is it more like regression or data transformation?
Speaker 1:Different types of output Right.
Speaker 2:And finally they look at the data modality. Is the device processing sequential time series data like sensor readings from an accelerometer, or individual independent frames or images, like from a camera?
Speaker 1:Okay. So by dissecting all these diverse applications into these core technical requirements, they can develop data sets and benchmarks that have broader relevance. They apply across a wider range of edge AI scenarios.
Speaker 2:That's exactly the goal. It allows researchers to focus on optimizing for specific technical challenges, challenges that are relevant to loads of potential applications, rather than getting stuck on very narrow application-specific data sets.
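Those requirement categories could be captured in a small data structure, something like the sketch below. The field names and example values are our own illustration, not the working group's actual schema:

```python
from dataclasses import dataclass

@dataclass
class UseCaseRequirements:
    """Technical requirements a use case places on an edge AI system.

    Field names are illustrative, not an official Edge AI Foundation schema.
    """
    real_time: bool          # single-sample, immediate inference vs. batched
    energy_budget_mw: float  # rough average power budget, in milliwatts (assumed unit)
    always_on: bool          # continuous monitoring of the environment
    task: str                # "classification", "regression", "transformation"
    modality: str            # "time_series" (sensor streams) or "image" (frames)

# e.g. a coin-cell industrial anomaly detector vs. a wall-powered smart camera
anomaly_detector = UseCaseRequirements(
    real_time=True, energy_budget_mw=1.0, always_on=True,
    task="classification", modality="time_series",
)
smart_camera = UseCaseRequirements(
    real_time=True, energy_budget_mw=2000.0, always_on=True,
    task="classification", modality="image",
)
print(anomaly_detector.energy_budget_mw, smart_camera.energy_budget_mw)
```

Two very different products can then be compared on the same axes, which is exactly the "building blocks, not end products" idea.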
Speaker 1:That makes a lot of sense. Now, the paper also outlines how the Edge AI Foundation actually intends to improve the quality and usefulness of these data sets. What's their strategy there?
Speaker 2:Yeah, they've laid out a pretty comprehensive multifaceted strategy for that. They plan to start by leveraging existing data sets as foundational base sets.
Speaker 1:So building on what's already out there.
Speaker 2:Exactly, not reinventing the wheel every time. They also emphasize the critical importance of variety, actively seeking out and incorporating a wide spectrum of scenarios and variabilities within the data.
Speaker 1:Why is that so important?
Speaker 2:To ensure that the AI models trained and tested on these data sets are robust, that they can actually handle the complexities of the real world. You don't want a data set that only shows perfect, ideal conditions. That's not reality.
Speaker 1:No, real-world data is messy: noise, different environments, unexpected stuff. The AI needs to handle all that.
Speaker 2:Absolutely. Another crucial element is metadata.
Speaker 1:The data, about the data.
Speaker 2:Right. They're committed to ensuring all data sets come with complete and accurate labeling, Plus information about how the data was collected, how it was processed.
Speaker 1:Why does that matter?
Speaker 2:It builds trust. It allows researchers to really understand the data thoroughly and then interpret their results with confidence. And, finally, they recognize that AI is well constantly changing, so they're planning for continuous adaptation. The data sets need to evolve to keep pace with the latest advancements in AI models and techniques.
Speaker 1:So this isn't just a static library they're building. It's meant to be a living, dynamic resource that grows and adapts as the field moves forward.
Speaker 2:That's the vision.
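As an illustration of the kind of metadata being described, labels plus collection and processing provenance, here is a hypothetical record with a minimal completeness check. Every field name and value is invented for illustration, not the Foundation's actual format:

```python
# Hypothetical dataset metadata record; all names and values here are
# illustrative, not the Edge AI Foundation's actual schema.
REQUIRED_FIELDS = {"name", "version", "labels", "collection", "processing"}

def is_complete(meta: dict) -> bool:
    """Check that a metadata record carries the minimum provenance fields."""
    return REQUIRED_FIELDS <= meta.keys()

example_metadata = {
    "name": "example_visual_wake_words",  # hypothetical dataset name
    "version": "1.0.0",
    "labels": ["person", "no_person"],
    "collection": {
        "sensor": "low-cost image sensor",
        "conditions": ["indoor", "outdoor", "low-light"],
    },
    "processing": ["resized to 96x96", "grayscale conversion"],
}

print(is_complete(example_metadata))        # complete record
print(is_complete({"name": "mystery_set"}))  # missing provenance fails the check
```

A check like this is one way a living repository could keep contributed data sets trustworthy as they evolve.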
Speaker 1:The paper mentions an initial focus on visual wake words. Can you elaborate a bit on why they're starting there?
Speaker 2:Yeah, they're initially focusing on visual wake words using images. That's a pretty common task, right, for many battery-operated devices that need to respond to some visual cue.
Speaker 1:But while using minimal power.
Speaker 2:Plus, there's a decent amount of existing image data out there that can serve as a foundation to build upon. Okay, so they'll be working hard to assess how well models trained on these initial sets actually generalize to real-life scenarios and making sure the data sets cover a really broad range of conditions, variations, all that messy reality we talked about.
Speaker 1:And they plan to expand beyond images later.
Speaker 2:Yes, absolutely. Plans are in place to extend to other data modalities soon after.
Speaker 1:Okay, it sounds like a really well thought out and, frankly, incredibly important undertaking. Now the paper keeps stressing the vital role of community participation. How can people, researchers, companies, individuals actually get involved? How can they help ensure these data sets are high quality and relevant?
Speaker 2:Community involvement is absolutely paramount. They make that very clear For this initiative to really succeed and have a lasting impact, they need the community.
Speaker 1:So how do they get involved?
Speaker 2:The Edge AI Foundation is actively encouraging participation in several key ways. First, they're specifically asking for recommendations and contributions of existing data sets.
Speaker 1:So if someone knows a good recommendations and contributions of existing data sets.
Speaker 2:So if someone knows a good data set yeah, data sets that could be valuable starting points, or maybe data sets that community thinks would really benefit from being improved or expanded they want those suggestions. Ok, they also really want feedback feedback on how well the current data sets meet the needs of various use cases, and they're actively soliciting suggestions for expanding the variety, the scope of the data included.
Speaker 1:So if a company, for example, has a valuable data set they've been using internally, or if they see gaps in what's available, they have a channel to contribute that expertise, that data, absolutely. What else?
Speaker 2:They're also inviting the community to contribute improved or expanded test sets. Help raise the bar for performance. Drive more innovation.
Speaker 1:Makes sense. Better tests mean better models.
Speaker 2:And perhaps most critically, they're seeking assistance with the huge task of data labeling and annotation.
Speaker 1:Ah, the often unglamorous but essential work.
Speaker 2:Totally essential To ensure the accuracy, the completeness of the data sets. They're even looking for contributions of labels for data sets that might currently lack comprehensive annotations.
Speaker 1:It really sounds like a truly collaborative effort. Everyone in the Edge AI ecosystem seems to have a potential role to play.
Speaker 2:That's the goal.
Speaker 1:Now, towards the end of the paper, there's a very clear call to action. It's directed right at the Edge AI Foundation community. What are the main things they're urging people to do?
Speaker 2:Yeah, the central message there is a strong, direct invitation to get actively engaged. They are urging members and potential members to provide their valuable insights.
Speaker 1:Insights on what.
Speaker 2:On the specific data-related challenges they are actually facing in their own work right now and to clearly articulate the use cases they believe the working group should prioritize.
Speaker 1:So real-world feedback.
Speaker 2:Exactly that direct feedback loop is essential to make sure the Foundation's resources are focused on the most pressing real-world needs of the community.
Speaker 1:And beyond feedback.
Speaker 2:Well, they're explicitly requesting contributions, Contributions of realistic data sets and relevant evaluation methods for those use cases that are identified as most critical.
Speaker 1:Okay.
Speaker 2:And they're actively encouraging participation in the collaborative development of best practices for optimizing AI models for deployment on these data sets. How do we best tune models for this data for deployment?
Speaker 1:on these datasets? How do we best tune models for this data? So it's a real chance for companies, for researchers, to directly influence the direction and the focus of this whole important initiative.
Speaker 2:Absolutely. And there's one final, very direct and significant call to action in there.
Speaker 1:Which is.
Speaker 2:They are strongly urging companies to join the Edge AI Foundation community.
Speaker 1:Ah, okay, not just participate, but formally join.
Speaker 2:Right. This isn't just about contributing a data set here or some expertise there. It's about becoming an integral part of this larger collaborative ecosystem, an ecosystem that's collectively working to tackle these fundamental challenges in Edge AI.
Speaker 1:And what's the benefit for the companies who join?
Speaker 2:Well, by becoming members, companies get a unique opportunity. They get to directly shape the development of the data sets, the evaluation methods, the best practices, all the things that will ultimately define the future path of this rapidly evolving field.
Speaker 1:So direct influence.
Speaker 2:Direct influence, yeah, and that can provide significant benefits back to their own research and development efforts. It could potentially lead to a real competitive edge in this dynamic market.
Speaker 1:So for any companies listening right now, companies involved in developing or deploying AI on edge devices, this sounds like an incredibly valuable opportunity not just to contribute, but also to gain a strategic advantage by being right at the forefront of these crucial standardization efforts.
Speaker 2:That's precisely the key takeaway here it's a chance to actively participate in shaping the fundamental tools, the resources that the entire industry will increasingly rely on to effectively evaluate and, frankly, drive advancements in edge AI.
Speaker 1:Okay, so let's just summarize the essential points for our listeners. What's the absolute core message from this deep dive into the Edge AI Foundation's initiative?
Speaker 2:I think the most crucial thing to grasp is that the Edge AI Foundation is proactively tackling a major bottleneck, a bottleneck that's really hindering the progress of Edge AI right now.
Speaker 1:Which is that scarcity of standard, high-quality data.
Speaker 2:Exactly, and this isn't just some abstract academic problem. It directly impacts our ability to develop and accurately assess real-world AI applications, the AI running on those low-power, resource-constrained devices that are becoming more and more part of our daily lives. So, by addressing this data gap, the foundation aims to provide a much clearer understanding of what's truly possible, what's actually achievable at the edge.
Speaker 1:And by establishing this common ground, these realistic data sets, the hope is to catalyze even faster innovation and get more practical AI deployed.
Speaker 2:Yes, leading to the practical deployment of AI across a wide spectrum of everyday devices, really connecting AI to the real world. Just like their tagline said, and as we've really emphasized, active participation from the community is just vital for this to work. Therefore, if your company is involved in any aspect of Edge AI designing algorithms, deploying solutions, anything in between we'd strongly encourage you to consider the significant value of joining the Edge AI Foundation community.
Speaker 1:It's that opportunity to contribute, gain access to resources.
Speaker 2:And ultimately play a key role in shaping the future of this incredibly promising field.
Speaker 1:Absolutely, and you know, as Edge AI becomes more and more deeply integrated into the fabric of our lives, it really makes you think, doesn't it? How will these kinds of collaborative initiatives, the ones focused on standardized, high quality data, how will they fundamentally transform not just the capabilities but also the reliability of the AI we interact with every single day?
Speaker 2:It's a huge question.
Speaker 1:It is Definitely something to ponder. Thanks for joining us for this deep dive.