뉴스 | 🇺🇸 미국 | IT/기술 | "IDENTIFIED"

Wired

중도 성향

IT/기술

xAI Asks Court to Strip Alleged Grok Deepfake Nudes Victims of Anonymity

Four people suing Elon Musk's AI firm under pseudonyms due to the risks of being identified may face a difficult choice: Reveal your real names, or drop the lawsuit.

Quartz

중도 성향

IT/기술

Microsoft and Mayo Clinic are teaming up to build a healthcare AI model

Mayo Clinic will own the model, which will be trained on de-identified patient data and initially deployed for use by its clinicians

Breitbart

보수 성향

IT/기술

Hackers Are Using the Same Conversational Tricks on AI that Con Artists Use on People

Cybersecurity researchers have identified a growing class of attacks that exploit AI chatbots through sophisticated conversational manipulation rather than traditional technical hacking methods. The post Hackers Are Using the Same Conversational Tricks on AI that Con Artists Use on People appeared first on Breitbart.

Breitbart

보수 성향

IT/기술

Report: Tech Company Accidentally Spends $500 Million on Anthropic's Claude AI in Single Month

An unidentified corporation reportedly spent half a billion dollars on Claude AI in just one month after failing to implement usage restrictions on employee licenses, highlighting growing concerns about runaway AI costs in corporate America. The post Report: Tech Company Accidentally Spends $500 Million on Anthropic’s Claude AI in Single Month appeared first on Breitbart.

The Verge

중도 성향

IT/기술

Motorola says affiliate hijacking of Amazon app was ‘unintended’

Motorola says that recently discovered behavior, which saw some of its phones sending users to an affiliate tracking website before opening the Amazon app, was "unintended" and has been "promptly corrected." The company didn't explain how the error was introduced in the first place. "Recently, Motorola acted quickly to resolve an issue that was identified, […]

IEEE Spectrum

중도 성향

IT/기술

Accelerating Chipmaking Innovation for the Energy-Efficient AI Era

This sponsored article is brought to you by Applied Materials. At pivotal moments in history, progress has required more than individual brilliance. The most consequential breakthroughs — such as those achieved under the Human Genome Project — required a new operating paradigm: Concentrate the world’s best talent around a single mission, establish a common platform, share critical infrastructure, and collapse feedback loops. When stakes are high and timelines are compressed, sequential and siloed innovation simply cannot keep pace. Today’s AI era is creating an engineering race with similar demands. Every company is pushing to deliver higher-performance AI systems, faster. But performance is no longer defined by compute alone. AI workloads are increasingly dominated by the movement of data: In many cases, moving bits consumes as much — or more — energy than compute itself. As a result, reducing energy per bit can extend system‑level performance alongside gains in peak compute. The path to energy‑efficient AI therefore runs through system‑level engineering, spanning three tightly interconnected domains: Logic, where performance per watt depends on efficient transistor switching, low‑loss power, and signal delivery through dense wiring stacks. Memory, where surging bandwidth and capacity demands expose the memory wall, with processor capability advancing faster than memory access. Advanced packaging, where 3D integration, chiplet architectures, and high‑density interconnects bring compute and memory closer together — enabling system designs monolithic scaling can no longer sustain. These domains can no longer be optimized independently. Gains in logic efficiency stall without sufficient memory bandwidth. Advances in memory bandwidth fall short if packaging cannot deliver proximity within thermal and mechanical constraints. Packaging, in turn, is constrained by the precision of both front‑end device fabrication and back‑end integration processes. In the angstrom era, the hardest problems arise at the boundaries — between compute and memory in the package, front‑end and back‑end integration, and the tightly coupled process steps needed for precise 3D fabrication. And it is precisely this boundary‑driven complexity where the traditional innovation model breaks down. The Traditional R&D Workflow Is Too Slow for Angstrom‑Era AI For decades, the semiconductor industry’s R&D model has resembled a relay race. Capabilities are developed in one part of the ecosystem, handed off downstream through integration and manufacturing, evaluated by chip and system designers, and only then fed back for the next iteration. That model worked when progress was dominated by relatively modular steps that could be scaled independently and simply dropped into the manufacturing flow. But the AI timeline has upended these rules. At angstrom‑scale dimensions, the physics enforces inescapable coupling across the entire stack: materials choices shape integration schemes; integration defines design rules; design rules dictate power delivery; wiring sets thermal budgets; and thermals ultimately constrain packaging scaling. System architects simply cannot wait 10–15 years for each major semiconductor technology inflection to mature. Representing a roughly $5 billion investment, EPIC is the largest commitment to advanced semiconductor equipment R&D in U.S. history. A long‑term perspective is essential to align materials innovation with emerging device architectures — and to develop the tools and processes required to integrate both with manufacturable precision. At Applied Materials, together with our customers, we are charting a course across the next 3–4 generations, extending as far as 10 years down the roadmap. The angstrom era demands that we break down silos and bring together the industry’s best minds — from leading companies to leading academic institutions. If the problem is coupled, the solution must be coupled. If the timeline is compressed, the learning loop must be compressed. It’s not enough to just innovate — we must innovate how we innovate. EPIC: A Center and Platform for High‑Velocity Co‑Innovation This is the challenge that Applied Materials EPIC Center is designed to solve. Representing a roughly US $5 billion investment, EPIC is the largest commitment to advanced semiconductor equipment R&D in U.S. history. When it opens in 2026, it will deliver state‑of‑the‑art cleanroom capabilities built from the ground up to shorten the path from early‑stage research to full‑scale manufacturing. But the facilities are only one component of the model. EPIC is also a platform, an operating system for high-velocity co‑innovation that revolutionizes how ideas move from the lab to the fab. EPIC is a platform, an operating system for high-velocity co‑innovation that revolutionizes how ideas move from the lab to the fab.Applied Materials The EPIC model compresses the traditional workflow. Customer engineers work side‑by‑side with Applied technologists from day one — moving beyond isolated process optimization and downstream handoffs. Within a shared, secure environment, EPIC tightly integrates atomistic modeling, test vehicles, process development, validation, and metrology feedback. Constraints that once surfaced late in development are identified and addressed early. The result is a potentially 2x faster path that benefits the entire ecosystem under one roof: Chipmakers gain earlier access to Applied’s R&D portfolio, faster learning cycles, and accelerated transfer of next‑generation technologies into high‑volume manufacturing. Ecosystem partners gain earlier access to advanced manufacturing technology and collaboration opportunities that expand what is possible through materials innovation. Academic institutions gain opportunities to strengthen the lab‑to‑fab pipeline and help develop future semiconductor talent. Building on decades of co‑development, we are reinventing the innovation pipeline with our partners across logic, memory, and advanced packaging to deliver the next leap in energy‑efficient AI. Accelerating Advanced Logic Logic remains the engine of AI compute. In the angstrom era, however, system‑level gains are increasingly constrained by power and energy. Extending AI performance now depends on architectures that deliver more performance per watt — accelerating the move to 3D devices such as gate‑all‑around (GAA) transistors, which boost density within a compact footprint while preserving power efficiency. Architectures that deliver more performance per watt are accelerating the move to 3D devices such as gate‑all‑around (GAA) transistors, and further out, complementary FETs (CFETs), which push density scaling even more.Applied Materials These architectural shifts are unfolding at unprecedented scale, with the logic roadmap already extending beyond first‑generation GAA toward more advanced designs. One key example is GAA with backside power delivery, which relocates thick power lines to the backside of the wafer, reducing resistive losses and freeing front‑side routing for tighter logic cell integration. Another example brings adjacent GAA PMOS and NMOS transistors closer together while inserting a dielectric isolation wall between them to minimize electrical interference. Further out, complementary FETs (CFETs) push density scaling even more by stacking PMOS and NMOS devices directly atop one another. While these architectures deliver compelling gains in performance per watt and logic density without relying solely on tighter lithography, they significantly raise integration complexity. Manufacturing a single GAA device today can involve more than 2,000 tightly interdependent process steps. At the same time, wiring stacks continue to grow taller and denser to connect these advanced logic devices. Modern leading‑edge GPUs now in development pack more than 300 billion transistors into an area little larger than a postage stamp, interconnected by over 2,000 miles of wiring. Modern leading‑edge GPUs now in development pack more than 300 billion transistors into an area little larger than a postage stamp, interconnected by over 2,000 miles of wiring.Applied Materials At this level of complexity, the process steps used to create these precise 3D devices and wiring stacks cannot be optimized independently. Design and process must evolve in lockstep, and materials innovation and fabrication methods must advance alongside device architecture. EPIC’s co‑innovation model is designed to accelerate exactly this convergence — enabling logic compute to continue advancing the frontiers of AI at the pace the roadmap demands. Powering the Memory Roadmap At the same time, the AI computing era is fundamentally reshaping how data is generated, moved, and processed — making memory technologies, especially DRAM, central to delivering the energy‑efficient performance AI systems require. As models grow larger and more data‑hungry, the DRAM roadmap is shifting toward architectures that deliver higher density, greater bandwidth, and faster access per watt. At the DRAM cell level, AI performance requirements are driving a transition from 6F² buried‑channel array transistors (BCAT) to more compact 4F², and beyond that, architectures that move past what 2D scaling alone can deliver. Applied Materials At the DRAM cell level, this shift is driving a transition from 6F² buried‑channel array transistors (BCAT) to more compact 4F² architectures, which orient the transistor vertically to boost density and reduce chip area. Looking beyond 4F², sustaining gains in performance per watt will require moving past what 2D scaling alone can deliver. The industry is therefore turning to 3D DRAM, stacking memory cells vertically to add capacity within a constrained footprint. As these structures grow taller and aspect ratios intensify, high-mobility materials engineering in three dimensions becomes increasingly critical to performance and reliability. Beyond the memory cell array, another powerful lever for DRAM scaling is shrinking the peripheral circuitry, which includes logic transistors and interconnect wiring. One emerging approach places select periphery functions beneath the DRAM array by bonding two wafers — one optimized for the DRAM cells and the other for CMOS logic — using multiple wiring layers. Beyond the memory cell array, another powerful lever for DRAM scaling is shrinking the peripheral circuitry, which includes logic transistors and interconnect wiring.Applied Materials In parallel, DRAM performance is being extended by leveraging logic‑proven enhancers in the memory periphery. These include mobility boosters such as embedded silicon germanium and stress films, along with wiring upgrades like improved low‑k dielectrics and advanced copper interconnects. Memory manufacturers are also transitioning periphery transistors from planar devices to FinFET architectures, following the logic roadmap to further improve I/O speed. These valuable inflections are central to EPIC’s mission — where they can be co-developed and rapidly validated for next‑generation memory systems. Driving System Scaling With Advanced Packaging As data movement becomes the dominant energy cost in AI systems, advanced packaging has emerged as a critical lever for improving system‑level efficiency—shortening interconnect distances, increasing bandwidth density, and reducing the power required to move data between logic and memory. The rise of 3D packages such as high‑bandwidth memory (HBM) underscores why advanced packaging is becoming central to the AI era.Applied Materials High‑bandwidth memory (HBM) marks a major inflection along this path. By stacking DRAM dies — scaling to 16 layers and beyond — and placing memory much closer to the processor, HBM enables rapid access to ever‑larger working datasets. This delivers step‑function gains in both bandwidth and energy efficiency. More broadly, the rise of 3D packages such as HBM underscores why advanced packaging is becoming central to the AI era. Packaging now addresses system‑level constraints that logic and memory device scaling alone can no longer overcome. It also enables a move away from monolithic systems‑on‑chip toward chiplet‑based architectures, as AI workloads increasingly demand flexible designs that combine logic, memory, and specialized accelerators optimized for specific tasks. A vital technology powering this roadmap is hybrid bonding. With interconnect pitches approaching those of on‑chip wiring, conventional bumps and microbumps run into fundamental limits in density, power, and signal integrity. Hybrid bonding removes these barriers by allowing dramatically higher interconnect and I/O density, supporting a broad range of chiplet architectures — from memory stacking to tighter compute‑memory integration. EPIC tackles high‑value advanced‑packaging challenges through early, parallel co‑innovation across materials, integration, and manufacturing.Applied Materials As bonded structures like HBM stacks grow larger and more complex, warpage control, die placement, stack alignment, and thermal management become first‑order challenges. EPIC tackles these and other high‑value advanced‑packaging challenges through early, parallel co‑innovation across materials, integration, and manufacturing. Bringing It All Together Across logic, memory, and advanced packaging, our industry faces an ambitious roadmap that promises significant gains in energy efficiency for AI systems. But realizing that potential demands breakthrough materials innovation at a time when feature sizes are shrinking, interfaces are multiplying, and process interdependencies are escalating. These challenges cannot be solved on 10–15‑year timelines under the traditional relay‑race model. We must break down silos, align earlier across the ecosystem, and parallelize learning to keep pace with AI’s demands. In the AI era, progress will be defined by the speed at which lightbulb moments turn into manufacturing and commercialization reality. The only viable path forward is a new innovation model — and EPIC is how we are driving it.

IEEE Spectrum

중도 성향

IT/기술

IEEE Smart Village Is Helping to Electrify Rural Cameroon

More than 30 years ago, in the mountain village of Mbem in northwest Cameroon, the moon and stars in the night sky were the only light young Jude Numfor knew after the sunset. Electricity had not yet reached his rural community. “There was one person in the village with a petrol generator and a small television,” Numfor says. “When he turned it on, all the children would run to his house and peep through the window.” That memory became the spark for Numfor’s mission: to bring electricity to rural communities like his hometown. To accomplish his goal, in 2006 he cofounded Wireless Light and Power, since renamed Renewable Energy Innovators Cameroon, and he serves as its CEO. REI Cameroon designs, installs, and maintains solar minigrids for rural electrification. The minigrids use photovoltaic technology and battery-energy storage systems to generate electricity at 50 hertz. The electricity is distributed through smart meters. In 2017 the company received a grant from IEEE Smart Village to fund the expansion of REI’s minigrid operations and refine its business model. Smart Village supports projects and organizations bringing electricity and educational and employment opportunities to remote communities worldwide. The program is supported by IEEE societies and donations to the IEEE Foundation. The partnership has led to a collaboration developing open source metering, a free, community-driven way of tracking energy usage. Unlike proprietary utility meters, the system allows users, researchers, and utilities to view, customize, and verify how data is collected, ensuring transparency in billing, consumption tracking, and grid management. Smart Village’s support has been pivotal, Numfor says: “It’s not just about money. We share ideas, we get advice, and we have made friends. Entrepreneurship is lonely, but with the [Smart Village] community, it is different.” From teenage tinkerer to entrepreneur Numfor’s first experience of life with electricity was in 2001, after moving in with a missionary family in the small village of Allat. They used solar panels to power their whole home—an unimaginable luxury in Mbem. “I could watch TV, eat ice cream, and turn on lights,” he says. “It made me wish my brothers in Mbem had the same opportunity.” Numfor’s curiosity about electricity was ignited when a motion-sensor solar light in the family’s home stopped working. He tinkered with the device to find out why. “My missionary family told me to play with it like a toy,” he says, laughingly. “I replaced the dead battery with a motorcycle battery and was able to bring the power back for the night.” Jude Numfor [right] testing a rechargeable solar lantern, which aimed to replace hazardous kerosene lamps—known locally as “bush lamps.”REI Cameroon His missionary parents encouraged Numfor to study technology and engineering on his own, as none of the country’s universities offered solar energy educational programs at the time. They built him a library and stocked it with books on engineering, management, and entrepreneurship. In 2006, armed with his new knowledge, Numfor launched Wireless Light and Power with a friend, Ludwig Teichgraber. The nonprofit aimed to replace hazardous kerosene lamps—known locally as “bush lamps”—with rechargeable solar lanterns. These solar lanterns—called “light packs”—were built locally by Numfor and a team of 11 young Cameroonians using PVC pipes, nickel-metal hydride batteries, and LED bulbs. Families rented the lamps for a small fee, swapping discharged lamps for fully charged ones at solar-powered charging kiosks when they ran out of power. The kiosks then recharged the depleted lamps, making them available for the next swap. “The solar lantern was safer and cleaner, plus it gave children a chance to read at night,” Numfor explains. “People loved them.” Between 2006 and 2010, his team replicated the model across several villages. But when the global financial crisis hit in 2008, donor support dwindled, forcing the organization to evolve. “We pivoted from being an NGO to a commercial venture,” he says. “That’s how REI was born.” Building solar minigrids to serve community needs The new company’s goal was to move away from the lanterns and toward full electrification of communities. Villagers’ aspirations changed, Numfor says, as they now wanted to power their TVs, music systems, and mobile phones. In response, in 2010, REI developed one of the first solar minigrids in West Africa. Using locally procured components, the prototype supplied steady power to six households. The minigrid system used 12 123-watt solar photovoltaic panels manufactured by Sharp, 16 12-volt 100 ampere-hour automatic gain control lead acid batteries, and a Xantrex charge controller and inverter. Locally sourced wooden light poles were erected to distribute electricity throughout the village. REI charged each household a fee for the electricity. “It was a product-market-fit moment,” Numfor says. “People immediately asked, ‘When can we get this, too?’” The word-of-mouth, grassroots growth caught the attention of global partners. Numfor connected with Smart Village and in 2017, REI Cameroon received its first seed grant from the program. With that funding, Numfor was able to grow organically and attract additional grants, including one from the U.S. Trade Development Agency (USTDA), in partnership with the U.S. Department of Energy’s National Renewable Energy Laboratory. REI has since expanded to six villages, providing power to more than 1,000 households and businesses. With a dedicated team of 16 people, the company operates in multiple regions of the country, each with unique terrain, languages, and cultural dynamics. “It wasn’t easy,” he acknowledges. “I’m not an academic person—I had to learn everything by doing. [Smart Village] helped me structure the project and grow as an entrepreneur.” Today, Numfor pays it forward by sharing his Smart Village experience and mentoring new entrepreneurs. Launching a coalition for smart metering Minigrids can’t operate efficiently without clarifying operating rules to ensure quality service requirements and consumer protection, while also enabling reliable and effective monitoring of the system, Numfor says. “We need to know how power is being used, detect problems early, and manage the minigrid from a distance,” he explains. Existing commercial smart-meter providers offer limited and proprietary solutions. One major provider left the market, making their technology infrastructure obsolete. “It’s risky for an entire sector to depend on a few companies for such a critical technology,” Numfor says. In 2025, with the help of the Smart Village technical community, Numfor convened a consortium of open-source power advocates, including the Africa Mini-Grid Developers Association, EnAccess, Energy IOT, and NESL. The goal was to develop an open smart metering system that is accessible, transparent, and sustainable for all energy providers. “These organizations are collaborating as Open Advanced Metering Infrastructure [OpenAMI], which is about giving control back to the people who deliver the energy,” he says. Scaling for impact Numfor’s passion has grown from bringing light to local rural communities to bringing light to his entire country. Just 54 percent of Cameroon’s citizens have access to electricity, according to the International Energy Agency. For Numfor, the challenge is not just technological—it’s social and economic as well. “Electricity is the most important enabler of education and economic growth today,” he says. “When you have power, you unlock everything else.” “Electricity changed my life. Now I want to make sure every child can grow up with that same light.” —Jude Numfor Across the villages where REI has installed sustainable electricity solutions, small businesses are flourishing. Barbershops hum with community chatter, food vendors can preserve perishables, and entrepreneurs run companies such as phone-charging stations and small mills. “Some villages even have laundromats now,” Numfor says proudly. “Electricity creates jobs and changes mindsets.” Still, it has been a bumpy journey. It wasn’t until 2025 that REI obtained its official authorization (license) from Cameroon’s government to produce and distribute electricity in off-grid areas using solar minigrids. This was a major milestone because REI is one of the first private enterprises in the country to receive such authorization. “We were stuck between pilot projects and growth,” he explains. “Our projects were successful, and there was community demand for more, but to grow, we needed investors who require legal guarantees before committing funds. Now we can scale up and attract investors.” REI plans to expand its reach dramatically, beginning with 134 new villages identified through a feasibility study supported by the USTDA. Their long-term goal is to electrify 760 villages across Cameroon by 2031. While authorization opens doors, financing remains one of REI’s biggest challenges. “The minigrid space doesn’t attract venture capitalists easily,” Numfor notes. “Our return on investment is under 15 percent, so it’s not a typical tech startup model. The real return here is the impact” on the community. He hopes to attract investors who understand that access to electricity drives education, health care, and entrepreneurship. “There are people out there who want to make meaningful change,” he says. “We just need to connect with them. When you electrify a village, you never know who the next innovator will be. Maybe it’s another kid like me, looking through a window, dreaming.” Finding skilled staff is another challenge, Numfor says. To address this, REI developed an intensive recruitment and training process. “It used to take years to find the right people,” he says. “Now, we can identify who fits our company culture within six months.” Numfor’s wife, Angela Taliklong, who joined the venture in 2010, now oversees administration and human resources. A brighter Cameroon and beyond Numfor offers simple words of advice to other impact-driven entrepreneurs: Keep moving. “One of my mistakes early on was trying to be perfect,” he says. “I was spending time improving prototypes instead of increasing the number of our project installations and scaling how many communities we could electrify. You must keep momentum. Don’t wait until everything is perfect before you move forward.” That mindset, rooted in resilience and experimentation, has defined his journey. Rajan Kapur, president of Smart Village, says Numfor is a “shining example” of the program’s vision: “scalable and enduring impact through local entrepreneurs, local procurement, and community engagement based on the use of IEEE technology in underserved communities.” With the ongoing Smart Village partnership, Numfor is determined to bring light and opportunity to every corner of Cameroon, and beyond. He already has launched REI Nigeria. “Electricity changed my life,” he says. “Now I want to make sure every child can grow up with that same light.”

IEEE Spectrum

중도 성향

IT/기술

Andrew Ng: Unbiggen AI

Andrew Ng has serious street cred in artificial intelligence. He pioneered the use of graphics processing units (GPUs) to train deep learning models in the late 2000s with his students at Stanford University, cofounded Google Brain in 2011, and then served for three years as chief scientist for Baidu, where he helped build the Chinese tech giant’s AI group. So when he says he has identified the next big shift in artificial intelligence, people listen. And that’s what he told IEEE Spectrum in an exclusive Q&A. Ng’s current efforts are focused on his company Landing AI, which built a platform called LandingLens to help manufacturers improve visual inspection with computer vision. He has also become something of an evangelist for what he calls the data-centric AI movement, which he says can yield “small data” solutions to big issues in AI, including model efficiency, accuracy, and bias. Andrew Ng on... What’s next for really big models The career advice he didn’t listen to Defining the data-centric AI movement Synthetic data Why Landing AI asks its customers to do the work The great advances in deep learning over the past decade or so have been powered by ever-bigger models crunching ever-bigger amounts of data. Some people argue that that’s an unsustainable trajectory. Do you agree that it can’t go on that way? Andrew Ng: This is a big question. We’ve seen foundation models in NLP [natural language processing]. I’m excited about NLP models getting even bigger, and also about the potential of building foundation models in computer vision. I think there’s lots of signal to still be exploited in video: We have not been able to build foundation models yet for video because of compute bandwidth and the cost of processing video, as opposed to tokenized text. So I think that this engine of scaling up deep learning algorithms, which has been running for something like 15 years now, still has steam in it. Having said that, it only applies to certain problems, and there’s a set of other problems that need small data solutions. When you say you want a foundation model for computer vision, what do you mean by that? Ng: This is a term coined by Percy Liang and some of my friends at Stanford to refer to very large models, trained on very large data sets, that can be tuned for specific applications. For example, GPT-3 is an example of a foundation model [for NLP]. Foundation models offer a lot of promise as a new paradigm in developing machine learning applications, but also challenges in terms of making sure that they’re reasonably fair and free from bias, especially if many of us will be building on top of them. What needs to happen for someone to build a foundation model for video? Ng: I think there is a scalability problem. The compute power needed to process the large volume of images for video is significant, and I think that’s why foundation models have arisen first in NLP. Many researchers are working on this, and I think we’re seeing early signs of such models being developed in computer vision. But I’m confident that if a semiconductor maker gave us 10 times more processor power, we could easily find 10 times more video to build such models for vision. Having said that, a lot of what’s happened over the past decade is that deep learning has happened in consumer-facing companies that have large user bases, sometimes billions of users, and therefore very large data sets. While that paradigm of machine learning has driven a lot of economic value in consumer software, I find that that recipe of scale doesn’t work for other industries. Back to top It’s funny to hear you say that, because your early work was at a consumer-facing company with millions of users. Ng: Over a decade ago, when I proposed starting the Google Brain project to use Google’s compute infrastructure to build very large neural networks, it was a controversial step. One very senior person pulled me aside and warned me that starting Google Brain would be bad for my career. I think he felt that the action couldn’t just be in scaling up, and that I should instead focus on architecture innovation. “In many industries where giant data sets simply don’t exist, I think the focus has to shift from big data to good data. Having 50 thoughtfully engineered examples can be sufficient to explain to the neural network what you want it to learn.” —Andrew Ng, CEO & Founder, Landing AI I remember when my students and I published the first NeurIPS workshop paper advocating using CUDA, a platform for processing on GPUs, for deep learning—a different senior person in AI sat me down and said, “CUDA is really complicated to program. As a programming paradigm, this seems like too much work.” I did manage to convince him; the other person I did not convince. I expect they’re both convinced now. Ng: I think so, yes. Over the past year as I’ve been speaking to people about the data-centric AI movement, I’ve been getting flashbacks to when I was speaking to people about deep learning and scalability 10 or 15 years ago. In the past year, I’ve been getting the same mix of “there’s nothing new here” and “this seems like the wrong direction.” Back to top How do you define data-centric AI, and why do you consider it a movement? Ng: Data-centric AI is the discipline of systematically engineering the data needed to successfully build an AI system. For an AI system, you have to implement some algorithm, say a neural network, in code and then train it on your data set. The dominant paradigm over the last decade was to download the data set while you focus on improving the code. Thanks to that paradigm, over the last decade deep learning networks have improved significantly, to the point where for a lot of applications the code—the neural network architecture—is basically a solved problem. So for many practical applications, it’s now more productive to hold the neural network architecture fixed, and instead find ways to improve the data. When I started speaking about this, there were many practitioners who, completely appropriately, raised their hands and said, “Yes, we’ve been doing this for 20 years.” This is the time to take the things that some individuals have been doing intuitively and make it a systematic engineering discipline. The data-centric AI movement is much bigger than one company or group of researchers. My collaborators and I organized a data-centric AI workshop at NeurIPS, and I was really delighted at the number of authors and presenters that showed up. You often talk about companies or institutions that have only a small amount of data to work with. How can data-centric AI help them? Ng: You hear a lot about vision systems built with millions of images—I once built a face recognition system using 350 million images. Architectures built for hundreds of millions of images don’t work with only 50 images. But it turns out, if you have 50 really good examples, you can build something valuable, like a defect-inspection system. In many industries where giant data sets simply don’t exist, I think the focus has to shift from big data to good data. Having 50 thoughtfully engineered examples can be sufficient to explain to the neural network what you want it to learn. When you talk about training a model with just 50 images, does that really mean you’re taking an existing model that was trained on a very large data set and fine-tuning it? Or do you mean a brand new model that’s designed to learn only from that small data set? Ng: Let me describe what Landing AI does. When doing visual inspection for manufacturers, we often use our own flavor of RetinaNet. It is a pretrained model. Having said that, the pretraining is a small piece of the puzzle. What’s a bigger piece of the puzzle is providing tools that enable the manufacturer to pick the right set of images [to use for fine-tuning] and label them in a consistent way. There’s a very practical problem we’ve seen spanning vision, NLP, and speech, where even human annotators don’t agree on the appropriate label. For big data applications, the common response has been: If the data is noisy, let’s just get a lot of data and the algorithm will average over it. But if you can develop tools that flag where the data’s inconsistent and give you a very targeted way to improve the consistency of the data, that turns out to be a more efficient way to get a high-performing system. “Collecting more data often helps, but if you try to collect more data for everything, that can be a very expensive activity.” —Andrew Ng For example, if you have 10,000 images where 30 images are of one class, and those 30 images are labeled inconsistently, one of the things we do is build tools to draw your attention to the subset of data that’s inconsistent. So you can very quickly relabel those images to be more consistent, and this leads to improvement in performance. Could this focus on high-quality data help with bias in data sets? If you’re able to curate the data more before training? Ng: Very much so. Many researchers have pointed out that biased data is one factor among many leading to biased systems. There have been many thoughtful efforts to engineer the data. At the NeurIPS workshop, Olga Russakovsky gave a really nice talk on this. At the main NeurIPS conference, I also really enjoyed Mary Gray’s presentation, which touched on how data-centric AI is one piece of the solution, but not the entire solution. New tools like Datasheets for Datasets also seem like an important piece of the puzzle. One of the powerful tools that data-centric AI gives us is the ability to engineer a subset of the data. Imagine training a machine-learning system and finding that its performance is okay for most of the data set, but its performance is biased for just a subset of the data. If you try to change the whole neural network architecture to improve the performance on just that subset, it’s quite difficult. But if you can engineer a subset of the data you can address the problem in a much more targeted way. When you talk about engineering the data, what do you mean exactly? Ng: In AI, data cleaning is important, but the way the data has been cleaned has often been in very manual ways. In computer vision, someone may visualize images through a Jupyter notebook and maybe spot the problem, and maybe fix it. But I’m excited about tools that allow you to have a very large data set, tools that draw your attention quickly and efficiently to the subset of data where, say, the labels are noisy. Or to quickly bring your attention to the one class among 100 classes where it would benefit you to collect more data. Collecting more data often helps, but if you try to collect more data for everything, that can be a very expensive activity. For example, I once figured out that a speech-recognition system was performing poorly when there was car noise in the background. Knowing that allowed me to collect more data with car noise in the background, rather than trying to collect more data for everything, which would have been expensive and slow. Back to top What about using synthetic data, is that often a good solution? Ng: I think synthetic data is an important tool in the tool chest of data-centric AI. At the NeurIPS workshop, Anima Anandkumar gave a great talk that touched on synthetic data. I think there are important uses of synthetic data that go beyond just being a preprocessing step for increasing the data set for a learning algorithm. I’d love to see more tools to let developers use synthetic data generation as part of the closed loop of iterative machine learning development. Do you mean that synthetic data would allow you to try the model on more data sets? Ng: Not really. Here’s an example. Let’s say you’re trying to detect defects in a smartphone casing. There are many different types of defects on smartphones. It could be a scratch, a dent, pit marks, discoloration of the material, other types of blemishes. If you train the model and then find through error analysis that it’s doing well overall but it’s performing poorly on pit marks, then synthetic data generation allows you to address the problem in a more targeted way. You could generate more data just for the pit-mark category. “In the consumer software Internet, we could train a handful of machine-learning models to serve a billion users. In manufacturing, you might have 10,000 manufacturers building 10,000 custom AI models.” —Andrew Ng Synthetic data generation is a very powerful tool, but there are many simpler tools that I will often try first. Such as data augmentation, improving labeling consistency, or just asking a factory to collect more data. Back to top To make these issues more concrete, can you walk me through an example? When a company approaches Landing AI and says it has a problem with visual inspection, how do you onboard them and work toward deployment? Ng: When a customer approaches us we usually have a conversation about their inspection problem and look at a few images to verify that the problem is feasible with computer vision. Assuming it is, we ask them to upload the data to the LandingLens platform. We often advise them on the methodology of data-centric AI and help them label the data. One of the foci of Landing AI is to empower manufacturing companies to do the machine learning work themselves. A lot of our work is making sure the software is fast and easy to use. Through the iterative process of machine learning development, we advise customers on things like how to train models on the platform, when and how to improve the labeling of data so the performance of the model improves. Our training and software supports them all the way through deploying the trained model to an edge device in the factory. How do you deal with changing needs? If products change or lighting conditions change in the factory, can the model keep up? Ng: It varies by manufacturer. There is data drift in many contexts. But there are some manufacturers that have been running the same manufacturing line for 20 years now with few changes, so they don’t expect changes in the next five years. Those stable environments make things easier. For other manufacturers, we provide tools to flag when there’s a significant data-drift issue. I find it really important to empower manufacturing customers to correct data, retrain, and update the model. Because if something changes and it’s 3 a.m. in the United States, I want them to be able to adapt their learning algorithm right away to maintain operations. In the consumer software Internet, we could train a handful of machine-learning models to serve a billion users. In manufacturing, you might have 10,000 manufacturers building 10,000 custom AI models. The challenge is, how do you do that without Landing AI having to hire 10,000 machine learning specialists? So you’re saying that to make it scale, you have to empower customers to do a lot of the training and other work. Ng: Yes, exactly! This is an industry-wide problem in AI, not just in manufacturing. Look at health care. Every hospital has its own slightly different format for electronic health records. How can every hospital train its own custom AI model? Expecting every hospital’s IT personnel to invent new neural-network architectures is unrealistic. The only way out of this dilemma is to build tools that empower the customers to build their own models by giving them tools to engineer the data and express their domain knowledge. That’s what Landing AI is executing in computer vision, and the field of AI needs other teams to execute this in other domains. Is there anything else you think it’s important for people to understand about the work you’re doing or the data-centric AI movement? Ng: In the last decade, the biggest shift in AI was a shift to deep learning. I think it’s quite possible that in this decade the biggest shift will be to data-centric AI. With the maturity of today’s neural network architectures, I think for a lot of the practical applications the bottleneck will be whether we can efficiently get the data we need to develop systems that work well. The data-centric AI movement has tremendous energy and momentum across the whole community. I hope more researchers and developers will jump in and work on it. Back to top This article appears in the April 2022 print issue as “Andrew Ng, AI Minimalist.”

뉴스

타임라인 키워드

xAI Asks Court to Strip Alleged Grok Deepfake Nudes Victims of Anonymity

Microsoft and Mayo Clinic are teaming up to build a healthcare AI model

Hackers Are Using the Same Conversational Tricks on AI that Con Artists Use on People

Report: Tech Company Accidentally Spends $500 Million on Anthropic's Claude AI in Single Month

Motorola says affiliate hijacking of Amazon app was ‘unintended’

Accelerating Chipmaking Innovation for the Energy-Efficient AI Era

IEEE Smart Village Is Helping to Electrify Rural Cameroon

Andrew Ng: Unbiggen AI