
What is GenAI’s true environmental impact? MIT explains

In a new study, MIT researchers explain exactly what makes generative AI so resource-intensive.

24/01/2025 16:58:55

The excitement surrounding the potential benefits of generative AI, from improving worker productivity to advancing scientific research, is hard to ignore.

While the explosive growth of this new technology has enabled the rapid deployment of powerful models in many industries, the environmental consequences of this generative AI “gold rush” remain difficult to pin down, let alone mitigate.

The computational power required to train generative AI models that often have billions of parameters, such as OpenAI’s GPT-4, can demand a staggering amount of electricity, which leads to increased carbon dioxide emissions and pressures on the electric grid.

Furthermore, deploying these models in real-world applications, enabling millions to use generative AI in their daily lives, and then fine-tuning the models to improve their performance draws large amounts of energy long after a model has been developed.

Beyond electricity demands, a great deal of water is needed to cool the hardware used for training, deploying, and fine-tuning generative AI models, which can strain municipal water supplies and disrupt local ecosystems.

The increasing number of generative AI applications has also spurred demand for high-performance computing hardware, adding indirect environmental impacts from its manufacture and transport.

“When we think about the environmental impact of generative AI, it is not just the electricity you consume when you plug the computer in,” says Elsa A. Olivetti, Professor in the Department of Materials Science and Engineering and the lead of the Decarbonization Mission of MIT’s new Climate Project.

“There are much broader consequences that go out to a system level and persist based on actions that we take.”

Demanding data centres
The electricity demands of data centres are one major factor contributing to the environmental impacts of generative AI, since data centres are used to train and run the deep learning models behind popular tools like ChatGPT and DALL-E.

A data centre is a temperature-controlled building that houses computing infrastructure, such as servers, data storage drives, and network equipment.

For instance, Amazon has more than 100 data centres worldwide, each of which has about 50,000 servers that the company uses to support cloud computing services.

While data centres have been around since the 1940s (the first was built at the University of Pennsylvania in 1945 to support the first general-purpose digital computer, the ENIAC), the rise of generative AI has dramatically increased the pace of data centre construction.

“What is different about generative AI is the power density it requires,” says Noman Bashir, lead author of the impact paper.

“Fundamentally, it is just computing, but a generative AI training cluster might consume seven or eight times more energy than a typical computing workload.”

Scientists have estimated that the power requirements of data centres in North America increased from 2,688MW at the end of 2022 to 5,341MW at the end of 2023, partly driven by the demands of generative AI.

Globally, the electricity consumption of data centres rose to 460TWh in 2022.

This would have made data centres the 11th-largest electricity consumer in the world, between the nations of Saudi Arabia (371TWh) and France (463TWh), according to the Organization for Economic Co-operation and Development.

By 2026, the electricity consumption of data centres is expected to approach 1,050TWh (which would bump data centres up to fifth place on the global list, between Japan and Russia).
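
To put that projection in growth terms, the short sketch below works out how much larger the 2026 figure is than the 2022 figure, and the yearly growth rate it would imply. It assumes the two figures are directly comparable annual totals and that growth is smooth from year to year, which is a simplification and not a claim made in the study. The function and variable names are illustrative only.

```python
def growth_factor(start: float, end: float) -> float:
    """Return how many times larger `end` is than `start`."""
    return end / start


def annual_growth_rate(start: float, end: float, years: int) -> float:
    """Compound annual growth rate implied by moving from start to end."""
    return (end / start) ** (1 / years) - 1


# Figures quoted in the article for global data centre electricity consumption.
consumption_2022 = 460.0
consumption_2026 = 1050.0

factor = growth_factor(consumption_2022, consumption_2026)
# Assumption: smooth compound growth over the four years from 2022 to 2026.
rate = annual_growth_rate(consumption_2022, consumption_2026, years=2026 - 2022)

print(f"Growth factor, 2022 to 2026: {factor:.2f}x")
print(f"Implied compound annual growth: {rate:.1%}")
```

On these figures, consumption would rise roughly 2.3-fold in four years, or by about 23 percent a year if the increase were spread evenly.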

While not all data centre computation involves generative AI, the technology has been a major driver of increasing energy demands.

“The demand for new data centres cannot be met in a sustainable way. The pace at which companies are building new data centres means the bulk of the electricity to power them must come from fossil fuel-based power plants,” says Bashir.

The power needed to train and deploy a model like OpenAI’s GPT-3 is difficult to ascertain.

In a 2021 research paper, scientists from Google and the University of California at Berkeley estimated the training process alone consumed 1,287MWh of electricity (enough to power about 120 average US homes for a year), generating about 552 tons of carbon dioxide.
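
The figures in that estimate can be cross-checked with simple arithmetic. The sketch below derives the household consumption and the carbon intensity they imply. It assumes that "tons" means metric tonnes of 1,000kg, which the article does not specify, and the variable names are illustrative.

```python
# Figures quoted in the article for training GPT-3.
TRAINING_ENERGY_MWH = 1287.0      # estimated electricity used in training
HOMES_POWERED_FOR_A_YEAR = 120    # "about 120 average US homes for a year"
TRAINING_EMISSIONS_TONS = 552.0   # carbon dioxide generated

# Assumption: one ton is a metric tonne (1,000 kg); the article does not say.
KG_PER_TON = 1000.0
KWH_PER_MWH = 1000.0

# Implied yearly electricity use of one average US home.
mwh_per_home_per_year = TRAINING_ENERGY_MWH / HOMES_POWERED_FOR_A_YEAR

# Implied carbon intensity of the electricity used for training.
kg_co2_per_kwh = (TRAINING_EMISSIONS_TONS * KG_PER_TON) / (
    TRAINING_ENERGY_MWH * KWH_PER_MWH
)

print(f"Implied use per home: {mwh_per_home_per_year:.1f} MWh per year")
print(f"Implied carbon intensity: {kg_co2_per_kwh:.2f} kg CO2 per kWh")
```

The quoted numbers work out to a little under 11MWh per home per year and roughly 0.43kg of carbon dioxide per kilowatt hour.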

While all machine learning models must be trained, one issue unique to generative AI is the rapid fluctuations in energy use that occur over different phases of the training process, Bashir explains.

Power grid operators must have a way to absorb those fluctuations to protect the grid, and they usually employ diesel-based generators for that task.

Increasing impacts from inference
Once a generative AI model is trained, the energy demands don’t disappear.

Each time a model is used, perhaps by an individual asking ChatGPT to summarise an email, the computing hardware that performs those operations consumes energy. Researchers have estimated that a ChatGPT query consumes about five times more electricity than a simple web search.

“But an everyday user doesn’t think too much about that,” says Bashir.

“The ease-of-use of generative AI interfaces and the lack of information about the environmental impacts of my actions means that, as a user, I don’t have much incentive to cut back on my use of generative AI.”

With traditional AI, energy usage is split fairly evenly between data processing, model training, and inference, which is the process of using a trained model to make predictions on new data.

However, Bashir expects the electricity demands of generative AI inference eventually to dominate since these models are becoming ubiquitous in so many applications, and the electricity needed for inference will increase as future versions of the models become larger and more complex.

Plus, generative AI models have an especially short shelf-life, driven by rising demand for new AI applications.

Companies release new models every few weeks, so the energy used to train prior versions goes to waste, Bashir adds.

New models often consume more energy for training, since they usually have more parameters than their predecessors.

While the electricity demands of data centres may be getting the most attention in the research literature, the water these facilities consume has environmental impacts as well.

Chilled water is used to cool a data centre by absorbing heat from computing equipment. It has been estimated that, for each kilowatt hour of energy a data centre consumes, it would need two litres of water for cooling, says Bashir.
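
As a rough illustration of what that ratio means at scale, the sketch below applies the two-litres-per-kilowatt-hour estimate to an energy figure. The example input is the 1,287MWh estimated for training GPT-3. Combining the two figures is an assumption made here purely for illustration, not a result reported by the researchers, and the function name is hypothetical.

```python
# Estimate quoted in the article: litres of cooling water per kWh consumed.
LITRES_PER_KWH = 2.0


def cooling_water_litres(energy_kwh: float,
                         litres_per_kwh: float = LITRES_PER_KWH) -> float:
    """Estimate cooling water needed for a given amount of energy in kWh."""
    if energy_kwh < 0:
        raise ValueError("energy_kwh must be non-negative")
    return energy_kwh * litres_per_kwh


# Illustrative input only: the GPT-3 training estimate quoted in the article.
example_energy_mwh = 1287.0
example_litres = cooling_water_litres(example_energy_mwh * 1000)  # MWh to kWh

print(f"Estimated cooling water: {example_litres:,.0f} litres")
```

Under those assumptions, the training run alone would correspond to about 2.6 million litres of cooling water. The real figure would depend on the cooling system and local climate of the facility.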

“Just because this is called ‘cloud computing’ doesn’t mean the hardware lives in the cloud.

“Data centres are present in our physical world, and because of their water usage they have direct and indirect implications for biodiversity,” he says.

The computing hardware inside data centres brings its own, less direct environmental impacts.

While it is difficult to estimate how much power is needed to manufacture a GPU, a type of powerful processor that can handle intensive generative AI workloads, it would be more than what is needed to produce a simpler CPU because the fabrication process is more complex.

A GPU’s carbon footprint is compounded by the emissions related to material and product transport.

There are also environmental implications of obtaining the raw materials used to fabricate GPUs, which can involve dirty mining procedures and the use of toxic chemicals for processing.

Market research firm TechInsights estimates that the three major producers (NVIDIA, AMD, and Intel) shipped 3.85 million GPUs to data centres in 2023, up from about 2.67 million in 2022. That number is expected to have increased by an even greater percentage in 2024.

The industry is on an unsustainable path, but there are ways to encourage responsible development of generative AI that supports environmental objectives, Bashir says.

He, Olivetti, and their MIT colleagues argue that this will require a comprehensive consideration of all the environmental and societal costs of generative AI, as well as a detailed assessment of the value in its perceived benefits.

“We need a more contextual way of systematically and comprehensively understanding the implications of new developments in this space,” Olivetti says.

“Due to the speed at which there have been improvements, we haven’t had a chance to catch up with our abilities to measure and understand the trade-offs.”

Written By Sophia Bell

Sophia Bell is the Group Editor of DPA, Connectivity, and PBSI. She joined the team in 2020 as Assistant Editor after achieving a First-Class Honours BA in English Literature and Film Studies from the University of East Anglia. In her current role, she leads the editorial strategy, driving digital growth and ensuring each title continues to deliver trusted, relevant insight for engineering and manufacturing professionals.