12 docs tagged with "machine-learning"

Adopt serverless architecture for AI/ML workload processes

Building an ML model takes significant computing resources that need to be optimized for efficient utilization.

Leverage pre-trained models and transfer learning

Fine-tune existing pre-trained models instead of training from scratch to dramatically reduce the compute, energy, and time required for model development.

Optimize agent orchestration to reduce unnecessary model calls

Design agentic AI workflows to minimise redundant model invocations and unnecessary compute through caching, conditional logic, and efficient orchestration patterns.

Optimize data storage formats for AI training and inference

Use efficient storage formats, compression, and indexing strategies for AI datasets and embeddings to reduce storage footprint, data transfer, and retrieval compute.

Run AI models at the edge

Deploy AI inference on edge devices or local infrastructure to reduce data transfer, network energy use, and reliance on centralised cloud compute.

Select a more energy efficient AI/ML framework

Training an AI model implies a significant carbon footprint. The underlying framework used for the development, training, and deployment of AI/ML needs to be evaluated and considered to ensure the process is as energy efficient as possible.

Select efficient accelerators and instance types for AI workloads

Match AI workloads to the most energy-efficient hardware accelerator or instance type to improve utilisation and reduce energy consumption per inference or training run.

Select efficient ML frameworks and inference runtimes

Choose ML frameworks and inference runtimes that best match your hardware and workload to reduce compute overhead and improve energy efficiency across training and production inference.

Use carbon-aware scheduling and region selection for AI workloads

Reduce the carbon impact of AI workloads by running them in cloud regions with lower grid carbon intensity and scheduling deferrable jobs during periods of high renewable energy availability.

Use energy efficient AI/ML models

Evaluate and use alternative, more energy efficient, models that provide similar functionality.

Use on-demand execution for AI and agent workloads

Trigger AI and agent workloads only when needed using serverless or event-driven platforms to eliminate idle compute and reduce unnecessary energy consumption.

Use right-sized and energy-efficient AI models

Select and optimize AI models that are appropriately sized for the task to reduce compute, memory, and energy consumption during training and inference.