Want to customize a powerful AI model like Gemma-7B without getting lost in lines of code? Fine-tuning large language models (LLMs) like Gemma-7B is no longer the preserve of research labs. The intuitive, no-code interface of TIR's Foundation Studio enables you to craft custom AI models efficiently with your proprietary data, leveraging the simplicity of LoRA (Low-Rank Adaptation).
This guide will walk you through 10 simple steps to fine-tune your very own gemma-7b-it model using TIR. As examples, you could build a bespoke chatbot, an agentic customer support agent, or a self-paced Python tutor tailored to your specific needs. Let's get started!
Step 1: Fire Up the TIR Foundation Studio
Head over to the TIR Foundation Studio and log in. This is your command center, where you'll pick your model, upload data, configure training, and monitor progress. It's designed to be super user-friendly, so you'll feel right at home.
Step 2: Pick Your Preferred Model
From the list of available pre-trained models, select gemma-7b-it. The "-it" suffix marks the instruction-tuned variant, which makes it a strong choice for fine-tuning with smaller datasets using LoRA, since it is already optimized for following instructions.
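If you're curious what happens behind the scenes, a platform like TIR pulls the base model weights from Hugging Face. Here's a minimal sketch of the equivalent step in plain Python, assuming the transformers library and access to the gated google/gemma-7b-it repository (TIR handles all of this for you):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# google/gemma-7b-it is a gated model: accept the license on Hugging Face
# and authenticate (see Steps 4-5) before the download will succeed.
model_id = "google/gemma-7b-it"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" (requires the accelerate package) spreads the weights
# across available GPUs automatically.
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
```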
Step 3: Choose Your Engine (Compute Plan)
Next, you'll need to select a compute instance that is up to the task. For optimal performance, we highly recommend:
- GPU: NVIDIA A100 80GB (a top-tier graphics processing unit built for AI)
- Memory: 115GB
Pick the plan that aligns best with your budget and how long you expect your training to run. Think of it as choosing the right engine for your custom AI.
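Why such a hefty card? A quick back-of-envelope estimate (our illustration, not an official TIR sizing guide) shows why LoRA pairs well with a single 80GB GPU:

```python
# Rough GPU memory estimate for a 7B-parameter model (illustrative only).
params = 7e9
weights_gb = params * 2 / 1e9            # 2 bytes/param in bfloat16
print(f"Base weights (bf16): ~{weights_gb:.0f} GB")        # ~14 GB

# Full fine-tuning with Adam also stores gradients (~2 bytes/param) and
# fp32 optimizer states (~8 bytes/param), on top of the weights.
full_ft_gb = params * (2 + 2 + 8) / 1e9
print(f"Full fine-tune (Adam, rough): ~{full_ft_gb:.0f} GB")  # ~84 GB

# LoRA freezes the base weights and trains only tiny adapter matrices,
# so the optimizer overhead nearly vanishes and an 80 GB A100 is comfortable.
```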
Step 4: Enable Training & Connect to Hugging Face
Time to get things moving! Click on “Train the Base Model”. You'll be prompted to authenticate with your Hugging Face access token. If you haven't linked your account yet, don't worry – just click “Create New” to generate and add a token. This token is crucial for pulling the base model weights (the AI's learned knowledge) and, if you choose, pushing your fine-tuned model back to your Hugging Face repository.
Step 5: Check Your Hugging Face Access
Before proceeding, take a quick moment to ensure your token is valid and that you have the necessary permissions for the model you're fine-tuning. A quick check here can save you headaches down the line!
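If you prefer to double-check outside the UI, here's a quick sketch using the official huggingface_hub library (the token value is a placeholder for your own):

```python
from huggingface_hub import HfApi

api = HfApi(token="hf_...")  # paste the token you generated in Step 4

# whoami() raises an error if the token is invalid or expired.
user = api.whoami()
print("Authenticated as:", user["name"])

# model_info() raises an error if you haven't been granted access
# to the gated google/gemma-7b-it repository.
api.model_info("google/gemma-7b-it")
print("Access to gemma-7b-it confirmed.")
```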
Step 6: Upload Your Data (Training & Optional Evaluation)
Now for the memory of your custom model: your dataset! Prepare your data in the format below. For instance, if you're building a question-answering model, a JSON array of objects like this works perfectly:
[
  {
    "question": "What happens if an exception is not caught?",
    "answer": "The program crashes."
  }
]
Simply upload your file directly through the UI. You can also add an evaluation dataset to monitor the quality of your training as it progresses. For the Python tutor example, you can use the python_code_instructions_18k_alpaca dataset as-is, with no additional preprocessing.
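Before uploading, a quick local sanity check can save you a failed training run. A minimal sketch (the filename is illustrative):

```python
import json

# Hypothetical filename; point this at your own training file.
with open("qa_train.json") as f:
    records = json.load(f)

# Every record should carry exactly the fields the task expects.
for i, rec in enumerate(records):
    assert "question" in rec and "answer" in rec, f"record {i} missing a field"
    assert rec["question"].strip() and rec["answer"].strip(), f"record {i} is empty"

print(f"{len(records)} records look well-formed.")
```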
Step 7: Tell Your Model What to Do (Define the Task Type)
From the available options, select the specific task for which you're fine-tuning your model. In our example, you'd choose “Question Answering”. This helps TIR apply the correct training head and preprocessing logic, making sure your model learns exactly what you want it to do.
Step 8: Fine-Tune Your Training Parameters
This is where you customize the learning process. You'll want to configure these settings:
Training Type: Select Parameter-Efficient Fine-Tuning (PEFT). This is highly recommended because it significantly optimizes GPU usage and training time. Instead of updating the entire massive model, LoRA introduces small, trainable "adapter" layers, making the process much more efficient.
Epochs: Set this based on your dataset size (e.g., 3–10 epochs is a good starting point). An epoch means the model has seen your entire training dataset once.
Learning Rate: The default values are often sufficient, but you can tweak them manually if you're feeling adventurous. This controls how big of a "step" the model takes when adjusting its internal parameters during training.
LoRA Bias: This controls whether the model's bias terms are updated alongside the LoRA adapters. LoRA already keeps the full weight matrices frozen and trains only small adapter layers; the bias setting decides whether the tiny bias vectors of the adapted layers get trained too. Keeping this minimal makes training even more lightweight and faster, with minimal impact on performance, which is ideal when you're working with limited GPU resources or want quicker results.
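Under the hood, these choices map onto a LoRA configuration much like the one exposed by the open-source peft library. A minimal sketch, where the rank and alpha values are illustrative defaults rather than TIR's:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("google/gemma-7b-it")

lora_config = LoraConfig(
    r=8,                    # rank of the low-rank adapter matrices
    lora_alpha=16,          # scaling factor applied to the adapter output
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    bias="lora_only",       # train only the biases of adapted layers
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # a tiny fraction of the 7B total
```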
Step 9 (Optional): Unlock Advanced Settings
For those who want more control, the Advanced Configurations offer deeper customization:
Quantization Type: A technique to make the model smaller and faster by using fewer bits to represent its data.
Batch Size: The number of examples the model processes at once.
Gradient Accumulation Steps: A way to simulate larger batch sizes if your hardware has memory limits.
Checkpoint Saving Frequency: How often the system saves your model's progress during training.
Weights & Biases (WandB) Tracking: A powerful tool for monitoring and visualizing your training metrics in real time.
Debugging Flags: Specific settings to help diagnose any issues during training.
These options give you fine-grained control over performance and how you monitor your model's training journey.
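As a point of reference, the same knobs appear in the open-source transformers trainer; quantization is configured separately when loading the model (e.g., via bitsandbytes). A sketch of how they typically map, with values that are illustrative rather than TIR defaults:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="gemma-7b-it-finetune",   # hypothetical output path
    num_train_epochs=3,
    per_device_train_batch_size=4,       # batch size
    gradient_accumulation_steps=8,       # effective batch size of 32
    learning_rate=2e-4,
    save_steps=200,                      # checkpoint saving frequency
    report_to="wandb",                   # stream metrics to Weights & Biases
    bf16=True,                           # mixed precision on A100-class GPUs
)
```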
Step 10: Review and Launch Your Model
You're almost there! Before you hit that launch button:
- Review all your selections in the summary pane.
- Confirm your compute instance, chosen model, datasets, and hyperparameters.
- Once you're satisfied, click Launch to kick off your training job!
You will see real-time logs, metrics, and model events directly in your dashboard. Once training is complete, your new fine-tuned model will be ready for download or deployment.
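If you download the LoRA adapter rather than a merged model, loading it for inference is straightforward with the peft library. A minimal sketch, where "my-gemma-python-tutor" is a placeholder for your own adapter directory or Hugging Face repo:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("google/gemma-7b-it", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")

# Attach the fine-tuned LoRA adapter on top of the frozen base weights.
model = PeftModel.from_pretrained(base, "my-gemma-python-tutor")

prompt = "Explain what a Python list comprehension is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```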
No-Code Model Fine-Tuning Made Easy with E2E Cloud’s TIR Platform
TIR’s no-code fine-tuning interface makes it easy for anyone to unlock the full potential of AI. Fine-tuning models like Gemma-7B becomes a straightforward process, enabling customers to adapt powerful language models to their specific domains. By leveraging LoRA for efficient parameter tuning, you can build high-impact, domain-specific models ready for real-world deployment, even when working with smaller datasets. This allows your business to deliver tailored AI capabilities quickly and cost-effectively.
Ready to build your custom AI? Begin your journey with E2E TIR and unlock the potential of language models!