Fine-tune a large language model (LLM) for multi-turn conversations and run it on a Text Generation Inference (TGI) server

This blog post covers the initial fine-tuning of large language models (LLMs) for multi-turn conversations and their deployment on a Text Generation Inference (TGI) server. Topics include use cases, data formats, training data preparation, server setup, and evaluation frameworks. The goal is to guide readers through fine-tuning and deploying LLMs.

April 25, 2024

Easy migration from org.json to Gson

This blog post discusses how to migrate from org.json to Gson. It provides code examples for both libraries and concludes that the migration requires minimal changes.

March 13, 2024

How to create a watsonx.ai REST client in Spring Boot?

This blog post demonstrates a Java Spring Boot implementation that invokes a watsonx.ai endpoint. It outlines the classes and steps involved, including building and sending requests, handling prompts, and extracting answers. The post also provides sample code for invoking the endpoint with RestTemplate. Overall, it offers a comprehensive guide to using watsonx.ai in a Spring Boot application.

March 4, 2024

Create an IBM Cloud IAM access token in your Spring Boot Java application

This blog post provides an example of obtaining an IBM Cloud access token using the IBM Cloud IAM REST API and Spring Boot. It includes a Java RestClient implementation for getting the access token and a REST endpoint invocation in a sample application.

March 4, 2024

CheatSheet: How to set up Java and Maven on macOS

This blog post provides a list of links to resources for setting up a Java development environment on macOS.

March 4, 2024

CheatSheet: How to add users to your watsonx project?

This cheat sheet provides a two-step guide for adding users to your watsonx project in IBM Cloud.

January 31, 2024

CheatSheet: Configure the Block Storage usage in Virtual Server Instances on IBM Cloud

This post introduces the use of Block Storage in Virtual Server Instances, particularly in relation to GPUs. It covers creating, formatting, mounting, and configuring the block storage disk. It also provides steps for mounting the storage permanently and for attaching existing block storage to a new Virtual Server Instance.

January 28, 2024

Unleash your creativity and design a custom visualization for the Shelly 3EM device with Grafana

The blog post details an example implementation of a connection server using Shelly 3EM, IBM Cloud Cloudant, and Grafana. It aims to store historical data for visualizing electricity consumption. The post describes the architecture and environment setup, and the project uses Python, FastAPI, Podman, and more. The setup covers Raspberry Pi, Podman Compose, and IBM Cloud Code Engine environments, with prerequisites and detailed configurations. The approach allows users to monitor and visualize power consumption efficiently and cost-effectively using Grafana.

January 21, 2024

CheatSheet: Run a PostgreSQL container with Podman and podman-compose

This brief article provides a step-by-step guide for setting up and running a PostgreSQL database container locally using Podman Desktop and podman-compose. It covers installation, configuration, and execution, along with additional notes on maintenance and troubleshooting.

January 20, 2024

How do you initially set up a Virtual Server Instance with a GPU in IBM Cloud?

Generative AI offers diverse business opportunities, and it often requires GPUs for compute-intensive workloads. IBM Cloud lets you provision a GPU-equipped Virtual Server Instance (VSI) in a Virtual Private Cloud (VPC) within minutes, with pay-per-usage billing. This guide covers VPC configuration, VSI setup with a GPU, SSH access, GPU accessibility in Ubuntu, and GPU verification in Python.

January 18, 2024

Category: cloud