Amazon Redshift

This documentation describes the integration of MindsDB with Amazon Redshift, a fully managed, petabyte-scale data warehouse service in the cloud. You can start with just a few hundred gigabytes of data and scale to a petabyte or more, enabling you to use your data to acquire new insights for your business and customers.

Prerequisites

Before proceeding, ensure the following prerequisites are met:

Install MindsDB locally via Docker or Docker Desktop.
To connect Redshift to MindsDB, install the required dependencies following this instruction.

Connection

Establish a connection to your Redshift database from MindsDB by executing the following SQL command:

CREATE DATABASE redshift_datasource
WITH
  engine = 'redshift',
  parameters = {
    "host": "examplecluster.abc123xyz789.us-west-1.redshift.amazonaws.com",
    "port": 5439,
    "database": "example_db",
    "user": "awsuser",
    "password": "my_password"
  };

Required connection parameters include the following:

host: The host name or IP address of the Redshift cluster.
port: The port to use when connecting with the Redshift cluster.
database: The database name to use when connecting with the Redshift cluster.
user: The username to authenticate the user with the Redshift cluster.
password: The password to authenticate the user with the Redshift cluster.

Optional connection parameters include the following:

schema: The database schema to use. Default is public.
sslmode: The SSL mode for the connection.

Usage

Retrieve data from a specified table by providing the integration name, schema, and table name:

SELECT *
FROM redshift_datasource.schema_name.table_name
LIMIT 10;

Run Amazon Redshift SQL queries directly on the connected Redshift database:

SELECT * FROM redshift_datasource (

    --Native Query Goes Here
    WITH VENUECOPY AS (SELECT * FROM VENUE)
      SELECT * FROM VENUECOPY ORDER BY 1 LIMIT 10;

);

The above examples utilize redshift_datasource as the datasource name, which is defined in the CREATE DATABASE command.

Troubleshooting Guide

Database Connection Error

Symptoms: Failure to connect MindsDB with the Amazon Redshift cluster.
Checklist:
1. Make sure the Redshift cluster is active.
2. Confirm that host, port, user, password and database are correct. Try a direct Redshift connection using a client like DBeaver.
3. Ensure that the security settings of the Redshift cluster allow connections from MindsDB.
4. Ensure a stable network between MindsDB and Redshift.

SQL statement cannot be parsed by mindsdb_sql

Symptoms: SQL queries failing or not recognizing table names containing spaces or special characters.
Checklist:
1. Ensure table names with spaces or special characters are enclosed in backticks.
2. Examples:
  - Incorrect: SELECT * FROM integration.travel data
  - Incorrect: SELECT * FROM integration.‘travel data’
  - Correct: SELECT * FROM integration.`travel data`

This troubleshooting guide provided by AWS might also be helpful.

Overview

Data Sources

Connection

Data Catalog

Bring Your Own Models

Amazon Redshift

Prerequisites

Connection

Usage

Troubleshooting Guide

Overview

Data Sources

Connection

Data Catalog

Bring Your Own Models

​Prerequisites

​Connection

​Usage

​Troubleshooting Guide

Prerequisites

Connection

Usage

Troubleshooting Guide