Amazon Redshift
This documentation describes the integration of MindsDB with Amazon Redshift, a fully managed, petabyte-scale data warehouse service in the cloud. You can start with just a few hundred gigabytes of data and scale to a petabyte or more, enabling you to use your data to acquire new insights for your business and customers.
Prerequisites
Before proceeding, ensure the following prerequisites are met:
- Install MindsDB locally via Docker or Docker Desktop.
- To connect Redshift to MindsDB, install the required dependencies following this instruction.
Connection
Establish a connection to your Redshift database from MindsDB by executing the following SQL command:
CREATE DATABASE redshift_datasource
WITH
engine = 'redshift',
parameters = {
"host": "examplecluster.abc123xyz789.us-west-1.redshift.amazonaws.com",
"port": 5439,
"database": "example_db",
"user": "awsuser",
"password": "my_password"
};
Required connection parameters include the following:
host
: The host name or IP address of the Redshift cluster.port
: The port to use when connecting with the Redshift cluster.database
: The database name to use when connecting with the Redshift cluster.user
: The username to authenticate the user with the Redshift cluster.password
: The password to authenticate the user with the Redshift cluster.
Optional connection parameters include the following:
schema
: The database schema to use. Default is public.sslmode
: The SSL mode for the connection.
Usage
Retrieve data from a specified table by providing the integration name, schema, and table name:
SELECT *
FROM redshift_datasource.schema_name.table_name
LIMIT 10;
Run Amazon Redshift SQL queries directly on the connected Redshift database:
SELECT * FROM redshift_datasource (
--Native Query Goes Here
WITH VENUECOPY AS (SELECT * FROM VENUE)
SELECT * FROM VENUECOPY ORDER BY 1 LIMIT 10;
);
The above examples utilize redshift_datasource
as the datasource name, which is defined in the CREATE DATABASE
command.
Troubleshooting Guide
Database Connection Error
- Symptoms: Failure to connect MindsDB with the Amazon Redshift cluster.
- Checklist:
- Make sure the Redshift cluster is active.
- Confirm that host, port, user, password and database are correct. Try a direct Redshift connection using a client like DBeaver.
- Ensure that the security settings of the Redshift cluster allow connections from MindsDB.
- Ensure a stable network between MindsDB and Redshift.
SQL statement cannot be parsed by mindsdb_sql
- Symptoms: SQL queries failing or not recognizing table names containing spaces or special characters.
- Checklist:
- Ensure table names with spaces or special characters are enclosed in backticks.
- Examples:
- Incorrect: SELECT * FROM integration.travel data
- Incorrect: SELECT * FROM integration.‘travel data’
- Correct: SELECT * FROM integration.`travel data`
This troubleshooting guide provided by AWS might also be helpful.
Was this page helpful?