0 / 0
Apache Impala connection
Last updated: Nov 26, 2024
Apache Impala connection

To access your data in Apache Impala, create a connection asset for it.

Apache Impala provides high-performance, low-latency SQL queries on data that is stored in popular Apache Hadoop file formats.

Supported versions

Apache Impala 1.3+

Create a connection to Apache Impala

To create the connection asset, you need these connection details:

  • Database (optional): If you do not enter a database name, you must enter the catalog name, schema name, and the table name in the properties for SQL queries.
  • Hostname or IP address
  • Port number
  • Username and password
  • SSL certificate (if required by the database server)

Authentication method

Select the security mechanism to use to authenticate the user:

  • Username and password or Kerberos credentials
    Available Kerberos selections depend on whether you select Personal or Shared credentials.

  • LDAP
    Use an LDAP security mechanism for external authentication.

    Note:

    SPSS Modeler supports only the Username and password authentication method.

For Private connectivity, to connect to a database that is not externalized to the internet (for example, behind a firewall), you must set up a secure connection.

Choose the method for creating a connection based on where you are in the platform

In a project
Click Assets > New asset > Connect to a data source. See Adding a connection to a project.
In a catalog
Click Add to catalog > Connection. See Adding a connection asset to a catalog.
In a deployment space
Click Import assets > Data access > Connection. See Adding data assets to a deployment space.
In the Platform assets catalog
Click New connection. See Adding platform connections.

Next step: Add data assets from the connection

Where you can use this connection

You can use Apache Impala connections in the following workspaces and tools:

Projects

  • Data Refinery (watsonx.ai Studio or IBM Knowledge Catalog)
  • DataStage (DataStage service). See Connecting to a data source in DataStage.
  • Metadata enrichment (IBM Knowledge Catalog)
  • Metadata import (IBM Knowledge Catalog)
  • SPSS Modeler (watsonx.ai Studio)

Catalogs

  • Platform assets catalog

  • Other catalogs (IBM Knowledge Catalog)

Data Virtualization service
You can connect to this data source from Data Virtualization.

Apache Impala setup

Apache Impala installation

Restriction

You can use this connection only for source data. You cannot write to data or export data with this connection.

Running SQL statements

To ensure that your SQL statements run correctly, refer to the Impala SQL Language Reference for the correct syntax.

Learn more

Apache Impala documentation

Parent topic: Supported connections

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more