Apache Knox
- Last Updated: January 30, 2023
Apache Knox is a gateway that acts as a reverse proxy for Apache Hadoop clusters. Its primary advantages are a single point of authentication, which simplifies security policy enforcement, and REST API access to the clustered environment. The driver connects to Apache Knox in much the same way as a standard connection that uses HTTP mode.
To connect to an Apache Knox gateway:
The following examples demonstrate a basic connection to Apache Knox using Kerberos authentication and SSL data encryption.
Using a connection string:
DRIVER=DataDirect 8.0 Apache Hive Wire Protocol;AuthenticationMethod=4;
DatabaseName=hivedb1;EncryptionMethod=1;HostName=HiveServer;
HTTPPath=gateway/default/hive;PortNumber=8443;
ServicePrincipalName=knox/knoxserver1.example.com@EXAMPLE.COM;
TransportMode=1;
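When the connection string is built in application code, it can help to keep the options as key/value pairs and join them in one place. The following is a minimal Python sketch, not part of the driver: the helper name `build_connection_string` is hypothetical, and the option values are copied verbatim from the connection string above.

```python
def build_connection_string(options):
    """Join key/value pairs into a semicolon-terminated ODBC connection string.

    Hypothetical helper for illustration; it is not provided by the driver.
    """
    parts = []
    for key, value in options.items():
        text = str(value)
        # This simple sketch does not attempt to quote or escape values.
        if ";" in text:
            raise ValueError(f"value for {key} must not contain ';'")
        parts.append(f"{key}={text};")
    return "".join(parts)


# Option names and values are copied from the connection string above.
knox_options = {
    "DRIVER": "DataDirect 8.0 Apache Hive Wire Protocol",
    "AuthenticationMethod": 4,
    "DatabaseName": "hivedb1",
    "EncryptionMethod": 1,
    "HostName": "HiveServer",
    "HTTPPath": "gateway/default/hive",
    "PortNumber": 8443,
    "ServicePrincipalName": "knox/knoxserver1.example.com@EXAMPLE.COM",
    "TransportMode": 1,
}

connection_string = build_connection_string(knox_options)
print(connection_string)
```

If a Python ODBC bridge such as pyodbc is installed, the resulting string could be passed to `pyodbc.connect(connection_string)`. That step is an assumption about the client environment and is not covered by this topic.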
Using the odbc.ini file:
Driver=ODBCHOME/lib/ivhivexx.so
Description=DataDirect Apache Hive Wire Protocol driver
AuthenticationMethod=4
Database=hivedb1
EncryptionMethod=1
HostName=HiveServer
HTTPPath=gateway/default/hive
HostNameInCertificate=MySubjectAltName
PortNumber=8443
ServicePrincipalName=knox/knoxserver1.example.com@EXAMPLE.COM
TransportMode=1
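If the odbc.ini entries are generated by a script rather than edited by hand, Python's standard `configparser` module can write them out. The following is a sketch under stated assumptions: the data source name `Apache Knox Example` is hypothetical (the listing above does not show a section header), and the keys and values are copied from the listing as-is, including the `ODBCHOME` and `ivhivexx.so` placeholders.

```python
import configparser
import io

# Hypothetical data source name; the original listing does not show one.
DATA_SOURCE_NAME = "Apache Knox Example"

parser = configparser.ConfigParser(interpolation=None)
parser.optionxform = str  # preserve the mixed-case option names

# Keys and values copied from the odbc.ini listing above.
parser[DATA_SOURCE_NAME] = {
    "Driver": "ODBCHOME/lib/ivhivexx.so",
    "Description": "DataDirect Apache Hive Wire Protocol driver",
    "AuthenticationMethod": "4",
    "Database": "hivedb1",
    "EncryptionMethod": "1",
    "HostName": "HiveServer",
    "HTTPPath": "gateway/default/hive",
    "HostNameInCertificate": "MySubjectAltName",
    "PortNumber": "8443",
    "ServicePrincipalName": "knox/knoxserver1.example.com@EXAMPLE.COM",
    "TransportMode": "1",
}

# Write to an in-memory buffer; a real script would write to the odbc.ini path.
buffer = io.StringIO()
parser.write(buffer, space_around_delimiters=False)
odbc_ini_text = buffer.getvalue()
print(odbc_ini_text)
```

Setting `space_around_delimiters=False` keeps the `key=value` layout used in the listing above.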
Note: If you receive an HTTP/1.1 500 Server Error message when attempting to insert a large number of rows with Apache Knox, reduce the value specified for the ArrayInsertSize property until the operation succeeds.
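The reduction described in the note can be automated as a simple search. The sketch below halves the candidate value after each failure. The halving step is an illustrative choice, not something the documentation prescribes. The `try_insert` callable is hypothetical: it stands for application code that connects with ArrayInsertSize set to the given value, runs the insert, and returns `True` only if no HTTP/1.1 500 Server Error occurred.

```python
def find_working_array_insert_size(try_insert, start_size, minimum=1):
    """Return the first size, halving from start_size, for which try_insert succeeds.

    try_insert is a hypothetical callable: try_insert(size) -> bool.
    Returns None if even the minimum size fails.
    """
    if minimum < 1 or start_size < minimum:
        raise ValueError("require start_size >= minimum >= 1")
    size = start_size
    while True:
        if try_insert(size):
            return size
        if size <= minimum:
            return None
        # Halve the candidate, but never go below the minimum.
        size = max(minimum, size // 2)


# Stand-in for a real insert attempt: pretend the gateway rejects
# any batch larger than 300 rows. The threshold is invented for the demo.
def fake_insert(size):
    return size <= 300


working_size = find_working_array_insert_size(fake_insert, start_size=2048)
print(working_size)
```

In a real application, the value returned would then be set as ArrayInsertSize in the connection string or data source.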