Skip to main content

About the Product

About Amazon Database

Amazon Database is a serverless interactive query service capable of querying data from Amazon Simple Storage Service (S3) using SQL. It is designed for short, interactive queries that are useful for data exploration. Database enables you to run ad-hoc queries and quickly analyze data that is stored in S3 without ETL processes. Query results are stored in an S3 bucket and made available for analysis in BI tools.

The data formats that Database supports include CSV, JSON, Parquet, Avro, and ORC. Unlike traditional RDBMS or SQL-on-Hadoop solutions that require centralized schema definitions, Database can query self-describing data as well as complex or multi-structured data that is commonly seen in big data systems. Moreover, Database does not require a fully structured schema and can support semi-structured or nested data types such as JSON.

Amazon Database processes the data in record batches and discovers the schema during the processing of each record batch. Thus, Database has the capability to support changing schemas over the lifetime of a query. Database reconfigures its operators and handles these situations to ensure that data is not lost.

Note:

  • Access from Database to your S3 data store is configured through Amazon Web Services (AWS). For information about enabling Database to access S3 data stores, see the Amazon Database documentation: http://docs.aws.amazon.com/athena/latest/ug/what-is.html.
  • When using Database, you are charged for each query that you run. The amount that you are charged is based on the amount of data scanned by the query. For more information, see Amazon Athena Pricinghttps://aws.amazon.com/athena/pricing/.

About the Connector

The Product enables organizations to connect their BI tools to the Amazon Database query service, enabling Business Intelligence, analytics, and reporting on the data that Database returns from Amazon S3 databases.

The Product complies with the JDBC 4.1 and 4.2 data standards. JDBC is one of the most established and widely supported APIs for connecting to and working with databases. At the heart of the technology is the JDBC connector, which connects an application to the database.

The Product is tested on the following data sources:

  • AWS Glue Data Catalog (default)
  • Apache Hive metastore

The Simba JDBC Driver for database Installation and Configuration Guide is suitable for users who are looking to access data returned by the Database query service from their desktop environment. Application developers may also find the information helpful. Refer to your application for details on connecting via JDBC.

Was this article helpful?

We're sorry to hear that.