Scanning Engine v2 - Module - PrestoDB¶
Overview¶
Presto (including PrestoDB, and PrestoSQL, which was rebranded as Trino) is a distributed SQL query engine for big data. Its architecture allows users to query data sources such as Hadoop, Cassandra, Kafka, AWS S3, Alluxio, MySQL, MongoDB and Teradata, and to combine multiple data sources within a single query. Presto is community-driven open-source software released under the Apache License.
Upon connecting to a Presto service, this module attempts to enumerate all of its databases, gathering the catalogs, schemas and tables they contain. It does not gather actual values from the tables themselves.
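The enumeration described above can be sketched as a three-level walk using Presto's `SHOW CATALOGS`, `SHOW SCHEMAS` and `SHOW TABLES` statements. This is only an illustration and not the module's actual implementation, which is not documented here. The `run_query` callable is a hypothetical stand-in for whatever client sends SQL to the service and returns rows, and `quote_ident` and `enumerate_metadata` are names invented for this sketch.

```python
from typing import Callable, Dict, List

# Hypothetical query runner (an assumption): takes a SQL string and
# returns the result rows, each row being a list of column values.
QueryRunner = Callable[[str], List[List[str]]]


def quote_ident(name: str) -> str:
    """Double-quote an identifier, escaping embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def enumerate_metadata(run_query: QueryRunner) -> Dict[str, dict]:
    """Collect catalog, schema and table names only; no table rows are read."""
    catalogs: Dict[str, dict] = {}
    for catalog_row in run_query("SHOW CATALOGS"):
        catalog = catalog_row[0]
        schemas: Dict[str, dict] = {}
        for schema_row in run_query(f"SHOW SCHEMAS FROM {quote_ident(catalog)}"):
            schema = schema_row[0]
            table_rows = run_query(
                f"SHOW TABLES FROM {quote_ident(catalog)}.{quote_ident(schema)}"
            )
            schemas[schema] = {"tables": [row[0] for row in table_rows]}
        catalogs[catalog] = {"schemas": schemas}
    return {"catalogs": catalogs}
```

Only metadata statements are issued in this sketch, which matches the statement that no table values are gathered. The returned dictionary is shaped like the result body shown under Examples.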
Targeting¶
This module targets TCP ports by IP address or hostname.
Configuration¶
Only the publicly-available configuration keys that can be set in a job's module invocations are described below. Additional configuration keys may exist but are not shown here, either because they are restricted to specific users or because they are permanently set to a static value. If no keys have (required) after their names, then invocations of this module need not contain a config key.
Named Keys¶
debug¶
Run module with a debugging configuration.
- Type: boolean
tls¶
Connect using SSL/TLS.
- Type: boolean
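As an illustration of the two keys above, a config object can be modelled as a small Python dictionary. How the config is embedded in a job's module invocation is not described on this page, so that wrapper is left out. The `validate_config` helper is hypothetical and checks only the public keys listed here; it would reject any restricted key that may exist.

```python
import json

# Public, optional boolean keys documented for the prestodb module.
PUBLIC_KEYS = {"debug", "tls"}


def validate_config(cfg: dict) -> None:
    """Hypothetical check: only known public keys, and only boolean values."""
    for key, value in cfg.items():
        if key not in PUBLIC_KEYS:
            raise ValueError(f"unknown config key: {key}")
        if not isinstance(value, bool):
            raise TypeError(f"config key {key} must be a boolean")


config = {
    "debug": False,  # do not run with the debugging configuration
    "tls": True,     # connect using SSL/TLS
}

validate_config(config)
print(json.dumps(config, indent=2))
```

Since neither key is marked (required), an empty config also passes this check.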
Schemas¶
The schema for the body object of all results generated with .task.module_name equal to prestodb can be found here. The schema for results is available both in standalone and bundled form.
Examples¶
These are examples of the .body object for results with .task.module_name equal to prestodb.
Live Host¶
This example was generated with a live host on the internet.
{
  "catalogs": {
    "hive": {
      "schemas": {
        "default": {
          "tables": []
        },
        "information_schema": {
          "tables": [
            "applicable_roles",
            "columns",
            "enabled_roles",
            "roles",
            "schemata",
            "table_privileges",
            "tables",
            "views"
          ]
        }
      }
    }
  }
}
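A result body of this shape can be flattened into fully qualified table names when post-processing results. The following is a minimal sketch that assumes the body has already been parsed from JSON into a Python dictionary; `iter_tables` is a name invented for this example, and it relies only on the `catalogs`, `schemas` and `tables` keys seen in the example above.

```python
from typing import Iterator


def iter_tables(body: dict) -> Iterator[str]:
    """Yield catalog.schema.table for every table listed in a result body."""
    for catalog, catalog_data in body.get("catalogs", {}).items():
        for schema, schema_data in catalog_data.get("schemas", {}).items():
            for table in schema_data.get("tables", []):
                yield f"{catalog}.{schema}.{table}"


# The live-host example, as a parsed dictionary.
body = {
    "catalogs": {
        "hive": {
            "schemas": {
                "default": {"tables": []},
                "information_schema": {
                    "tables": [
                        "applicable_roles",
                        "columns",
                        "enabled_roles",
                        "roles",
                        "schemata",
                        "table_privileges",
                        "tables",
                        "views",
                    ]
                },
            }
        }
    }
}

for name in iter_tables(body):
    print(name)
```

Schemas with an empty table list, such as `default` here, contribute no names, so the first line printed is `hive.information_schema.applicable_roles`.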
Changelog¶
- 2023-07-12: Initial public release of documentation.