General requirements
System and sizing requirements
Indexima cluster nodes
- Supported operating systems: CentOS, Amazon Linux, Debian, Red Hat, openSUSE, Ubuntu, Windows Server 2016+
- Number of machines: 2 minimum for production use
- CPU: 4 cores minimum
- RAM: 16 GB minimum
- Local storage: 100 GB available
- Administrator account / root privileges on the machines
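The cluster-node minimums above (4 cores, 16 GB RAM, 100 GB local storage) can be checked on a candidate machine before installation. The following is a minimal sketch, not an Indexima tool: the function names `check_sizing` and `probe_local_machine` are hypothetical, the RAM probe relies on `os.sysconf` and therefore assumes Linux, and "GB" is read as 10^9 bytes, which is an assumption.

```python
import os
import shutil

# Minimums taken from the cluster-node list above.
MIN_CORES = 4
MIN_RAM_GB = 16
MIN_DISK_GB = 100

# Assumption: "GB" is interpreted as 10**9 bytes. This leaves some headroom,
# since machines usually report slightly less RAM than their nominal size.
GB = 10 ** 9


def check_sizing(cores, ram_bytes, disk_free_bytes):
    """Return a list of problems; the list is empty if all minimums are met."""
    problems = []
    if cores < MIN_CORES:
        problems.append(f"CPU: {cores} cores, need at least {MIN_CORES}")
    if ram_bytes < MIN_RAM_GB * GB:
        problems.append(f"RAM: {ram_bytes / GB:.1f} GB, need at least {MIN_RAM_GB}")
    if disk_free_bytes < MIN_DISK_GB * GB:
        problems.append(
            f"Local storage: {disk_free_bytes / GB:.1f} GB free, need at least {MIN_DISK_GB}"
        )
    return problems


def probe_local_machine(path="/"):
    """Collect cores, physical RAM and free disk space (RAM probe is Linux-only)."""
    cores = os.cpu_count() or 0
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    disk_free_bytes = shutil.disk_usage(path).free
    return cores, ram_bytes, disk_free_bytes
```

On a Linux node, `check_sizing(*probe_local_machine())` returns an empty list when the machine meets all three minimums. The `path` argument should point at the file system intended for Indexima's local storage, which may not be `/`.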
Shared storage
- Shared storage accessible by the Indexima nodes (e.g. NFS, S3, HDFS, Ceph / …). See the Storage Compatibility Matrix.
- Minimum 200 GB available (depending on the use case).
Indexima console (optional)
- 1 machine
- Supported operating systems: same as the Indexima cluster nodes
- CPU: 2 cores
- RAM: 8 GB
- Storage: 40 GB available
- Administrator account / root privileges
Ansible installer (optional)
- 1 machine
- Supported operating systems: CentOS, Amazon Linux, Debian
- CPU: 2 cores
- RAM: 8 GB
- Storage: 40 GB available
- Administrator account / root privileges
Network requirements
This table lists all network requirements.
Source | Destination | Port | Protocol | Description |
---|---|---|---|---|
IP Cluster Indexima | Internet | 443 | TCP | Download packages from the internet to install the software prerequisites (Java / Hadoop / Indexima installer / Ansible). Installation from local zip packages is also possible. |
Private client network | IP Cluster Indexima | 8082 | TCP | Connect to Indexima web console / Connect to Indexima API |
Private client network | IP Cluster Indexima | 9999 | TCP | Nodes status web page |
IP Cluster Indexima | IP Cluster Indexima | All | All | Inter-node communication inside Indexima Cluster |
Ansible machine | IP Cluster Indexima | 22 | TCP | Ansible connection to Indexima cluster to install and configure Indexima |
Data consumers (dataviz) | IP Cluster Indexima | 10000 | All | SQL entry point to query the Indexima core engine |
IP Cluster Indexima | Data sources (JDBC or data lake) | N/A | TCP | Requests between Indexima and data sources |
IP Cluster Indexima | Indexima warehouse (S3, NFS, ...) | N/A | TCP | Requests between Indexima and the shared storage for Indexima warehouse |
This list of network requirements is valid for the default Indexima configuration. The ports can be changed in the Indexima configuration.
A network load balancer is recommended to distribute SQL queries across the Indexima Core nodes (port 10000). See Load balancing for more details.
A valid SSL certificate is required to enable SSL encryption (HTTPS access to the Indexima console, or SSL encryption between the nodes and the console).
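Once the firewall rules are in place, the default client-facing ports from the table above (8082, 9999, 10000) can be tested from a client machine. The following is a minimal sketch using only the Python standard library. The names `is_port_open` and `check_node` are hypothetical, the port labels are informal, and the check only confirms that a TCP connection is accepted, not that the Indexima service behind it is healthy.

```python
import socket

# Default ports from the network requirements table; adjust if the
# Indexima configuration overrides them.
DEFAULT_PORTS = {
    8082: "web console / API",
    9999: "node status page",
    10000: "SQL entry point",
}


def is_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def check_node(host, ports=DEFAULT_PORTS):
    """Map each port number to True/False depending on TCP reachability."""
    return {port: is_port_open(host, port) for port in ports}
```

Calling `check_node` with the address of an Indexima node returns a dictionary such as `{8082: True, 9999: True, 10000: False}`, which points at the rule that still needs to be opened. Port 22 (Ansible) and port 443 (package download) originate from other machines and would need to be tested from those sources.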
Software requirements
The following software is required to run Indexima:
Software | Version | Install Link |
---|---|---|
Java JDK (Oracle or OpenJDK) | 8 | Linux: https://openjdk.java.net/install/ Windows: https://www.java.com/en/download/help/windows_manual_download.xml |
Hadoop libraries for standalone deployment | > 2.8.3 | https://archive.apache.org/dist/hadoop/common/hadoop-2.8.3/hadoop-2.8.3.tar.gz https://archive.apache.org/dist/hadoop/common/hadoop-3.1.4/hadoop-3.1.4.tar.gz |
Tez libraries for standalone deployment with Hadoop 3 | 0.9.2 | With a Hadoop 3 setup, add the following libraries to the galactica/tez folder: |
Hadoop libraries for YARN deployment | N/A | See compatibility matrix: Cloudera Compatibility matrix |
JDBC drivers for connecting to datasource | N/A | See compatibility matrix: Data Source Compatibility matrix |
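Because the table above requires Java 8, it can be useful to confirm which Java version a node actually runs. The following is a minimal sketch, assuming `java` is on the `PATH`; the function names are hypothetical. It accounts for the fact that Java 8 and earlier report a legacy "1.x" version string (for example 1.8.0_292), while later releases report the major version first.

```python
import re
import subprocess

REQUIRED_JAVA_MAJOR = 8  # from the software requirements table


def java_major_version(version_output):
    """Extract the major Java version from `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if not match:
        return None
    first, second = match.group(1), match.group(2)
    # Legacy scheme used up to Java 8: "1.8.0_292" means major version 8.
    if first == "1" and second is not None:
        return int(second)
    return int(first)


def installed_java_major():
    """Run `java -version` and parse it; returns None if java is not found."""
    try:
        result = subprocess.run(
            ["java", "-version"], capture_output=True, text=True
        )
    except FileNotFoundError:
        return None
    # `java -version` writes its report to stderr, not stdout.
    return java_major_version(result.stderr)
```

A node meets the Java requirement in the table when `installed_java_major() == REQUIRED_JAVA_MAJOR`. Whether later Java versions are also acceptable is not stated here, so the sketch checks only for version 8.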