Allowing Xplenty access to my data on Hadoop Distributed File System (HDFS)

Xplenty can access data residing on any Hadoop distributed file system (HDFS). This article details creating an HDFS connection in Xplenty.

You must provide Xplenty access to the cluster's HDFS. Please consult our support team if the HDFS is behind firewall.

To define a connection in Xplenty to Hadoop Distributed File System (HDFS)

  1. Click your avatar, then click Account settings.
  2. On the left menu, click Connections. Existing connections are listed.
  3. To create a connection, click new connection
  4. Click Hadoop Distributed File System (HDFS).
  5. In the new HDFS connection window, name the connection and enter the connection information:
    • User Name - the user name to use when connecting to HDFS (Kerberos authorization is not currently supported).
    • NameNode Hostname - the host name of the NameNode server or the logical name of the NameNode in a high availability configuration.  
    • NameNode Port - the TCP port of the name node. Leave empty if the NameNode is in a high availability configuration.
    • HttpFS Hostname - the host name of the Hadoop HttpFS gateway node. This should be available to Xplenty's platform.
    • HttpFS Port - the TCP port of the Hadoop HttpFS gateway node (Default is 14000).
  6. Click test connection If the credentials are correct, a message that the connection test was successful appears.
  7. Click create hdfs connection.
  8. The connection is created and appears in the list of file storage connections.
  9. Now you can create a package and test it on your actual data stored in Hadoop Distributed File System (HDFS).

Connectivity And Security

  1. Allowing Xplenty access to data on Amazon S3
  2. Allowing Xplenty access to my Redshift cluster
  3. Allowing Xplenty access to my data on Google AdWords
  4. Allowing Xplenty access to my data on Google Analytics
  5. Allowing Xplenty access to my Google BigQuery dataset
  6. Allow Xplenty access to my Google Cloud Spanner instance
  7. Allow Xplenty access to my data on Google Cloud Storage
  8. Allowing Xplenty access to my data on Hadoop Distributed File System (HDFS)
  9. Allowing Xplenty access to my server behind a firewall
  10. Allowing Xplenty direct access to relational databases on Amazon Web Services RDS
  11. Allowing Xplenty access to MongoDB
  12. Importing Heroku Connections
  13. Allowing Xplenty access to my data on Bing Ads
  14. Allowing Xplenty access to Azure SQL Databases
  15. Allowing Xplenty access to my data on Secure File Transfer Protocol (SFTP)
  16. Allowing Xplenty access to Facebook Ads Insights data (Beta)
  17. Allow Xplenty access your data on Salesforce
  18. Allowing Xplenty access to NetSuite
  19. Allowing Xplenty access to my data on Intercom
  20. Set up a reverse SSH tunnel from Windows
  21. Allowing Xplenty access to my data on Xero (Beta)
  22. Allow Xplenty access to my data on Azure Blob Storage
  23. Allowing Xplenty access to my Snowflake account
  24. Allowing Xplenty access to my data on YouTube
  25. Allowing Xplenty access to my data on Google Sheets
  26. Allowing Xplenty access to my data on Linkedin
  27. Allowing Xplenty access to my data on Instagram
  28. Allowing Xplenty access to my data on Shopify
  29. Allowing Xplenty access to my data on Pinterest

Feedback and Knowledge Base