HDFS Integration Guides and Tutorials



A list of guides and tutorials for connecting to and working with live HDFS data.

CData Software's connectivity tools let users connect directly to live HDFS data from widely used BI, analytics, ETL, and custom applications, so customers can access their data wherever they need it. Below is a collection of guides and tutorials on integrating with live HDFS data.

Integration Use-Cases

The articles below are grouped by integration use case.

Business Intelligence & Analytics


Product | Technology | Article Title
Alteryx Designer | ODBC | Prepare, Blend, and Analyze HDFS in Alteryx Designer (ODBC)
Aqua Data Studio | JDBC | Connect to HDFS in Aqua Data Studio
AWS Databricks | JDBC | Process & Analyze HDFS Data in Databricks (AWS)
Birst | JDBC | Build Visualizations of HDFS in Birst
BIRT | JDBC | Design BIRT Reports on HDFS
Clear Analytics | ODBC | Build Charts with HDFS in Clear Analytics
DBxtra | ODBC | Build Dashboards with HDFS in DBxtra
Domo | ODBC | Create Datasets from HDFS in Domo Workbench
Dundas BI | ODBC | Build Dashboards with HDFS in Dundas BI
FineReport | JDBC | Feed HDFS into FineReport
IBM Cognos BI | ODBC | Create Data Visualizations in Cognos BI with HDFS
JasperServer | JDBC | Create HDFS Reports on JasperReports Server
Jaspersoft BI Suite | JDBC | Connect to HDFS in Jaspersoft Studio
JReport Designer | JDBC | Integrate with HDFS in JReport Designer
KNIME | JDBC | Enable the HDFS JDBC Driver in KNIME
LINQPad | ADO.NET | Working with HDFS in LINQPad
Microsoft SSAS | ADO.NET | Build an OLAP Cube in SSAS from HDFS
MicroStrategy | JDBC | Use the CData JDBC Driver for HDFS in MicroStrategy
MicroStrategy Desktop | JDBC | Use the CData JDBC Driver for HDFS in MicroStrategy Desktop
MicroStrategy Web | JDBC | Use the CData JDBC Driver for HDFS in MicroStrategy Web
OBIEE | JDBC | HDFS Reporting in OBIEE with the HDFS JDBC Driver
pandas | Python | Use pandas to Visualize HDFS in Python
Pentaho Report Designer | JDBC | Integrate HDFS in the Pentaho Report Designer
Power BI Desktop | Power BI | Author Power BI Reports on Real-Time HDFS
QlikView | ODBC | Connect to and Query HDFS in QlikView over ODBC
R | JDBC | Analyze HDFS in R (JDBC)
R | ODBC | Analyze HDFS in R (ODBC)
RapidMiner | JDBC | Connect to HDFS in RapidMiner
SAP Business Objects | JDBC | Create an SAP BusinessObjects Universe on the CData JDBC Driver for HDFS
SAP Crystal Reports | JDBC | Publish Reports with HDFS in Crystal Reports (JDBC)
SAS | ODBC | Use the CData ODBC Driver for HDFS in SAS for Real-Time Reporting and Analytics
SAS JMP | ODBC | Use the CData ODBC Driver for HDFS in SAS JMP
Sisense | JDBC | Visualize Live HDFS in Sisense
SpagoBI | JDBC | Connect to HDFS in SpagoBI
Tableau | Tableau | Visualize HDFS in Tableau Desktop
Tableau Server | Tableau | Publish HDFS-Connected Dashboards in Tableau Server
TIBCO Spotfire | ADO.NET | Visualize HDFS in TIBCO Spotfire through ADO.NET
TIBCO Spotfire Server | JDBC | Operational Reporting on HDFS from Spotfire Server


ETL & Replication


Product | Technology | Article Title
Amazon Redshift | CData Sync | Automated Continuous HDFS Replication to Amazon Redshift
Amazon S3 | CData Sync | Automated Continuous HDFS Replication to Amazon S3
Apache Airflow | JDBC | Bridge HDFS Connectivity with Apache Airflow
Apache Camel | JDBC | Integrate with HDFS using Apache Camel
Apache Cassandra | CData Sync | Automated Continuous HDFS Replication to Apache Cassandra
Apache Kafka | CData Sync | Automated Continuous HDFS Replication to Apache Kafka
Apache NiFi | JDBC | Bridge HDFS Connectivity with Apache NiFi
Azure Data Lake | CData Sync | Automated Continuous HDFS Replication to Azure Data Lake
Azure Synapse | CData Sync | Automated Continuous HDFS Replication to Azure Synapse
BIML | SSIS | Use Biml to Build SSIS Tasks to Replicate HDFS to SQL Server
CloverDX | JDBC | Connect to HDFS in CloverDX (formerly CloverETL)
Couchbase | CData Sync | Automated Continuous HDFS Replication to Couchbase
CSV | CData Sync | Automated Continuous HDFS Replication to Local Delimited Files
Databricks | CData Sync | Automated Continuous HDFS Replication to Databricks
ETL Validator | JDBC | How to Work with HDFS in ETL Validator
FoxPro | ODBC | Work with HDFS in FoxPro
Google AlloyDB | CData Sync | Automated Continuous HDFS Replication to Google AlloyDB
Google BigQuery | CData Sync | Automated Continuous HDFS Replication to Google BigQuery
Google Cloud SQL | CData Sync | Automated Continuous HDFS Replication to Google Cloud SQL
Google Data Fusion | JDBC | Build HDFS-Connected ETL Processes in Google Data Fusion
Heroku / Salesforce Connect | CData Sync | Replicate HDFS for Use in Salesforce Connect
HULFT Integrate | JDBC | Connect to HDFS in HULFT Integrate
IBM DB2 | CData Sync | Automated Continuous HDFS Replication to IBM DB2
Informatica Cloud | JDBC | Integrate HDFS in Your Informatica Cloud Instance
Informatica PowerCenter | JDBC | Create Informatica Mappings From/To a JDBC Data Source for HDFS
Jaspersoft ETL | JDBC | Connect to HDFS in Jaspersoft Studio
Microsoft Access | CData Sync | Automated Continuous HDFS Replication to Microsoft Access
Microsoft OneLake | CData Sync | Automated Continuous HDFS Replication to Microsoft OneLake (Fabric)
MongoDB | CData Sync | Automated Continuous HDFS Replication to MongoDB
MySQL | CData Sync | Automated Continuous HDFS Replication to MySQL
Oracle Data Integrator | JDBC | ETL HDFS in Oracle Data Integrator
Oracle Database | CData Sync | Automated Continuous HDFS Replication to Oracle
Pentaho Data Integration | JDBC | Integrate HDFS in Pentaho Data Integration
petl | Python | Extract, Transform, and Load HDFS in Python
PostgreSQL | CData Sync | Automated Continuous HDFS Replication to PostgreSQL
Replicate to MySQL | PowerShell | Replicate HDFS to MySQL with PowerShell
SAP HANA | CData Sync | Automated Continuous HDFS Replication to SAP HANA
SingleStore | CData Sync | Automated Continuous HDFS Replication to SingleStore
SnapLogic | JDBC | Integrate HDFS with External Services using SnapLogic (JDBC)
Snowflake | CData Sync | Automated Continuous HDFS Replication to Snowflake
SQL Server | CData Sync | Automated Continuous HDFS Replication to SQL Server
SQLite | CData Sync | Automated Continuous HDFS Replication to SQLite
Talend | JDBC | Connect to HDFS and Transfer Data in Talend
UiPath Studio | ODBC | Create an RPA Flow that Connects to HDFS in UiPath Studio
Vertica | CData Sync | Automated Continuous HDFS Replication to a Vertica Database


Data Virtualization




Software Development


Product | Technology | Article Title
AWS Lambda | JDBC | Access Live HDFS Data in AWS Lambda
.NET Charts | ADO.NET | DataBind Charts to HDFS
.NET QueryBuilder | ODBC | Rapidly Develop HDFS-Driven Apps with Active Query Builder
Apache Spark | JDBC | Work with HDFS in Apache Spark Using SQL
C++Builder | ODBC | DataBind Controls to HDFS Data in C++Builder
ColdFusion | JDBC | Query HDFS in ColdFusion Using JDBC
ColdFusion | ODBC | Query HDFS in ColdFusion Using ODBC
Dash | Python | Use Dash & Python to Build Web Apps on HDFS
Delphi | ODBC | DataBind Controls to HDFS Data in Delphi
DevExpress | ADO.NET | DataBind HDFS to the DevExpress Data Grid
EF - Code First | ADO.NET | Access HDFS with Entity Framework 6
EF - LINQ | ADO.NET | LINQ to HDFS
EF - MVC | ADO.NET | Build MVC Applications with Connectivity to HDFS
FileMaker Pro | ODBC | Bidirectional Access to HDFS from FileMaker Pro
FileMaker Pro (on Mac) | JDBC | Bidirectional Access to HDFS from FileMaker Pro (on Mac)
Go | ODBC | Write a Simple Go Application to work with HDFS on Linux
Hibernate | JDBC | Object-Relational Mapping (ORM) with HDFS Entities in Java
IntelliJ | JDBC | Connect to HDFS in IntelliJ
JBoss | JDBC | Connect to HDFS from a Connection Pool in JBoss
JDBI | JDBC | Create a Data Access Object for HDFS using JDBI
JRuby | JDBC | Connect to HDFS in JRuby
Lazarus IDE | ODBC | Easily Integrate HDFS Data in Lazarus Pascal IDE
Mendix | JDBC | Build HDFS-Connected Apps in Mendix (JDBC)
NodeJS | ODBC | Query HDFS through ODBC in Node.js
PHP | ODBC | Natively Connect to HDFS in PHP
PowerBuilder | ADO.NET | Connect to HDFS from PowerBuilder
PowerShell | PowerShell | Pipe HDFS to CSV in PowerShell
PyCharm | ODBC | Using the CData ODBC Driver for HDFS in PyCharm
Python | ODBC | Connect to HDFS in Python on Linux/UNIX
Ruby | ODBC | Connect to HDFS in Ruby
RunMyProcess DSEC | JDBC | Connect to HDFS in DigitalSuite Studio through RunMyProcess DSEC
Servoy | JDBC | Build HDFS-Connected Apps in Servoy
Spring Boot | JDBC | Access Live HDFS Data in Spring Boot Apps
SQLAlchemy | Python | Use SQLAlchemy ORMs to Access HDFS in Python
Tomcat | JDBC | Configure the CData JDBC Driver for HDFS in a Connection Pool in Tomcat
VCL App (RAD Studio) | ODBC | Build a Simple VCL Application for HDFS
WebLogic | JDBC | Connect to HDFS from a Connection Pool in WebLogic


Data Management




Workflow Automation




Ready to get started?

Learn more:

HDFS Connectivity Solutions