Find anything about our product, documentation, and more.

Blazor Components | 70+ Native UI Controls | Syncfusion®

https://www.syncfusion.com/blazor-components

The Syncfusion^® native Blazor components library offers 70+ UI and Data Viz web controls that are responsive and lightweight for building modern web apps.

.NET PDF Framework | C# / VB.NET PDF API | Syncfusion®

https://www.syncfusion.com/pdf-framework/net

.NET PDF framework is a high-performance and comprehensive library used to create, read, merge, split, secure, edit, view, and review PDF files in C#/VB.NET.

155+ Xamarin UI controls for iOS, Android & UWP apps | Syncfusion®

https://www.syncfusion.com/xamarin-ui-controls

Over 155 Xamarin UI controls to create cross-platform native mobile apps for iOS, Android, UWP and macOS platforms from a single C# code base.

My Dashboard

SIGN OUT

Syncfusion Big Data: query data from HDFS in Avro format

3 Replies
2 Participants

Created by
IB Ilya Bo

Platform
Big Data Platform

Platform
Big Data Platform

Control
General

Created On
Jun 22, 2017 10:41 AM UTC

Last Activity On
Jun 27, 2017 01:02 PM UTC

Want to subscribe?
SIGN IN

I have data which stored in HDFS in Avro format.

How can I query it via Syncfusion Big Data Studio by Spark SQL?

Thanks!

3 Replies

AT Aravindraja Thinakaran Syncfusion Team June 23, 2017 07:20 AM UTC

Hi Ilya,

Thanks for contacting Syncfusion support.

Please check with below steps to access AVRO files which available in HDFS using SparkSQL from Big Data Studio.

Step 1: Download and extract the spark-avro_2.11-3.2.0.jar file and copy the jar to below location.

<Install Drive>:\Syncfusion\BigData\<Install Drive>\BigDataSDK\SDK\Spark\jars\

Step 2: Restart Spark Thrift server service from Service Manager.

Step 3: Use “/Data/Spark/Resources/Users.avro” as input file from HDFS to create table in Spark SQL using below command.

CREATE TABLE Users USING com.databricks.spark.avro OPTIONS (path "/Data/Spark/Resources/Users.avro");

Step 4: After table created use below command to view the created table.

select * from Users;

Thanks,

Aravindraja T

IB Ilya Bo June 26, 2017 03:31 PM UTC

Thank you for your answer!One more thing I would like to clarify:How to specify Avro Schema (.avsc file) correctly?

AT Aravindraja Thinakaran Syncfusion Team June 27, 2017 01:02 PM UTC

Hi Ilya,

You can specify a custom Avro schema (.avsc file) using Scala API and access it using Spark SQL as usual. Please follow the below procedure.

Step 1: Create Spark table by specifying Avro schema using Spark Scala tab in Big Data Studio by running the following script,

http://www.syncfusion.com/downloads/support/forum/131126/ze/AvroFileSchema-1089509997.zip

Step 2: Access the table as usual using Spark SQL.

Note:

It seems there is some limitation in specifying custom Avro schema in Spark SQL API , so we provided solution by using Scala API to specify a custom schema.

https://github.com/databricks/spark-avro

Thanks,

Aravindraja T.

3 Replies
2 Participants
Want to subscribe?
SIGN IN
Created by
IB Ilya Bo
Platform
Big Data Platform
Control
General
Created On
Jun 22, 2017 10:41 AM UTC
Last Activity On
Jun 27, 2017 01:02 PM UTC

Viewer Component

.NET PDF Processing Library

Conversions

Editor Component

.NET Word Processing Library

Conversions

Editor Component

.NET Excel Processing Library

Conversions

.NET PowerPoint Processing Library

Conversions

Syncfusion Big Data: query data from HDFS in Avro format

Enterprise Solutions

Free Tools

Viewer Component

.NET PDF Processing Library

Conversions

Editor Component

.NET Word Processing Library

Conversions

Editor Component

.NET Excel Processing Library

Conversions

.NET PowerPoint Processing Library

Conversions

Learning

Resources

Support

Syncfusion Big Data: query data from HDFS in Avro format