Apache Parquet

@ApacheParquet

Language-agnostic, open-source columnar file format for analytics

Joined April 2013

Tweets


  1. Retweeted
    6 Nov 2018

    PSA: If you use the page-level statistics, please chime in on JIRA:

  2. Retweeted
    25 Jul 2018

    Last speaker in the scientific room before lunch is Peter Hoffmann, talking about #Pandas and how to work with large datasets.

  3. Retweeted
    30 Jul 2018

    Have a look at the bucketing sink rework for the upcoming release and the Parquet writer ;)

  4. Retweeted
    18 Jun 2018

    Can someone answer this: why is the format faster than other columnar storage like HBase, Kudu, etc.?

  5. Retweeted
    2 Jul 2018

    My talk from the DMBI 2018 Conference about our journey to analytics is available online. Thanks, everyone, for attending!

  6. Retweeted
    19 Apr 2018

    How big is big data? Well, after filtering the collisions, they generate 12.3 PB in a month, in the special ROOT format.

  7. Retweeted
    23 Apr 2018

    A month from now I'll be speaking on our big data journey at the Conference in London. If you're there, drop by!

  9. Retweeted
    27 Mar 2018
  10. Retweeted
    26 Mar 2018

    Great benchmark. In short, Kudu is faster than Parquet for random-access queries like CRUD operations, but slower for analytics queries.

  11. Retweeted
    5 Mar 2018

    If you’re a company using open source projects and not sure how to contribute, a release engineer would be a tremendous help. It’s hard to do this properly part time. I have a specific project in mind, if you need a hint.

  12. Retweeted
    28 Feb 2018

    You do not need Spark to create Parquet files; you can use plain Java, and it can even fit in AWS Lambda for a serverless solution:

  13. Retweeted
    5 Mar 2017
  14. Retweeted
    1 Feb 2018

    Is there a way to go from MSSQL to Parquet directly?

  15. Retweeted
    11 Jan 2018

    I'll be speaking at the Conference this May in London, sharing our journey in one of our many adventures. You're all invited!

  16. Retweeted
    4 Jan 2018
  17. Retweeted
    4 Jan 2018

    Also, the file size went down from 10 GB to 3 GB without any compression.

  18. Retweeted
    4 Jan 2018

    Working with 10 GB of CSV data. Pandas read_csv took 16 min to load the CSV into memory. Converted to Parquet with pyarrow. It took 30 s to read into a pyarrow table and 16 s to convert to a pandas DataFrame. 16 min => 46 s!

  19. Retweeted
    7 Dec 2017
  20. Retweeted
    8 Dec 2017

    Presenting our work today on managing data.


