Ayush Saxena

Bengaluru, Karnataka, India Contact Info
467 followers 337 connections

Join to view profile

About

Working as part of Cloudera's Enterprise DataWarehouse R&D Team.

Mainly focusing…

Experience & Education

  • The Apache Software Foundation

View Ayush’s full experience

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Publications

Projects

  • Apache Hive 4.x & Apache Iceberg Integration

    Working as part of Hive-Iceberg team, mainly focusing on areas related to Apache Hive-4.x Integration with Apache Iceberg

  • Apache Hive

    - Present

    Working on solving production bugs and performance issues for Apache Hive.
    Contributing the same to Apache Hive opensource community

  • Apache Hadoop

    Active member of Apache Hadoop opensource community.
    Contributing/Reviewing/Commiting code fixes and improvements.

  • Hive Runtime

    -

    Working on analysing and optimising Hive ETL performance on Cloud storages along with handling feature requests, improvements and bugs around Hive Runtime framework

    Worked on upgrading Hadoop supported version for Hive and adding support for Aarch64/ARM CPU architectures

    Worked on integrating Apache Ozone with Hive

    Worked on several improvements in Apache Tez

  • Hive GeoSpatial Data Support

    -

    Worked on design and implementation of GeoSpatial Data support in Hive.
    Integrating use of standard ESRI GeoSpatial UDFs in Hive along with providing an ability to store and import the standard ESRI geospatial data into Hive.

  • Hive Replication

    -

    Hive Replication provides a mechanism to replicate hive metadata & data between clusters.

    Worked on several feature enhancements, performance improvements and bug fixes for Hive Replication.

    Designed & Implemented use of HDFS snapshot based data copy for External Table Data.

    Designed & Implemented the Optimised Bootstrap Solution for Planned and Unplanned failovers.

    Optimised the Checkpointing flow of Hive-Replication & Implemented several In-Progress tracking…

    Hive Replication provides a mechanism to replicate hive metadata & data between clusters.

    Worked on several feature enhancements, performance improvements and bug fixes for Hive Replication.

    Designed & Implemented use of HDFS snapshot based data copy for External Table Data.

    Designed & Implemented the Optimised Bootstrap Solution for Planned and Unplanned failovers.

    Optimised the Checkpointing flow of Hive-Replication & Implemented several In-Progress tracking mechanisms (metrics/JMX)

    Implemented several data copy enhancements & solved some critical bugs.

    Contributing the same to Apache Hive Opensource

  • Hive Warehouse Connector(HWC)

    -

    Worked on implementing several performance improvements & solving production issues for HWC & Spark-Acid.

  • SparklyrHWC

    -

    Implemented a R interface for Spark-Hive Connectivity using sparklyr and HWC

    See project
  • Hadoop HDFS

    -

    Worked as part of Huawei Hadoop-HDFS team.
    Contributed several performance improvements, bug fixes and solved some critical production issues.

    Worked on features like HDFS Erasure Coding, Router Based Federation(RBF), ViewFs(Client Side Federation), WebHdfs, Block Storage Policies, Block Placement Policies(Developed Available Space Rack Fault Tolerant BPP), Observer Reads along with other core components of HDFS.

    Contributed to Apache Hadoop

  • ARM Support In Hadoop

    -

    Worked on analysing and solving challenges on deploying Hadoop clusters on ARM machines.

    See project
  • DR Mover

    -

    Tool to move Data Replicas to different Disaster Recovery(DR) Availability Zones(AZ), depending upon the AZ policy set on the directory.

    This enables to distribute the data replicas across different Availability Zones, in order to prevent data loss in case of failure of any Data Centre

  • NAS HDFS

    -

    Worked on providing an interface for the HDFS Java Client to interact with NAS Server similarly as it connects with a HDFS Namenode.

  • Huawei DLC

    -

    Worked as part of Huawei’s DataLake Client team in the later stages.
    Chased some critical pre-release bugs and improvements. Worked on implementing the UserNamespace Server for DLC.
    UNS provides a way to isolate FileSystem tree at user level

  • FLEX EC

    -

    Added support for FLEX EC Native EC Algorithm(Similar to Intel ISA-L) in HDFS

  • HDFS Erasure Coding

    -

    Worked on stabilising and deployment of Hadoop-HDFS Erasure Coding feature, by contributing and reviewing bug fixes and performance improvements for the feature.

    Feature owner for production issues and customer requests related to Erasure Coding at Huawei

  • HDFS Router Based Federation

    -

    Worked on stabilisation and deployment of Router Based Federation(RBF as a scalability solution for HDFS).
    Contributed several critical bug fixes, code and performance improvements and Implemented several Client Protocol API's for RBF.

Honors & Awards

  • Code Award: Influence

    Cloudera

    For contributions towards Apache Hive-4.0

  • Code Award: Commitment

    Cloudera

    For Contributions to Apache Hive

  • Code Award: Commitment

    Cloudera

    For contributions to Apache Hive

  • Code Award: Influence

    Cloudera

    For contributions to Apache Hive

  • Code Award: Influence

    Cloudera

    For contribution towards Apache Iceberg-Hive-4.x Integration

  • Code Award: Commitment

    Cloudera

    For contributions to Apache Hive

  • Inspiring Performance Badge

    -

    For contributions to Hadoop

  • Maestro-Execution Excellence

    -

    For contributions to Hadoop-HDFS

  • SPOT AWARD

    Huawei Technologies

    For Contributions to HDFS Router Baased Federation(RBF).

  • SPOT AWARD

    Huawei Technologies

    Flex EC Native Erasure Coding Algorithm Implementation in HDFS

  • Half Yearly BL Award

    -

    For Contributions to HDFS

Languages

  • English

    Full professional proficiency

  • Hindi

    Native or bilingual proficiency

Organizations

  • Apache Software Foundation

    Member, Committer & Contributor

    - Present

    Contributing to Apache Hadoop, Apache Hive, Apache Ozone and Apache Tez Opensource Projects

View Ayush’s full profile

  • See who you know in common
  • Get introduced
  • Contact Ayush directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Ayush Saxena in India