novatechflow | Alexander Alten

Posts

Showing posts from July, 2014

Why HiveServer2 Replaced the Hive CLI (and Why It Still Matters)

HiveServer2 replaced the old Hive CLI because the CLI bypassed all security and governance layers, could not support multi-user concurrency, and created operational risks that modern data platforms cannot tolerate. This updated version explains the historical context, what changed in today’s Hadoop and Hive environments, and why Beeline and JDBC remain the only correct way to access Hive securely and predictably. When Hive 0.11 introduced HiveServer2 (HS2), it marked a necessary break with the legacy Hive CLI model. While the original post explained this transition for early Hadoop distributions, the underlying reasons remain valid even in modern Hive deployments. Today Hive CLI is effectively obsolete, and all secure or governed environments require HS2 as the mandatory entry point. Why the Hive CLI Had to Die 1. The CLI Bypassed All Security The original Hive CLI talked directly to the Hive Metastore and launched MapReduce or Tez jobs without going through a controlled ser...

Understanding HDFS Extended Attributes (XAttr) in Modern Hadoop

HDFS extended attributes (XAttr) allow files and directories to carry custom metadata such as encryption markers, checksums, lineage tags or security labels. Introduced years ago and now fully stable in Hadoop 3.x, they provide a flexible way for governance tools and applications to attach structured or free-form information directly to filesystem objects. This updated version explains how namespaces work, how limits are configured and how to read and write attributes using current HDFS commands. Extended Attributes (XAttr), familiar from UNIX-like filesystems, allow HDFS to store custom metadata alongside files and directories. Modern data platforms use these attributes for encryption tagging, data classification, backup markers, application metadata and security frameworks. How HDFS Stores Extended Attributes HDFS supports four namespaces, aligned with Linux kernel semantics: user – user-defined metadata security – security-related attributes (superuser only) sy...