Questions tagged [hive]
Apache Hive is a data warehouse system built on top of Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible distributed file systems. Hive provides a mechanism to project structure onto this data and query it using a SQL-like language called HiveQL. Please DO NOT use this tag for the Flutter database that is also named Hive; use the flutter-hive tag instead.
21,917
questions
0
votes
0
answers
7
views
Hive connectivity setup in Abinitio
Can someone help me with how to connect to Hive from Abinitio?
What are the prerequisites for Hive setup in Abinitio?
Thanks
Rao
0
votes
0
answers
7
views
AWS EMR - reading multiple "zip" files from S3 bucket returns Your key is too long
In my daily job I use EMR to process large amounts of data. The data is stored in CSV files in an S3 bucket. My idea was to try processing zipped CSV files instead of plain CSV.
In the Hive app I use ...
0
votes
0
answers
13
views
ANALYZE command does not write data into the Hive metastore
Good day,
In SIT, I manually created a Hive table.
Then I manually created a partition on it.
Then I copied the bucket file from another table and put it inside the partition.
At this moment, if I run ...
-1
votes
0
answers
19
views
Hive SQL: repeating records with amount differing only by ±5%?
How can I delete repeating records whose amounts differ by only 5%, i.e. where the amount falls in the range amount-5% to amount+5%?
Here is an example table in Hive:
[# a.uk b.uk AMOUNT START_DATE FINISHED_DATE source
1 ...
1
vote
1
answer
18
views
How to test hive functionalities with flutter?
I have a TourDB (in Hive) that I need to test:
class TourDB {
TourDB._();
static TourDB? _instance;
factory TourDB.getInstance() {
_instance ??= TourDB._();
return _instance!;
}
...
0
votes
0
answers
14
views
How does Ambari manage Apache Hadoop?
Ambari is an open source project, but HDP is no longer available for download, so how does Ambari manage components such as Apache Hadoop, Hive, and Spark? Can Ambari directly manage existing Hadoop ...
0
votes
1
answer
9
views
Split User name from one field to two fields in HIVE
I have a field: Username.
The output is lastname,firstname (no extra space after the comma).
Ultimately, I need to script the display to be firstname lastname.
My thought is to split the username on ...
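The transformation described in this excerpt can be pinned down with a small sketch. It is written in Python only to make the expected input and output concrete; it is not the asker's code, and the function name `to_display_name` and the sample value `Doe,Jane` are hypothetical. In HiveQL, the built-in `split(Username, ',')` returns an array whose two elements could be recombined in the same order.

```python
def to_display_name(username: str) -> str:
    """Turn 'lastname,firstname' into 'firstname lastname'.

    Assumes a single comma separates the two parts; values without
    a comma are returned unchanged (apart from trimming whitespace).
    """
    last, sep, first = username.partition(",")
    if not sep:
        # No comma found: nothing to reorder.
        return username.strip()
    return f"{first.strip()} {last.strip()}"


if __name__ == "__main__":
    # Hypothetical sample value, not taken from the question.
    print(to_display_name("Doe,Jane"))
```

Splitting on the first comma only keeps the sketch predictable if a name happens to contain a second comma; whether that case occurs in the real data is not stated in the question.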
0
votes
0
answers
22
views
SQL Error [16777217]: Query failed (#20240710_084433_00075_u2prc): java.io.IOException: Start token not found where expected
I'm trying to create a table in Trino, using the Hive connector with MinIO as the storage layer, to query logs stored in a file containing a JSON array of objects, with the following command:
CREATE TABLE IF NOT EXISTS minio....
0
votes
0
answers
12
views
Unable to register Log4j shutdown hook because JVM is shutting down. Using SimpleLogger
I have started Trino, but hive-metastore-1 keeps exiting after a few seconds in Docker. The logs are shown below:
2024-07-10 06:56:24,321 shutdown-hook-0 INFO Log4j appears to be running in a ...
-1
votes
0
answers
16
views
Why doesn't Ranger authorization work when I connect to HiveServer2 using ODBC/JDBC?
I have two users (A, B).
And I have two Hive databases (a_db, b_db).
I gave 'read' authorization to user A only for a_db.
And I didn't give any authorization to user B.
But when user A connected to ...
1
vote
0
answers
39
views
How to load a .dat file into Hive with additional columns?
I want to load a .dat file (without headers) into a Hive external table.
But the Hive table has extra columns, such as cob_date, region, and file_name,
which are not present in the .dat file.
cob_date will be the ...
1
vote
0
answers
43
views
build_runner is not working (neither for Hive nor for Isar); I am currently using Hive
import 'package:moneylog/logsheet_model.dart';
import 'package:moneylog/logpage.dart';
import 'package:hive/hive.dart';
import 'package:flutter/material.dart';
part 'log_model.g.dart';
@HiveType(...
0
votes
0
answers
26
views
Spark Thrift throws TTransportException every time the client disconnects
I am running Thrift like this. Note that I am using the latest apache/spark-py Docker image, unmodified:
docker run -e SPARK_NO_DAEMONIZE=true
-p 10000:10000 -it apache/spark-py /opt/spark/sbin/start-...
0
votes
0
answers
25
views
ClassCastException in Spark SQL Incremental Load with DBT
I'm encountering a ClassCastException error when running an incremental load using DBT and Spark SQL. The error message indicates an issue with casting in the Spark execution plan:
org.apache.hive....
0
votes
1
answer
40
views
Why does an inner join create new distinct values in another column?
I run SQL queries in Hive using Hue. When I use an inner join between two tables, a column in one of the tables gets new distinct values that weren't in that column before the join.
I have a table periods with ...