Code Monkey home page Code Monkey logo

apiary-metastore-docker's People

Contributors

abhimanyugupta07 avatar akravchuk1 avatar andreeapad avatar barnharts4 avatar dhrubajyotisadhu avatar githubjianli avatar javsanbel2 avatar jaygreeeen avatar jianli-expedia avatar kenfigueiredo avatar lulav14 avatar massdosage avatar patduin avatar pradeepbhadani avatar rickart avatar rpoluri avatar spuranda123 avatar warciuch avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

apiary-metastore-docker's Issues

hms-readwrite container restart should fail if calls to aws services fails

Describe the bug
hms-readwrite container restart succeeding even if the calls to aws fails like to get aws account_id, which cause issues like removing aws account_id from s3 bucket name apiary--<aws_account_id>-<aws_region>-<schema_name>.

below shows a blank for aws_account_id

desc database sandbox;

sandbox  s3://apiary-<prefix>--us-east-1-sandbox/ root USER 

To Reproduce
Can restart the hms-readwrite container with aws calls failing

Expected behavior
hms-readwrite container startup should fail if any aws call fails.

Logs
@rpoluri has more details

Lock remains in HMS if HiveTableOperations gets killed

Is your feature request related to a problem? Please describe.
There is a bug where Iceberg clients aborts the session and then keeps orphan locks in the Hive DB.

Related to apache/iceberg#2301

Describe the solution you'd like
Enable Hive Housekeeper service that could cleanup these locks.

Additional context
example error message:
org.apache.iceberg.hive.HiveTableOperations$WaitingForLockException: Waiting for lock.
org.apache.spark.SparkException: Writing job aborted
Caused by: org.apache.iceberg.exceptions.CommitFailedException: Timed out after 181560 ms waiting for lock

emrfs unable to load ECS container credentials

apiary metastore is failing with following message after switching to fargate from EC2 instances.

message:com.amazon.ws.emr.hadoop.fs.shaded.com.amazonaws.SdkClientException: Unable to load AWS credentials from any provider in the chain

Enable Hive Metastore metrics

As a SysOps
I want to enable Hive Metastore metrics
So that process health can be monitored

Acceptance Criteria:

  • Hive metastore emits metrics to a file

startup.sh script breaks when dbpass has special character like ")"

When dbpass contains special character like ), startup.sh script fails with following error:

bash: -c: line 0: `/usr/lib/hive/bin/hive --service metastore --hiveconf hive.root.logger=INFO,console --hiveconf javax.jdo.option.ConnectionURL=jdbc:mysql://apiary-cluster.cluster-XXXXXXX.us-west-2.rds.amazonaws.com:3306/apiarydb --hiveconf javax.jdo.option.ConnectionUserName=hive_user --hiveconf javax.jdo.option.ConnectionPassword=XXX;YYY)ZZZ='

Publish metastore events to SNS

As a SysOps
I want to publish metastore events to SNS
So that those events can be processed by external application to automatically trigger replication or ETL.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.