Comments (5)
Milvus doesn't guarantee primary key dedup.
If both key is retrieved, new will overwrite old. but we don't guarantee this over write happens.
so user has to ensure there is no over write happens
from milvus.
Milvus doesn't guarantee primary key dedup. If both key is retrieved, new will overwrite old. but we don't guarantee this over write happens. so user has to ensure there is no over write happens
Whether or not it overwrites the old data, I think it's still within a logically controllable range. However, if a mistake is made by inserting the same primary key, as in my previous issue, when I query again using ids=[0,1], it only returns the data for id==0. Yet, the data for id==1 actually exists in the database (as it can be retrieved using ids=[1]). This seems to make the database's behavior strange and uncontrollable.
from milvus.
Milvus doesn't guarantee primary key dedup. If both key is retrieved, new will overwrite old. but we don't guarantee this over write happens. so user has to ensure there is no over write happens
Whether or not it overwrites the old data, I think it's still within a logically controllable range. However, if a mistake is made by inserting the same primary key, as in my previous issue, when I query again using ids=[0,1], it only returns the data for id==0. Yet, the data for id==1 actually exists in the database (as it can be retrieved using ids=[1]). This seems to make the database's behavior strange and uncontrollable.
because the data can be in different segment and milvus don't have the information there is a duplicate pk. this is by design and so far we can not do anything.
if you know the pk exist, you can use upsert to delete old one and insert a new one
from milvus.
Milvus doesn't guarantee primary key dedup. If both key is retrieved, new will overwrite old. but we don't guarantee this over write happens. so user has to ensure there is no over write happens
Whether or not it overwrites the old data, I think it's still within a logically controllable range. However, if a mistake is made by inserting the same primary key, as in my previous issue, when I query again using ids=[0,1], it only returns the data for id==0. Yet, the data for id==1 actually exists in the database (as it can be retrieved using ids=[1]). This seems to make the database's behavior strange and uncontrollable.
because the data can be in different segment and milvus don't have the information there is a duplicate pk. this is by design and so far we can not do anything.
if you know the pk exist, you can use upsert to delete old one and insert a new one
OK ,Thanks
from milvus.
I'd close this issue, and please free to file a new one if you met any new issues
from milvus.
Related Issues (20)
- [Enhancement]: add some metrics for restful server HOT 1
- [Bug]: access address was not malloc HOT 1
- [Bug]: When BulkInsert performs large data import tasks, there is a chance of encountering the error: "connection reset by peer: importing data failed." HOT 2
- [Bug]: Skip alloc timestamp improvement fails to apply to legacy guanrantee_ts sdk & restful API HOT 1
- [Bug]: querynode crash when running cluster in k8 HOT 49
- [Enhancement]: make management restful api idempotent HOT 2
- [Bug]: After reinstalling Milvus, the QueryNode restarted during the testing process. HOT 2
- [Bug]: "drop_partition" delete Partition Error: ERROR:pymilvus.decorators:RPC error: [has_partition], <ParamError: (code=1, message=`partition_name` value 1439114954827296 is illegal)>, HOT 3
- [Bug]: Service exception HOT 10
- [Bug]: bulksearch with (nq=500,nlist=1024,nprobe=32) is very slow,feature is orb feature with 256 dim and indexed by BIN_IVF_FLAT HOT 2
- [Bug]: bulksearch with (nq=500,nlist=1024,nprobe=32) is very slow,feature is orb feature with 256 dim and indexed by BIN_IVF_FLAT HOT 10
- [Feature]: Add a config for server http2 max stream
- [Bug]: GetQueryInfo return dropped segment dropped by L2 single compaction HOT 2
- [Enhancement]: skip creating index when there is no querynode HOT 4
- [Bug]: Datanode watch progress tickler not working HOT 1
- [Bug]: variable column batch size too big HOT 2
- [Bug]: load timeout HOT 4
- [Bug]: no idle index node found and failed to build index HOT 6
- [Bug]: querynode often restart when loading diskann index to local HOT 8
- [Feature]: how to install attu when using a vmware machine (ubuntu 22.04 arm 64 architecture) HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from milvus.