jeadie

✨ jeadie.xyz

πŸ“… Joined in 2022

πŸ”Ό 122 Karma

✍️ 68 posts

πŸŒ€
15 latest posts

Load

(Replying to PARENT post)

We’re building vector indexes into Datafusion for search (starting with S3 vectors).

Open source at https://github.com/spiceai/spiceai

πŸ‘€jeadieπŸ•‘1moπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

This is one of the ideas behind using DuckDB in github.com/spiceai/spiceai
πŸ‘€jeadieπŸ•‘5moπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

πŸ‘€jeadieπŸ•‘5moπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

This is a common feature now. If anything, for being so early to vector databases, Pinecone was rather late to integrating embeddings.

Timescale most recently added it but, yes a bunch of others: Weaviate, Spice AI, Marqo, etc.

πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

Why not just federate Postgres and parquet files? That way the query planner can push down as much of the query and reduce how much data has to move about?
πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

This looks functionally similar as using http://github.com/spiceai/spiceai with a postgreSQL data accelerator.
πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

Spice AI | Senior Software Engineer | GMT+10 (e.g. Australia) through GMT-7 (e.g. Seatle/SF/LA) | Remote | Full Time

Spice AI provides building blocks for data and AI-driven applications by composing real-time and historical time-series data, high-performance SQL query, machine learning training and inferencing, in a single, interconnected AI backend-as-a-service.

We just launched github.com/spiceai/spiceai, a unified SQL query interface and portable runtime to locally materialize, accelerate, and query data tables sourced from any database, data warehouse, or data lake.

We're hiring experienced software engineers, ideally with Rust and/or Golang production experience. We're focused on large data and distributed systems, experience in these is important too. More details: https://spice.ai/careers#section-open-positions

πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

And yes, Iceberg is very high up on our list
πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

Yes! It can connect to FlightSQL compatible servers (see https://docs.spiceai.org/data-connectors/flightsql ) and its also a FlightSQL compatible server
πŸ‘€jeadieπŸ•‘1yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

Have you seen github.com/marqo-ai/marqo? It does all this wrapping, and you don't even need to pay for OpenAI or pinecone
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

I'm very glad that this has some added funding. I am building a serverless API on the cloudflare edge network using GGML as the backbone --> tryinfima.com
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

"AI Native" catching on
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

I've tried both Chroma and Qdrant. I don't think Chroma lacks that much. Definitely newer, but is also a great product. I think cloud support coming Q3 2023
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

(Not affiliated with hyperDB)
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0

(Replying to PARENT post)

I've been using https://github.com/jdagdelen/hyperDB and it's been really easy to use. I think Clickhouse support is on the short-term roadmap.
πŸ‘€jeadieπŸ•‘2yπŸ”Ό0πŸ—¨οΈ0