MrPowers
Personal blog: https://neapowers.com/
Github: https://github.com/mrpowers
From New York, spend most of my time in Colombia & Brazil.
Speak Spanish fluently, learning Portuguese.
Joined in 2012
1,636 Karma
312 posts
(Replying to PARENT post)
Here are the queries: https://github.com/apache/sedona-spatialbench/blob/main/prin...
They should be fairly easy to replicate!
(Replying to PARENT post)
From the README:
> Update (August 2024): GeoPolars is blocked on Polars supporting Arrow extension types, which would allow GeoPolars to persist geometry type information and coordinate reference system (CRS) metadata. It's not feasible to create a geopolars.GeoDataFrame as a subclass of a polars.DataFrame (similar to how the geopandas.GeoDataFrame is a subclass of pandas.DataFrame) because polars explicitly does not support subclassing of core data types.
(Replying to PARENT post)
There are other good alternatives, such as GeoPandas and DuckDB Spatial. SedonaDB has Python/SQL APIs and is very fast. New features like full raster support and compatibility with lakehouse formats are coming soon!
(Replying to PARENT post)
But pretty awesome this individual is retiring from programming / taking a sabbatical. There is nothing wrong with taking some time off and pursuing other interests when you lose your passion.
(Replying to PARENT post)
Lots of engines like Polars, PyTorch, Spark, and Ray can read structured data from databases, but Lakehouses are more efficient.
Databases aren't as good for storing unstructured data.
Databases can also be much more expensive than a Data Lakehouse.
Databases are awesome and have lots of amazing use cases of course. Like you mentioned, data lakehouses are great for high data volume and throughput, but there are other use cases as well IMO.
(Replying to PARENT post)
Ballista is much less mature than Spark and needs a lot of work. It's awesome they're making Spark faster with Comet.
(Replying to PARENT post)
Medellin is surrounded by mountains and the contaminated air cannot escape. There didn't use to be many cars, but now that financing is available, the number of cars is growing significantly.
The hills are steep and old buses spew black smoke.
Here is some more info on pollution in Medellin: https://medellinguru.com/medellin-pollution/
Saying Medellin's temperature decreased by 2 degrees Celsius based on "Mejorar el microclima hasta 2°C" ("improve the microclimate by up to 2°C") is a misinterpretation. I think this article is quite misleading.
(Replying to PARENT post)
SedonaDB currently supports SQL, Python, R, and Rust APIs. We can support APIs for other languages in the future. That's another nice part about Rust: there are lots of libraries for exposing Rust projects through bindings in other languages.