Commit graph

10 commits

Author SHA1 Message Date
MauricioGarciaS
5a5df5f218
fix(redshift-connector): Solved issue while inserting from s3 to redshift (events table) (#1543) 2023-10-24 14:06:26 +02:00
MauricioGarciaS
852ce7b324
fix(redshift-connector): Redshift connector update (#1538)
* Updated sql events table

* Updated and fixed redshift connector python
2023-10-23 17:21:05 +02:00
MauricioGarciaS
440ebca03b
fix(connectors): small fix of varchar size (#1403) 2023-07-11 15:07:42 +02:00
MauricioGarciaS
ecbc42a03f
fix(connectors): Updated sql to varchar 8000 (#1402)
* fix(connectors): Changed sql from varchar to varchar 8000

* fix(connectors): changed cropping to size 8000 for str in worker task
2023-07-11 13:59:46 +02:00
MauricioGarciaS
28182b951e
Redshift connector (#1170)
* Updated dependencies for redshift connector, replaced os module with python-decouple module

* Updated service and images

* Updated message protocol, added exception for BatchMetadata when version is 0 (we apply old read method)

* fixed load error from s3 to redshift. null values for string columns are now empty strings ("")

* Added test file consumer_async.py: reads raw Kafka messages every 3 minutes and sends a background task to upload them to the cloud

* Added method to skip messages that are not inserted to cloud

* Added logs into consumer_async. Changed urls and issues in sessions table from list to string

* Split between messages for sessions table and for events table

* Updated redshift tables

* Fixed small issue in query redshift_sessions.sql

* Updated Dockerfiles. Cleaned logs of consumer_async. Updated/Fixed tables. Transformed NaN to NULL for VARCHAR columns

* Added error handler for sql dropped connection

* chore(docker): Optimize docker builds

Signed-off-by: rjshrjndrn <rjshrjndrn@gmail.com>

* Variables renamed

* Adding compression libraries

* Set default value of count events to 0 (instead of NULL) when event did not occur

* Added support for specific project tracking. Added PG handler to connect to sessions table

* Added method to update values in db connection for sessions ended and restarted

* Removing intelligent file copying

* chore(connector): Build file

Signed-off-by: rjshrjndrn <rjshrjndrn@gmail.com>

* Adding connection pool for pg

* Renaming and optimizing

* Fixed issue of missing information of sessions

---------

Signed-off-by: rjshrjndrn <rjshrjndrn@gmail.com>
Co-authored-by: rjshrjndrn <rjshrjndrn@gmail.com>
2023-05-02 14:02:57 +02:00
mauricio garcia suarez
7abe9f9680 Updated CH connector and event types 2022-07-20 10:54:53 +02:00
mauricio garcia suarez
de1977b0de Changed types of some messages and force type when pandas fails to recognize column type 2022-07-18 14:38:14 +02:00
mauricio garcia suarez
02cbeca771 deleted some prints 2022-07-15 17:15:56 +02:00
mauricio garcia suarez
b446147cb8 solved some minor issues 2022-07-15 17:12:22 +02:00
ourvakan
d3db0c0734 connectors 2021-05-18 17:10:08 +03:00