* feature(intelligent-search): Added API to connect to Llama.cpp in EC2 and filter the response into OR filters * updated sql to filter script and added init.sql for tables * feature(intelligent-search): Changed llama.cpp for llama in GPU now contained in API * Updated Dockerfile to use GPU and download LLM from S3 * Added link to facebook/research/llama * Updated Dockerfile * Updated requirements and Dockerfile base images * fixed minor issues: Not used variables, updated COPY and replace values * fix(intelligent-search): Fixed WHERE statement filter * feature(smart-charts): Added method to create charts using llama. style(intelligent-search): Changed names for attributes to match frontend format. fix(intelligent-search): Fixed vulnerability in requiments and small issues fix * Added some test before deploying the service * Added semaphore to handle concurrency --------- Co-authored-by: EC2 Default User <ec2-user@ip-10-0-2-226.eu-central-1.compute.internal>
19 lines
323 B
SQL
19 lines
323 B
SQL
CREATE TABLE IF NOT EXISTS mlruns.public.llm_data
|
|
(
|
|
user_id TEXT,
|
|
project_id BIGINT,
|
|
request TEXT,
|
|
response TEXT,
|
|
accuracy BOOL
|
|
);
|
|
|
|
CREATE TABLE IF NOT EXISTS mlruns.public.llm_metrics
|
|
(
|
|
load_time BIGINT,
|
|
sample_time BIGINT,
|
|
prompt_eval_time BIGINT,
|
|
eval_time BIGINT,
|
|
total_time BIGINT,
|
|
PARAMS jsonb
|
|
);
|
|
|