FLaNK-AIM Weekly for 06 May 2024

 

06-May-2024

https://www.youtube.com/@FLaNK-Stack




FLaNK / KNIFe AI / FLaNK-AIM Weekly

http://knifeai.org/

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup groups (NJ/NYC/Philly/Virtual).

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #136**

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

Articles

https://medium.com/@tspann/small-language-models-sml-for-the-win-ea0c6fee8061

https://medium.com/@tspann/maybe-four-smaller-open-llm-s-are-better-than-one-93f78fb69eb9

https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397

https://medium.com/@tspann/events-streams-flows-and-maps-22a8d27cd9b4

https://medium.com/@tspann/storing-meetup-user-data-as-events-dad3b1dc89f5

https://medium.com/@tspann/real-time-in-boston-part-1-0f92d7da3496

NSA AI Security https://www.nsa.gov/Press-Room/Press-Releases-Statements/Press-Release-View/Article/3741371/nsa-publishes-guidance-for-strengthening-ai-system-security/

https://zilliz.com/learn/Sentence-Transformers-for-Long-Form-Text

https://zilliz.com/zilliz-cloud-pipelines

https://huggingface.co/BAAI/bge-large-en-v1.5

https://github.com/cloudevents/sdk-python/blob/main/samples/http-json-cloudevents/client.py

https://docs.cloudera.com/machine-learning/cloud/release-notes/topics/ml-whats-new.html#ml_workspace_resource_tags

https://zilliz.com/blog/finding-right-fit-embedding-support-for-RAG-in-zilliz-cloud-pipelines-from-voyageai-openai-and-oss

https://hazelcast.com/glossary/streaming-data/

https://postgres.ai/blog/20220525-common-db-schema-change-mistakes

https://medium.com/cloudera-inc/consuming-rss-feeds-from-flink-sql-eaf33c1a5a23

https://medium.com/cloudera-inc/adding-generative-ai-results-to-sql-streams-513e1fd2a6af

https://www.linuxfoundation.org/press/lf-ai-data-foundation-launches-open-platform-for-enterprise-ai-opea

https://blog.mozilla.ai/local-llm-as-judge-evaluation-with-lm-buddy-prometheus-and-llamafile/

https://blog.mozilla.ai/open-source-in-the-age-of-llms/

https://www.pinecone.io/learn/structured-data/

https://medium.com/@stoty/a-bug-for-ages-fixing-time-zone-handling-in-apache-phoenix-e9934d7acd80

https://www.geeknarrator.com/blog/stream-processing/stream-processing-concepts

https://blog.cloudera.com/setting-up-and-getting-started-with-clouderas-new-sql-ai-assistant/

https://thenewstack.io/how-to-cure-llm-weaknesses-with-vector-databases/

https://dev.to/zilliz/exploring-bge-m3-and-splade-two-machine-learning-models-for-generating-sparse-embeddings-22p1

https://zilliz.com/learn/transforming-pdfs-into-insights-vectorizing-and-ingesting-with-zilliz-cloud-pipelines

https://zilliz.com/blog/how-to-evaluate-and-optimize-performance-of-milvus-storage

https://datavolo.io/2024/05/apache-nifi-designed-for-extension-at-scale/

Videos

Generative AI with Milvus https://www.youtube.com/watch?v=IfWIzKsoHnA

Four Models at Once https://youtu.be/xvNgsZyfo6A?si=zxwc9VcFc3o0vU3P

Search Slack https://www.youtube.com/watch?v=3ugppfb2kN8&t=5s&ab_channel=DatainMotion-HowToBeaStreamingEngineer

MBTA Transit Live with LLM https://www.youtube.com/watch?v=JGGY_uzQTdY&t=3s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

Events, Streams, Maps with Irish Rail https://www.youtube.com/watch?v=14CSQRfUWoE&t=684s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

FLaNK AI Channel https://www.youtube.com/@FLaNK-Stack

NiFi https://www.youtube.com/watch?v=m-ZoqHOYy_k

Slides

https://www.slideshare.net/slideshow/generative-ai-on-enterprise-cloud-with-nifi-and-milvus/267678399

https://www.slideshare.net/slideshow/conf42llmadding-generative-ai-to-realtime-streaming-pipelines/267269788

https://github.com/tspannhw/FLaNK-Milvus

https://medium.com/cloudera-inc/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://www.youtube.com/watch?v=ssoM5S87BBs

Events

May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx

https://twitter.com/DBTADataSummit/status/1778393005646397636

May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.

May 30, 2024: Conf42: Machine Learning. https://www.conf42.com/Machine_Learning_2024_Tim_Spann_enriching_generative_events

June 12, 2024: Budapest Data + ML Forum. Virtual. https://budapestml.hu/2024/en/speakers/

June 20, 2024: AI Camp Meetup. NYC.

Sept 24, 2024: JConf.Dev. Dallas. https://2024.jconf.dev/session/598816

Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara. https://www.developerweek.com/cloudx/

Nov 19, 2024: XtremePython. Online. https://xtremepython.dev/2024/

Cloudera Events https://www.cloudera.com/about/events.html

https://www.cloudera.com/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY25-Q1-AMER-WS-Cloudera-Now-Events-Page-P06&cid=701Hr000000tW6qIAE&internal_link=p06

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Tools

Cool Tool

Convert Spark SQL to Trino SQL https://github.com/linkedin/coral

Discount

Discount access to DataSummit 2024 https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR

© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack FLaNK-AIM with LLAMA 3

FLaNK AI Weekly for 29 April 2024

 

29-April-2024

Cool stuff happening in Trenton

https://www.meetup.com/trenton-makes-tech-_/




FLaNK / KNIFe AI Weekly

http://knifeai.org/

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup groups (NJ/NYC/Philly/Virtual).

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #135**

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

https://celeborn.apache.org/

Articles

https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397

https://medium.com/@tspann/events-streams-flows-and-maps-22a8d27cd9b4

https://medium.com/@tspann/storing-meetup-user-data-as-events-dad3b1dc89f5

https://medium.com/@tspann/real-time-in-boston-part-1-0f92d7da3496

https://thenewstack.io/apache-nifi-2-0-0-building-python-processors/

https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7

https://blog.cloudera.com/climate-and-sustainability-hackathon-meet-the-judges/

https://www.intel.com/content/www/us/en/developer/articles/technical/get-started-with-generative-ai.html

https://huggingface.co/blog/vlms

https://haystack.deepset.ai/blog/chatting-with-sql-databases-3-ways

https://www.denoise.digital/llama-3-get-started-with-llms/

https://www.pinecone.io/learn/chunking-strategies/

https://www.pinecone.io/blog/canopy-rag-framework/

Picking an Embedding Model https://www.pinecone.io/learn/series/rag/embedding-models-rundown/ https://huggingface.co/spaces/mteb/leaderboard

https://doordash.engineering/2024/04/23/building-doordashs-product-knowledge-graph-with-large-language-models/amp/

https://medium.com/airbnb-engineering/airbnb-brandometer-powering-brand-perception-measurement-on-social-media-data-with-ai-c83019408051

Retrieval Augmented Generation Assessment (RAGAS) Metrics-Driven Agent Development https://www.pinecone.io/learn/series/rag/ragas/

https://engineering.grab.com/data-observability

JSON Lines (JSONL) https://jsonlines.org/

https://www.timeplus.com/post/real-time-ai-oss-tools

https://www.jetson-ai-lab.com/research.html#meeting-schedule

https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product

https://zilliz.com/learn/pandas-dataframe-chunking-anf-vectorizing-with-milvus

Videos

Search Slack https://www.youtube.com/watch?v=3ugppfb2kN8&t=5s&ab_channel=DatainMotion-HowToBeaStreamingEngineer

MBTA Transit Live with LLM https://www.youtube.com/watch?v=JGGY_uzQTdY&t=3s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

Events, Streams, Maps with Irish Rail https://www.youtube.com/watch?v=14CSQRfUWoE&t=684s&pp=ygUOVGltIFNwYW5uIE5pRmk%3D

Building Real-Time Pipelines XTremeJ https://www.youtube.com/watch?v=SszeF57IdW4

Adding Generative AI to Real-Time Streaming Pipelines | Tim Spann | Conf42 LLMs 2024 https://www.youtube.com/watch?v=Yeua8NlzQ3Y

MLConf NYC 2022 https://www.youtube.com/watch?v=Vw-jlU8STBk

Summer School Data Science Festival https://www.youtube.com/watch?v=0G98z_fs_SQ

ScyllaDB Summit 2023 https://www.youtube.com/watch?v=ZwhoosP1UWU

https://www.youtube.com/watch?v=-_52DIIOsCE&ab_channel=JamesBriggs

https://www.youtube.com/watch?v=XFZ-rQ8eeR8

Slides

https://www.slideshare.net/slideshow/april-2024-nlit-cloudera-realtime-llm-streaming-2024/267269851

https://www.slideshare.net/slideshow/realtime-ai-streaming-ai-max-princeton/267269816

https://www.slideshare.net/slideshow/conf42llmadding-generative-ai-to-realtime-streaming-pipelines/267269788

Events

May 1, 2024: Gen AI in the Enterprise Cloud. Virtual. https://www.linkedin.com/events/7180985346103410688/comments/ https://lu.ma/q7pcfyjn

May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx

https://twitter.com/DBTADataSummit/status/1778393005646397636

May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.

June 12, 2024: Budapest Data + ML Forum. Virtual. https://budapestml.hu/2024/en/speakers/

June 20, 2024: AI Camp Meetup. NYC.

Sept 24, 2024: JConf.Dev. Dallas. https://2024.jconf.dev/session/598816

Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara. https://www.developerweek.com/cloudx/

Nov 19, 2024: XtremePython. Online. https://xtremepython.dev/2024/

Cloudera Events https://www.cloudera.com/about/events.html

https://www.cloudera.com/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY25-Q1-AMER-WS-Cloudera-Now-Events-Page-P06&cid=701Hr000000tW6qIAE&internal_link=p06

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Tools

Discount

Discount access to DataSummit 2024 https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR

© 2020-2024 Tim Spann