In 2019, I was working at Advanced Nutes, managing the IT, Digital Marketing, and Data Science / Analytics teams, and I was trying to find an edge to basically breed ahead of the curve. This may still be a dumb idea, but I think it holds merit in a scientific way.
I hired a smart young gal who had little background in cannabis but was a genius with data. She clued me in that the Open Cannabis Project was closing its database, which she had used for a thesis project in her Python programming class, and that if I wanted the data before it was gone, I should code a scraper and retrieve it. Well, I did. Then I found several other databases from US labs and did the same. After a few days, I had over 20k results, all in different formats. I spent weeks cleaning up the data and making it useful. I added new columns that calculate important factors (THC:CBD ratio, terp %, cannabinoid %, etc.). I then aligned all the data into one uniform datasheet on G-Drive.
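For anyone curious what those calculated columns look like in practice, here is a rough sketch in Python. It is not the actual cleanup code behind the datasheet. The column names (`thc`, `cbd`, `cbg`, `myrcene`, `ocimene`, `limonene`), the `add_derived_columns` helper, and the sample row are all made up for illustration, and the real sheet tracks more analytes than this.

```python
def add_derived_columns(row,
                        cannabinoids=("thc", "cbd", "cbg"),
                        terpenes=("myrcene", "ocimene", "limonene")):
    """Return a copy of one lab result with total and ratio columns added.

    All column names here are hypothetical; values are percent by weight.
    """
    def value(name):
        # Labs report missing analytes differently, so treat None and
        # blank strings as 0.0 to keep the totals computable.
        raw = row.get(name)
        return float(raw) if raw not in (None, "") else 0.0

    out = dict(row)
    out["cannabinoid_pct"] = round(sum(value(c) for c in cannabinoids), 3)
    out["terp_pct"] = round(sum(value(t) for t in terpenes), 3)

    # The ratio is undefined when no CBD was detected, so store None
    # instead of dividing by zero.
    cbd = value("cbd")
    out["thc_cbd_ratio"] = round(value("thc") / cbd, 3) if cbd > 0 else None
    return out


# Made-up sample row, not a real test result.
sample = {"strain": "Example A", "thc": "18.0", "cbd": "0.5",
          "myrcene": 0.4, "ocimene": 0.25}
result = add_derived_columns(sample)
```

Storing `None` for the ratio when CBD is zero is a judgment call; a spreadsheet version might leave the cell blank instead.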
For the last 5 years I have intended to turn this into a visualization platform, with AI and ML to help growers pair cultivars that will breed toward intentional outcomes (new medicines, resistance traits due to specific terps, terp + cannabinoid combos for cancers, etc.). I am willing to open up this work beyond my own G-Drive to anyone else who wants to work along this route to breeding, as I feel it holds the most value for my time personally: being able to heal others and myself.
Here’s a snippet of what I’m talking about:
I really want this to play a part in some projects, as I think the science speaks for itself and it is worth a shot to use this to nail a trait while keeping gene pools more open than a tight inbred selfing project. Maybe finding that Ocimene slayer in the US and mashing it with that Ocimene slayer in South Africa will produce heterosis on the genes that produce Ocimene, whereas inbreeding would possibly dilute the gene. IDK, looking for people with smarter ideas and more experience.
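To make the "find the slayer in each place" idea concrete, here is a minimal Python sketch of how a shortlist could be pulled from the sheet. The `region`, `strain`, and `ocimene` column names, both helper functions, and the sample rows are assumptions for illustration only. A high lab number says nothing on its own about the genetics behind it, so this only narrows down candidates to look at.

```python
from itertools import combinations


def top_by_region(rows, analyte, n=1):
    """Group rows by region and keep the n highest results for one analyte."""
    grouped = {}
    for row in rows:
        grouped.setdefault(row["region"], []).append(row)
    return {
        region: sorted(group, key=lambda r: r[analyte], reverse=True)[:n]
        for region, group in grouped.items()
    }


def cross_region_pairs(rows, analyte, n=1):
    """Pair each region's top results with the top results of every other region."""
    tops = top_by_region(rows, analyte, n)
    pairs = []
    # Only pair across regions, never within one, to keep the gene pool open.
    for region_a, region_b in combinations(sorted(tops), 2):
        for a in tops[region_a]:
            for b in tops[region_b]:
                pairs.append((a["strain"], b["strain"]))
    return pairs


# Made-up rows, not real test results.
rows = [
    {"strain": "A", "region": "Colorado", "ocimene": 0.9},
    {"strain": "B", "region": "Colorado", "ocimene": 0.2},
    {"strain": "C", "region": "Southern California", "ocimene": 0.7},
]
pairs = cross_region_pairs(rows, "ocimene")
```

Raising `n` widens the shortlist per region, which may matter more than the single top result since one lab test can be an outlier.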
Currently the “Big List” has 12,880 real test results (after removing unusable data) from the last decade. The data comes from labs in Southern California and Colorado, plus the original Open Cannabis Project data. I am open to adding datasets if the data is good, lmk.
Here’s how you can get involved…
- Want access to view the list and sort it as you wish for your own intentional breeding? No prob, lmk and I can give you viewing access.
- Work at a lab, know a lab manager, or have a dataset? Let’s chat so I can verify the data will work and it can be added successfully.
- Want to support the project? Tell anyone that you know who could benefit from the “Big List” and spread the word!
Thanks OG! Let’s give the tools to the people so we can all breed the medicines that work for us and our loved ones.
Panda