create account

Democratizing Data: Leveling The Playing Field by taskmaster4450

View this thread on: hive.blogpeakd.comecency.com
· @taskmaster4450 ·
$13.54
Democratizing Data: Leveling The Playing Field
Even the giants are having difficult getting the [data](https://inleo.io/@leoglossary/leoglossary-data) needed.


It does not require a great [investigator](https://inleo.io/@leoglossary/leoglossary-investigator) to come across headlines such as this:


 https://img.inleo.io/DQmdJiwBhghkd67rSGryDsYHa2BcjFZCicPRaB9f6DUcPnG/data1.png 

<center> [Source](https://www.theverge.com/2024/4/6/24122915/openai-youtube-transcripts-gpt-4-training-data-google)


 https://img.inleo.io/DQmQxjQn8X9kSeeQYrSw6BGeFefRWemr9mLodFM6EvK6Fkv/data3.png 

[Source](https://archive.ph/XmplZ#selection-635.0-655.467) </center>


People often say that data is the new [oil](https://inleo.io/@leoglossary/leoglossary-oil).  If this is the case, then we best wake up.  When the likes of [OpenAi](https://inleo.io/@leoglossary/leoglossary-openai), Anthropic, and others are on the prowl, this puts smaller entities at an extreme disadvantage.  It also hand a tremendous amount of power to these firms, ones that have not exactly shown themselves to be trustworthy in the past.


When it comes to [Web 3.0](https://inleo.io/@leoglossary/leoglossary-web-3-0), perhaps the greatest utility that it provides, at least for the moment, is the democratization of data.


## Democratized Data: The Equalizer


The reality is that without data, there is nothing.  It is where the [digital](https://inleo.io/@leoglossary/leoglossary-digital) [world](https://inleo.io/@leoglossary/leoglossary-world) is heading.


We are dealing with something much larger than simply seeing data [harvest](https://inleo.io/@leoglossary/leoglossary-harvest) to [sell](https://inleo.io/@leoglossary/leoglossary-sell) to advertisers.  Instead, we are watching the foundation of how the entire digital world will operate.


Once again, do we want this in the hands of Big Tech only.


To me, this is one of the most crucial moments we are facing.  The entire premise of Web 3.0 is based upon this.


Here is the problem:


When a major [technology](https://inleo.io/@leoglossary/leoglossary-technology) [company](https://inleo.io/@leoglossary/leoglossary-company-business) gets a hold of this data, it trains a model that is utilized by millions of people.  There is a natural feedback loop where the results of the engagement generate more data.  Guess who owns this data?


Another way to look at it is the model is generating synthetic data for the company.  It is closed, not available to the public.  This further shifts the playing field in favor of these entities.


Naturally, any data that is open these companies can feed in.  The difference is that so can anyone.  It is not only relegated to the major firms.


Of course, when you have a large enough legal team, there are other avenues.


<center>  https://img.inleo.io/DQmcmsBuDYAGbQtmd9wCeNtbEqoyK2ajS6Fex5PhFNmT46c/data2.png 
[Source](https://archive.ph/iOOmg)</center>


They are not above taking the data (if they can) and dealing with the consequences later.  The first headline depicting OpenAI training its model on 1 million hours of [YouTube](https://inleo.io/@leoglossary/leoglossary-youtube) [videos](https://inleo.io/@leoglossary/leoglossary-video) is telling.  It is against Google's terms of service but OpenAI doesn't care.


The challenge is if a smaller entity tried this, and was caught, Google's legal team would bury them.


It is why we need to generate as much open data as possible.


## Web 3.0 AI Solutions


Fortunately, there are some solutions that are popping up.  We see a few models that are worthy of testing.  The key is to alter the feedback loop.


Before getting to the Web 3.0 solutions, we can start by undertaking a simple strategy.  Each [time](https://inleo.io/@leoglossary/leoglossary-time) we prompt something on a closed model, simply take the result and [post](https://inleo.io/@leoglossary/leoglossary-post-hive) it in an open [database](https://inleo.io/@leoglossary/leoglossary-database).  There are a number of options out there, most of them tied to [blockchain](https://inleo.io/@leoglossary/blockchain).


We can also use some of the AI models that are being appearing. 


The two that I utilize are [Venice.ai](https://venice.ai/) and [Qtum.ai](https://qtum.ai/solstice).  Qtum has both a [chatbot](https://inleo.io/@leoglossary/leoglossary-chatbot) and an [image](https://inleo.io/@leoglossary/leoglossary-image) generator (the chatbot is linked).  


As an aside, when I did a comparison between Venice and Claude3 (the free version), Venice did hold up.  These were simply prompts, so it might not perform when asking about coding or hard mathematical problems.  Nevertheless, for simple prompts, it is decent.


Venice utilizes [open source](https://inleo.io/@leoglossary/leoglossary-open-source) models.  It has things such as Llama3.1 and Flux built in.


This model also is [privacy](https://inleo.io/@leoglossary/leoglossary-privacy) focused, placing the results locally.  There is no need to log in or create an [account](https://inleo.io/@leoglossary/leoglossary-account) to utilize it.

Here is a comparison from the Venice.ai website.


<center> https://img.inleo.io/DQmbfzb2q1ZvE4m7kBZwJvLTadXXHX5Kw1NUsuWxd1DyesJ/image.png </center>


Qtum is a blockchain based solution.  They are taking a different approach in that they are looking to implement a [NFT](https://inleo.io/@leoglossary/leoglossary-non-fungible-token-nft) system which can be created as something is generated by the individual.  This means that using the model (for text or image) will, at some point, provide ownership.


Of course, all this data generated can be opened up simply by posting it to a [permissionless](https://inleo.io/@leoglossary/leoglossary-permissionless-blockchain) database.  Blockchain is very valuable in this regard since most are designed in this fashion.


## Open Source Is Not Enough


It is great that Meta opened up Llama.  However, that is still not enough.  Zuckerberg is positioning his company to dominate the open source [space](https://inleo.io/@leoglossary/leoglossary-space).  His goal is to control the [ecosystem](https://inleo.io/@leoglossary/leoglossary-ecosystem-digital), with Meta at the center.


Even if we look at the data, using Llama3.1 still provides the feedback loop to Zuck.  He gets all responses generated off Meta.ai.  Anyone can get a hold of the [software](https://inleo.io/@leoglossary/leoglossary-software), along with the weights, but the data is off limits.  


Certainly, the fact that other entities are feeding that into their models does help.  This is why something like Venice, in spite of the limitations on prompts, is an option.


Ultimately, we have to ensure the feedback loop of data is in place.  This means more being posted to permissionless databases.  In my mind, this is essential to the essence of Web 3.0.


We are not going to have the next generation [Internet](https://inleo.io/@leoglossary/leoglossary-internet) if we are dependent upon closed systems that are housed on company controlled [servers](https://inleo.io/@leoglossary/leoglossary-server-computer).


If we dig in, much of what we are discussing is still based upon the client-server architecture.  That is the epitome of [Web 2.0](https://inleo.io/@leoglossary/leoglossary-web-2-0).


Data democratization is going to help meet the growing demand for data.  People have complained for [years](https://inleo.io/@leoglossary/leoglossary-year) how Big Tech "steals" it.  If that is the case, why do people keep feeding it?  Naturally, the ability to completely get away from these companies is hindered.  That said, we see people volunteering to support them wherever they can.


AI models are just another example.  How many are running to [ChatGPT](https://inleo.io/@leoglossary/leoglossary-chatgpt), Gemini, or Grok?   As the technology expands, habits will likely take over and these companies will have people locked in.


This has to change.  Firms are paying big [money](https://inleo.io/@leoglossary/leoglossary-money) for data, creating a new system that is, by default, going to be closed.  The solution is for everyone to realize how important the democratization of data truly is.


____


<center> [What Is Hive](https://inleo.io/@leoglossary/leoglossary-what-is-hive) </center>





Posted Using [InLeo Alpha](https://inleo.io/@taskmaster4450/democratizing-data-leveling-the-playing-field-dbu)
👍  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and 897 others
👎  
properties (23)
authortaskmaster4450
permlinkdemocratizing-data-leveling-the-playing-field-dbu
categoryhive-167922
json_metadata{"app":"leothreads/0.3","format":"markdown","tags":["hive-167922","ai","chatbots","data","democratization","bigtech","llms","mancave","neoxian","proofofbrain"],"canonical_url":"https://inleo.io/@taskmaster4450/democratizing-data-leveling-the-playing-field-dbu","links":["https://inleo.io/@leoglossary/leoglossary-data)","https://inleo.io/@leoglossary/leoglossary-investigator)","https://www.theverge.com/2024/4/6/24122915/openai-youtube-transcripts-gpt-4-training-data-google)","https://archive.ph/XmplZ#selection-635.0-655.467)","https://inleo.io/@leoglossary/leoglossary-oil).","https://inleo.io/@leoglossary/leoglossary-openai),","https://inleo.io/@leoglossary/leoglossary-web-3-0),","https://inleo.io/@leoglossary/leoglossary-digital)","https://inleo.io/@leoglossary/leoglossary-world)","https://inleo.io/@leoglossary/leoglossary-harvest)","https://inleo.io/@leoglossary/leoglossary-sell)","https://inleo.io/@leoglossary/leoglossary-technology)","https://inleo.io/@leoglossary/leoglossary-company-business)","https://archive.ph/iOOmg)</center>","https://inleo.io/@leoglossary/leoglossary-youtube)","https://inleo.io/@leoglossary/leoglossary-video)","https://inleo.io/@leoglossary/leoglossary-time)","https://inleo.io/@leoglossary/leoglossary-post-hive)","https://inleo.io/@leoglossary/leoglossary-database).","https://inleo.io/@leoglossary/blockchain).","https://venice.ai/)","https://qtum.ai/solstice).","https://inleo.io/@leoglossary/leoglossary-chatbot)","https://inleo.io/@leoglossary/leoglossary-image)","https://inleo.io/@leoglossary/leoglossary-open-source)","https://inleo.io/@leoglossary/leoglossary-privacy)","https://inleo.io/@leoglossary/leoglossary-account)","https://inleo.io/@leoglossary/leoglossary-non-fungible-token-nft)","https://inleo.io/@leoglossary/leoglossary-permissionless-blockchain)","https://inleo.io/@leoglossary/leoglossary-space).","https://inleo.io/@leoglossary/leoglossary-ecosystem-digital),","https://inleo.io/@leoglossary/leoglossary-software),","https://inleo.io/@leoglossary/leoglossary-internet)","https://inleo.io/@leoglossary/leoglossary-server-computer).","https://inleo.io/@leoglossary/leoglossary-web-2-0).","https://inleo.io/@leoglossary/leoglossary-year)","https://inleo.io/@leoglossary/leoglossary-chatgpt),","https://inleo.io/@leoglossary/leoglossary-money)","https://inleo.io/@leoglossary/leoglossary-what-is-hive)","https://inleo.io/@taskmaster4450/democratizing-data-leveling-the-playing-field-dbu)"],"images":["https://img.inleo.io/DQmdJiwBhghkd67rSGryDsYHa2BcjFZCicPRaB9f6DUcPnG/data1.png","https://img.inleo.io/DQmQxjQn8X9kSeeQYrSw6BGeFefRWemr9mLodFM6EvK6Fkv/data3.png","https://img.inleo.io/DQmcmsBuDYAGbQtmd9wCeNtbEqoyK2ajS6Fex5PhFNmT46c/data2.png","https://img.inleo.io/DQmbfzb2q1ZvE4m7kBZwJvLTadXXHX5Kw1NUsuWxd1DyesJ/image.png"],"isPoll":false,"dimensions":{}}
created2024-09-02 14:11:45
last_update2024-09-02 14:11:45
depth0
children11
last_payout2024-09-09 14:11:45
cashout_time1969-12-31 23:59:59
total_payout_value0.134 HBD
curator_payout_value13.408 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length8,395
author_reputation6,633,754,653,229,062
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries
0.
accounttaskmaster4450le
weight9,900
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,785,476
net_rshares109,435,225,519,028
author_curate_reward""
vote details (962)
@bbhbot ·
<center><p>@taskmaster4450! <b>@fiberfrau likes your content!</b> so I just sent 1 <b>BBH</b> to your account on behalf of @fiberfrau. <sub>(4/20)</sub></p>
<p><!--<img src="https://i.imgur.com/QwsegY0.png">--></p></center>
properties (22)
authorbbhbot
permlinkre-taskmaster4450-20240902t221234
categoryhive-167922
json_metadata""
created2024-09-02 22:12:33
last_update2024-09-02 22:12:33
depth1
children0
last_payout2024-09-09 22:12:33
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length223
author_reputation2,761,745,564,350
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,793,096
net_rshares0
@daniasi ·
@daniasi "**We are not going to have the next generation Int..."
**We are not going to have the next generation Internet if we are dependent upon closed systems**, unfortunately many don't care about this. They are ok wherever the breeze will blow them to. 
I only wish web3 had its personal infrastructure, this to me poses a limitation.
properties (22)
authordaniasi
permlinkre-taskmaster4450-2foqqceky
categoryhive-167922
json_metadata{"app":"leothreads/0.3","format":"markdown","tags":["leofinance"],"canonical_url":"https://inleo.io/threads/view/daniasi/re-taskmaster4450-2foqqceky","isPoll":false,"pollOptions":{},"dimensions":[]}
created2024-09-02 20:16:00
last_update2024-09-02 20:16:00
depth1
children0
last_payout2024-09-09 20:16:00
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length273
author_reputation70,972,505,353,528
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,791,407
net_rshares0
@fiberfrau ·
!BBH
properties (22)
authorfiberfrau
permlinkre-taskmaster4450-202492t18124850z
categoryhive-167922
json_metadata{"content_type":"general","type":"comment","tags":["hive-167922","ai","chatbots","data","democratization","bigtech","llms","mancave","neoxian","proofofbrain"],"app":"ecency/3.1.5-mobile","format":"markdown+html"}
created2024-09-02 22:12:06
last_update2024-09-02 22:12:06
depth1
children0
last_payout2024-09-09 22:12:06
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length4
author_reputation44,371,631,165,714
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,793,090
net_rshares0
@leophile ·
Data are big part of the systematic business that social media giants use, and they commercialize them with their tools that they have available to their hands. What I think with AI being included to the system, might create even graver revenue added to their company assets. 
Lot of other factors also work with it. 
properties (22)
authorleophile
permlinkre-taskmaster4450-202492t211948395z
categoryhive-167922
json_metadata{"tags":["hive-167922","ai","chatbots","data","democratization","bigtech","llms","mancave","neoxian","proofofbrain"],"app":"ecency/3.2.0-vision","format":"markdown+html"}
created2024-09-02 15:19:48
last_update2024-09-02 15:19:48
depth1
children0
last_payout2024-09-09 15:19:48
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length318
author_reputation7,327,482,239,351
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,786,741
net_rshares0
@newsmanganow ·
$0.02
@newsmanganow "Open source is awesome. but the current available ..."
Open source is awesome.
but the current available solutions are linked to Blockchain not on Blockchain (which is kinda impossible ig). 
👍  , , , , , , ,
properties (23)
authornewsmanganow
permlinkre-taskmaster4450-hn4t3qcm
categoryhive-167922
json_metadata{"app":"leothreads/0.3","format":"markdown","tags":["leofinance"],"canonical_url":"https://inleo.io/threads/view/newsmanganow/re-taskmaster4450-hn4t3qcm","isPoll":false,"pollOptions":{},"dimensions":[]}
created2024-09-02 15:24:39
last_update2024-09-02 15:24:39
depth1
children0
last_payout2024-09-09 15:24:39
cashout_time1969-12-31 23:59:59
total_payout_value0.012 HBD
curator_payout_value0.012 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length136
author_reputation44,632,058,327
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,786,818
net_rshares107,213,548,626
author_curate_reward""
vote details (8)
@outwars ·
$0.08
I think Google will also be going after OpenAI. I just think they are currently accumulating data and evidence to use in court. OpenAI is a big competitor, and I wouldn't expect Google to let them do as they please with their data.

I guess the adoption of democratized data by the masses can also hinge on their views on AI. Since a lot of people are currently against AI because it is taking jobs, I don't see many moving to open data any time soon.
👍  
properties (23)
authoroutwars
permlinkre-taskmaster4450-sj7upd
categoryhive-167922
json_metadata{"tags":["hive-167922"],"app":"peakd/2024.8.7"}
created2024-09-03 02:53:39
last_update2024-09-03 02:53:39
depth1
children0
last_payout2024-09-10 02:53:39
cashout_time1969-12-31 23:59:59
total_payout_value0.038 HBD
curator_payout_value0.038 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length451
author_reputation238,349,364,596,711
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,796,275
net_rshares310,074,173,969
author_curate_reward""
vote details (1)
@revolverocelotyt ·
>The solution is for everyone to realize how important the democratization of data truly is.

Democratization of data is our requirement. 
properties (22)
authorrevolverocelotyt
permlinkre-taskmaster4450-202492t192010400z
categoryhive-167922
json_metadata{"tags":["hive-167922","ai","chatbots","data","democratization","bigtech","llms","mancave","neoxian","proofofbrain"],"app":"ecency/3.2.0-vision","format":"markdown+html"}
created2024-09-02 14:20:12
last_update2024-09-02 14:20:12
depth1
children0
last_payout2024-09-09 14:20:12
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length138
author_reputation15,369,377,488,717
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,785,631
net_rshares0
@rzc24-nftbbg ·
I'm not sure if this is related to data democratization. Publishers and authors of books find it challenging to insist on copyright law when there are websites sharing ebooks almost for free. 
properties (22)
authorrzc24-nftbbg
permlinkre-taskmaster4450-sj8wy5
categoryhive-167922
json_metadata{"tags":["hive-167922"],"app":"peakd/2024.8.7"}
created2024-09-03 16:39:45
last_update2024-09-03 16:39:45
depth1
children0
last_payout2024-09-10 16:39:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length192
author_reputation129,334,971,460,155
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,807,054
net_rshares0
@shortsegments ·
@shortsegments "I hope to get one of those fabled prompt engineer ..."
I hope to get one of those fabled prompt engineer positions teaching AI
properties (22)
authorshortsegments
permlinkre-taskmaster4450-bjcdg118
categoryhive-167922
json_metadata{"app":"leothreads/0.3","format":"markdown","tags":["leofinance"],"canonical_url":"https://inleo.io/threads/view/shortsegments/re-taskmaster4450-bjcdg118","isPoll":false,"pollOptions":{},"dimensions":[]}
created2024-09-03 04:31:00
last_update2024-09-03 04:31:00
depth1
children0
last_payout2024-09-10 04:31:00
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length71
author_reputation667,547,193,649,674
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,797,263
net_rshares0
@uter ·
To achieve a fair and equitable technological system, it is essential to decentralize information. If this is achieved, it could homogenize opportunities and increase collaboration in innovation, allowing users to have greater control over their data.
properties (22)
authoruter
permlinkre-taskmaster4450-202492t133311592z
categoryhive-167922
json_metadata{"tags":["hive-167922","ai","chatbots","data","democratization","bigtech","llms","mancave","neoxian","proofofbrain"],"app":"ecency/3.2.0-vision","format":"markdown+html"}
created2024-09-02 17:33:06
last_update2024-09-02 17:33:06
depth1
children0
last_payout2024-09-09 17:33:06
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length251
author_reputation326,967,935,452
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,789,047
net_rshares0
@vimukthi ·
It has been almost a half a decade since I last heart about Qtum. It made me think back into 2017 when I became a part of cryptosphere. 
👎  ,
properties (23)
authorvimukthi
permlinkre-taskmaster4450-sj70jg
categoryhive-167922
json_metadata{"tags":["hive-167922"],"app":"peakd/2024.8.7"}
created2024-09-02 16:02:06
last_update2024-09-02 16:02:06
depth1
children0
last_payout2024-09-09 16:02:06
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length136
author_reputation491,643,536,842,063
root_title"Democratizing Data: Leveling The Playing Field"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id136,787,562
net_rshares-41,522,279,672
author_curate_reward""
vote details (2)