PDFPLUMBER: Extract Data You Need With This Super Easy To Use Python Module by geekgirl

hive-148441 · @geekgirl · Nov 16 '21

<center>![pytonpdf.jpg](https://images.hive.blog/DQmVGgeToFjpmq85FnwsDNQmn644EysojtZh92b1ZrZrrUs/pytonpdf.jpg)</center>

I love Python! It is easy to learn. It is fun to use. But most importantly, it saves time. Time is the most precious asset we all have. Often we spend our time doing repetitive work over and over again. Computers are really good at repetitive work, and they do it more efficiently. To tell computers what to do, we need a way to communicate with them. This is where programming languages come in. Learning any programming language is a big task. Python makes learning how to code easy and accessible to anybody with a little effort. With the right mindset, anybody can learn the basics of Python, use it for daily repetitive tasks, and ultimately save time.

Python has a big community and many libraries available to tackle various tasks. One of the Python modules I have been using lately is `pdfplumber`. As the name suggests, this module works with PDF files and helps with extracting relevant data.

PDF is one of the most widely used document formats. If your business, work, or school activities involve any documents, chances are you are familiar with PDF files. What if your daily activities involve reading through large amounts of PDF documents with many pages? Over time we can get more efficient and effective at processing these documents manually. But we still have physical limitations and end up spending countless hours on such repetitive tasks.

Using `pdfplumber`, we can tell the computer to do the repetitive parts of the task: identifying what is needed, extracting the relevant data, and maybe even using this data for further analysis or storing it for future use and comparison. This is not the only module that helps with extracting data from PDF files. There are many more solutions out there. I found this one to be the easiest to understand and use. And it just works. If you know of any better solutions, feel free to let me know in the comments.

`pdfplumber` has great documentation with examples that demonstrate how it works. Please visit the [pdfplumber GitHub page](https://github.com/jsvine/pdfplumber) for the details.

The most important feature I have been using is extracting text from PDF files. The first step is to open the file and look at its pages:

```
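import pdfplumber

# Open the PDF; the with-block closes the file when we are done.
with pdfplumber.open("path/to/file.pdf") as pdf:
    pages = pdf.pages      # list of page objects
    first_page = pages[0]

    print(first_page.page_number)  # page numbers start at 1
    print(first_page.width)
    print(first_page.height)
    print(len(first_page.chars))   # number of characters on the page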
```
<br>
`pdf.pages` in the code above returns a list of all pages, each one a page object. The `.page_number`, `.width`, and `.height` properties return these self-explanatory values. `.chars` returns a list of all characters on the page, and each character carries many useful properties of its own. This can be used for more complex data extraction. I will share more about `.chars` a bit later.

What makes `pdfplumber` awesome and super easy to use is its line-by-line text extraction. Take a look at the following code.

```
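import pdfplumber

with pdfplumber.open("path/to/file.pdf") as pdf:
    pages = pdf.pages
    for page in pages:
        # Depending on the pdfplumber version, extract_text() may return
        # None for a page without text, so fall back to an empty string.
        text = (page.extract_text() or "").split('\n')
        print(len(text))  # size of the line list for this page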
```

This code reads the PDF file and stores its pages in the `pages` variable. Then we iterate through the pages and extract the text of each one. Splitting the extracted text on newlines gives us a list with one item per line of text. If we know what documents we are working with, we can identify certain text patterns to keep the text we need and throw away the rest.

Since the text lines are already in the order they appear in the document, we can build more useful code based on what text appears after certain text patterns. This line-by-line text extraction may seem very simple, but it is very powerful and saves me a lot of time.
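As a rough illustration of the "text after a pattern" idea, here is a minimal sketch in plain Python. The helper name `lines_after` and the sample lines are made up for this example and are not part of `pdfplumber`; in practice the list would come from splitting the extracted page text, as in the code above.

```python
def lines_after(lines, marker, count=1):
    """Return the `count` lines that follow each line containing `marker`.

    Hypothetical helper written for this post, not a pdfplumber function.
    """
    found = []
    for i, line in enumerate(lines):
        if marker in line:
            # Slicing past the end of the list is safe; it just yields fewer lines.
            found.extend(lines[i + 1 : i + 1 + count])
    return found


# Made-up lines standing in for the text extracted from one page.
sample_lines = [
    "Invoice Number",
    "INV-1001",
    "Billing Address",
    "123 Example Street",
    "Springfield",
]

print(lines_after(sample_lines, "Invoice Number"))
print(lines_after(sample_lines, "Billing Address", count=2))
```

A plain substring check is enough when the labels are fixed. If the labels vary between documents, a regular expression could take the place of `marker in line`.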

If you want to build more complex algorithms for extracting the data you need, the `.chars` property of the page can be very helpful. It gives you one character at a time along with a lot of information about it, like its value, font, size, and x and y locations on the page. To see the full list of `.chars` properties, visit the GitHub link above and/or experiment in your code.
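To give a feel for what working with characters might look like, here is a small sketch that groups characters by font size, which is one possible way to tell headings from body text. It assumes each character is a dict with `text` and `size` keys, as in the `pdfplumber` documentation. The function name `text_by_font_size` and the sample data are invented for this example, so the snippet runs without a PDF file.

```python
from collections import defaultdict


def text_by_font_size(chars):
    """Group character text by rounded font size.

    `chars` is expected to look like `page.chars`: a list of dicts that
    include at least the "text" and "size" keys.
    """
    grouped = defaultdict(str)
    for char in chars:
        grouped[round(char["size"])] += char["text"]
    return dict(grouped)


# Hand-written stand-in for `page.chars`, trimmed to a few keys.
sample_chars = [
    {"text": "H", "size": 18.0, "fontname": "Helvetica-Bold", "x0": 72.0, "top": 70.0},
    {"text": "i", "size": 18.0, "fontname": "Helvetica-Bold", "x0": 85.0, "top": 70.0},
    {"text": "o", "size": 10.0, "fontname": "Helvetica", "x0": 72.0, "top": 100.0},
    {"text": "k", "size": 10.0, "fontname": "Helvetica", "x0": 78.0, "top": 100.0},
]

print(text_by_font_size(sample_chars))
```

With a real document, you would pass `first_page.chars` instead of `sample_chars`.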

This module can also extract various other objects in a PDF file, like lines, rectangles, curves, annotations, and images. They all have properties similar to those of the char object. Moreover, `pdfplumber` can help with table extraction and has a visual debugging feature.
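For table extraction, a page's `extract_table()` method returns the largest table on the page as a list of rows, where each row is a list of cell values. The sketch below turns such a list into dictionaries keyed by the header row. The helper names `table_to_records` and `first_page_records` are made up for this example, and it assumes the first row of the table is a header, which will not be true for every document.

```python
def table_to_records(table):
    """Turn a table (list of rows) into a list of dicts keyed by the header row."""
    if not table:
        # Covers both an empty list and None (no table found).
        return []
    header, *rows = table
    # Empty cells can come back as None, so replace them with "".
    keys = [(cell or "").strip() for cell in header]
    return [
        {key: (cell or "").strip() for key, cell in zip(keys, row)}
        for row in rows
    ]


def first_page_records(path):
    """Extract the largest table on the first page of the PDF at `path`."""
    import pdfplumber  # imported here so table_to_records can be tried without it

    with pdfplumber.open(path) as pdf:
        return table_to_records(pdf.pages[0].extract_table())


# Made-up table in the same shape that extract_table() returns.
sample_table = [
    ["Item", "Qty"],
    ["Pens", "12"],
    ["Paper", None],
]

print(table_to_records(sample_table))
```

With a real file, the call would be `first_page_records("path/to/file.pdf")`.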

If you work with PDF files a lot and use Python, give this module a try. I hope it can help you automate some tasks and save time as well. If you already use it, let me know about your experience with the module in the comments.

@ace108 · Nov 17 '21

Cool. Looks like this ranks higher than PyPDF2.
Thanks for the information.

@geekgirl · Nov 17 '21

I was going to try pypdf2 next. Haven't tried it yet.

@anomadsoul · Nov 17 '21 (edited)

Learning Python is on my list once I am done with react and nodejs, so I'm saving this post for later (right now I am pretty sure I am just going to be confused and lonely) and I'll be back in around one month (hopefully since I am putting in like 8-10 hours a day to learn to code) to see what's this all about :D

@geekgirl · Nov 17 '21

I remember seeing that you were learning javascript. I always wanted to learn react too. That is awesome. When you get a chance you should look into threejs. Looking forward to seeing some cool apps from you.

@anomadsoul · Nov 17 '21

I'm still there and damn, I'm loving every step of the way although I'm getting a little too obsessed with progress and some days I go on for too long without breaks, so I gotta pace myself. 
I will definitely check threejs (never heard of it). I hope that at some point of early 2022 I am able to start developing, if so, you are definitely on the list of hivers I'll tell before release :D

@benthomaswwd · Nov 16 '21

Sounds like a handy tool. Thanks for sharing, very informative. Have the best day.

@emeka4 · Nov 16 '21

Thanks for updating us on stuff like this, it's really awesome. We live in a world where technology has gone viral, with the aim of making work easier and faster for us to handle, and it's also nice to learn about Python programming.

@gabrielatravels · Nov 17 '21

I kept saying to myself that I should start learning coding and especially Python. Everything looks so easy when it's explained by someone else, but when it's your turn, things are different. 🙄

@geekgirl · Nov 18 '21

You can do it.

@videoaddiction · Nov 16 '21

It has always been a mess to copy text from a PDF file to Word or Notepad. It seems it will be easy to do with Python.
