tv.wp.pl Logos for program info.

Floyd
Offline
Donator
Joined: 5 years
Last seen: 1 month

Does anyone know what a scrub looks like that grabs the picture for the EPG description of a given TV program?

I'm not talking about the TV channel logo.

I found where the show's image is stored in the page source, but I can't get it to work with this scrub:

In the web page:
Pytanie na śniadanie

Webgrab:
showicon.scrub {single|"thumbnailUrl": "||"|"}

mat8861
Offline
WG++ Team member, Donator
Joined: 9 years
Last seen: 16 hours

You're almost there. Example: showicon.scrub {single|src="||"|"}
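
For readers new to this kind of scrub, here is a rough Python sketch of the idea: find a start string, then take everything up to the next end string. It is only an approximation for illustration, not how WebGrab+ is implemented; the function name scrub_single and the sample HTML are hypothetical.

```python
from typing import Optional


def scrub_single(text: str, start: str, end: str) -> Optional[str]:
    """Return the text between the first `start` and the next `end`, or None."""
    begin = text.find(start)
    if begin == -1:
        return None
    begin += len(start)
    stop = text.find(end, begin)
    if stop == -1:
        return None
    return text[begin:stop]


# Hypothetical page fragment, not taken from tv.wp.pl.
html = '<img class="poster" src="https://example.com/show.jpg" alt="show">'
icon = scrub_single(html, 'src="', '"')
print(icon)
```

The same idea applies to a "thumbnailUrl": " start string; what matters is that the start and end strings match the page source exactly.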

Floyd

Sorry, I can't write a post without it turning into a script.

Can you provide a link to the documentation?

I can't get it working. I am trying to base it on onet.ini. I examined the page source and I know which class the image is taken from:

Quote:

Kobra - oddział specjalny

Should the code look like this?

showicon.scrub {single||

mat8861

Documentation is here: http://www.webgrabplus.com/sites/default/files/download/documentation/Ma...

It is also a little complex to pick elements in a site.ini, as they can be present in the index, the detail page, the description or other elements, can change in size, and so on. Basically there is no "easy answer" without reviewing the site.ini.
I made a complete version (.E), which is now in the siteini pack.

Floyd

I don't know, I give up. I can pay for someone to help me. Maybe there is someone who created wp.pl.ini and would like to help?

mat8861

You don't need to pay. If you want to learn, just read the documentation. If you just want to use it, there are 2 versions of wp.pl in the siteini pack, or you can download it directly here: https://github.com/SilentButeo2/webgrabplus-siteinipack/tree/master/site...

PS. The E version is free to use; just run it like any other siteini (needs WG++ 2.1.9 or greater).

Floyd

You are the best!

Floyd

New day, new problem.

Unfortunately, the wp.pl.E file gets the wrong time.

example:
wp.pl.ini
programme start="20191214230000 +0100"

Miasteczko South Park 23
South Park
SHOTS!!!
Dalsze losy grupy przyjaciół, młodych mieszkańców górskiego miasteczka w stanie Kolorado. Kyle, Cartman, Stan i Kenny przeżyją kolejne niezapomniane przygody.(n)
2019
serial animowany dla dorosłych
S23E3

wp.pl.E.ini:
programme start="20191214230000 +0100"

Rick i Morty 2
Rick and Morty 2
The Ricks Must Be Crazy.
Dalsze przygody ekscentrycznego, uzależnionego od alkoholu naukowca Ricka Sancheza i jego wnuka Morty'ego, którzy podróżują po odległych galaktykach oraz alternatywnych rzeczywistościach.(n)
2015
Serial Animowany Dla Dorosłych

USA
S2 E6

16:9

The wp.pl.ini result is the correct one for me.

mat8861

Update siteini.pack

Floyd

wp.pl is good:
programme start="20191215223000 +0100" stop="20191215230000 +0100" channel="Comedy Central Family"
title lang="pl">Przyjaciele 4
Przyjaciele 4
Friends
The One With Rachel's Crush
Serial opowiada o grupie przyjaciół, typowych młodych mieszkańcach Manhattanu, którzy z powodzeniem dzielą czas między miłosne podboje i robienie kariery. Mieszkają właściwie razem i starają się nie rozstawać na dłużej. Nie znoszą samotności i nie wiedzą co to nuda. Tę uroczą gromadkę tworzą: Monica (Courteney Cox), Rachel (Jennifer Aniston), Ross (David Schwimmer), Chandler (Matthew Perry), Joey (Matthew LeBlanc) i Phoebe (Lisa Kudrow). Wszyscy mają ponad dwadzieścia lat, są atrakcyjni i dowcipni, a ich myśli niezmiennie krążą wokół dwóch tematów - kariery i seksu. Młodzi ludzie są przekonani, że życie oferuje im nieograniczone możliwości. W rzeczywistości marzą o prawdziwej miłości i zabiegają o zdobycie upragnionego partnera, lecz najważniejsza jest dla nich przyjaźń
W scenariuszu nowej sztuki, w której gra Kathy, przewidzianych jest kilka namiętnych scen z jej partnerem scenicznym. Joey próbuje uspokoić Chandlera. Z perspektywy swojego aktorskiego doświadczenia tłumaczy przyjacielowi, że kiedy wydaje się, że na scenie czuć "chemię" między aktorami, w rzeczywistości wcale miedzy nimi nie iskrzy. Kiedy Chandler idzie po raz kolejny na sztukę, zauważa, że gra aktorska nie jest dobra i brak w niej emocji. Wówczas dochodzi do wniosku, że jest to spowodowane romansem aktorów w życiu prywatnym. Chandler zarzuca Kathy, że go zdradziła. Rozstają się w niemiłej atmosferze. Wkrótce Chandler postanawia przeprosić Kathy, ale jest już za późno. Dział, w którym pracuje Rachel, zostaje rozwiązany, a ona przeniesiona do innego departamentu. Dziewczyna jest niezadowolona ze zmiany i planuje odejść. Zmienia zdanie, kiedy poznaje przystojnego klienta. Rachel chciałaby się z nim umówić na randkę, ale nie ma pojęcia, jak to zrobić, bo dotąd to zawsze ona była zapraszana.(n)

Dana De Vally Piazza
Rachel Green - Jennifer Aniston
Monica Geller - Courteney Cox
Joey Tribbiani - Matt LeBlanc
Chandler Bing - Matthew Perry
Ross Geller - David Schwimmer
Phoebe Buffay - Lisa Kudrow

1997
serial komediowy
S4E13

wp.pl.E is not:
programme start="20191215223000 +0100" stop="20191215230000 +0100" channel="Comedy Central Family HD"
title lang="pl">Rick i Morty 2
Rick i Morty 2
Rick and Morty 2
Big Trouble in Little Sanchez.
Dalsze przygody ekscentrycznego, uzależnionego od alkoholu naukowca Ricka Sancheza i jego wnuka Morty'ego, którzy podróżują po odległych galaktykach oraz alternatywnych rzeczywistościach.(n)
2015
Serial Animowany Dla Dorosłych

USA
S2 E7

16:9

should correctly be:
programme start="20191215233000 +0100" stop="20191215230000 +0100" channel="Comedy Central Family HD"
title lang="pl">Rick i Morty 2

mat8861

My fault, I did not change the revision number. Use rev. 10:
programme start="20191215223000 +0100" stop="20191215230000 +0100" channel="Comedy Central Family HD"
title lang="pl">Przyjaciele 4 title
title lang="en">Friends

Floyd

The time seems fine. The problem is now:

( 21/194 ) TV.WP.PL.E -- chan. (xmltv_id=CANAL+ 1 HD) -- mode Force
innnnnnnnnnnnnnnnnnnn
Unable to update channel CANAL+ 1 HD
Generic syntax exception:
message:
Current culture: en-GB
time parsing error : String was not recognized as a valid DateTime.
nextstartdatetime time scrubbed :
computer date/time format: 16/12/2019 11:24:44
Existing guide data restored!

I think the beginning of the siteini should look like the one in tv.wp.pl.ini:
site {url=tv.wp.pl|timezone=Europe/Warsaw|maxdays=7.1|cultureinfo=pl-PL|charset=utf-8|titlematchfactor=60|allowlastdayoverflow}
url_index{url()|https://tv.wp.pl/api/v1/program/|urldate|/|channel|?days=7}
urldate.format {datestring|yyyy-MM-dd}
index_showsplit.scrub {multi()|{"id":|||}

mat8861

New revision ..

Floyd

Everything seems fine.

Plum12
Offline
Has donated long time ago
Joined: 6 years
Last seen: 2 years

How can I add "thematic_categories" to tv.wp.pl.ini?

view-source:https://tv.wp.pl/kanal/tvp-2-hd

Something like index_category.scrub {single|"category":"||",|}?

I also have a question about rating.
While grabbing, the rating comes back as 7, 8, 10 and so on, so I used
rating.modify {replace (null) | 16 | 16+}
With lines like this, 4 to 14 show as 4+ to 14+,
but 16 shows as 16++. I fixed it with rating.modify {replace
but why is this happening?

And how do I capitalize the first letter of each word?

E.g. film to Film / film przygodowy to Film Przygodowy

And how do I remove this?

title lang="pl" \"Gwiazdka na plebanii /title \"
title.modify removes only the " but not the \

Thanks!

Blackbear199
Offline
WG++ Team member, Donator
Joined: 9 years
Last seen: 2 hours

I assume the 16 rating already has the + (e.g. 16+).
You can fix this by using:
loop{('rating' not "" max=1)|1}
rating.modify {addend(not~ "+")|+}
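
As a rough illustration of the logic (append "+" only when the rating is non-empty and does not already contain one), here is a small Python sketch. It assumes not~ means "does not contain"; the function name ensure_plus is hypothetical and this is not WebGrab+ code.

```python
def ensure_plus(rating: str) -> str:
    """Append '+' unless the rating is empty or already contains a '+'."""
    if rating and "+" not in rating:
        return rating + "+"
    return rating


for value in ["7", "16", "16+", ""]:
    print(repr(value), "->", repr(ensure_plus(value)))
```

Because the check runs before the append, a value that already arrives as 16+ is left alone instead of becoming 16++.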

To fix the category:
category.modify {cleanup(style=name)}
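
What was asked for here (film to Film, film przygodowy to Film Przygodowy) can be sketched in Python as below. This only illustrates the requested result; that style=name does exactly this is an assumption, and capitalize_words is a hypothetical helper name.

```python
def capitalize_words(text: str) -> str:
    """Upper-case the first character of each space-separated word."""
    return " ".join(word[:1].upper() + word[1:] for word in text.split(" "))


print(capitalize_words("film"))
print(capitalize_words("film przygodowy"))
```

Only the first character of each word is changed, so the rest of a word such as fantasy/SF keeps its original casing.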

For the title, add this after the last index_showsplit line:
index_showsplit.modify {cleanup(style=jsondecode)}

This will get rid of the \ (escape) before the ", but the title itself will still have the quotes.
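
To see the effect of JSON decoding on the title from the question, here is a minimal Python sketch using the standard json module. It is only an analogy for what a JSON decode step does to escaped quotes; the helper name json_unescape is hypothetical, and the final strip of the quotes is an extra step that the advice above does not include.

```python
import json


def json_unescape(fragment: str) -> str:
    """Decode JSON escapes in a string fragment taken from inside a JSON string."""
    return json.loads('"' + fragment + '"')


raw = r'\"Gwiazdka na plebanii\"'   # as it appears in the page source
decoded = json_unescape(raw)        # backslashes gone, quotes remain
without_quotes = decoded.strip('"')  # optional extra step, not part of the decode

print(decoded)
print(without_quotes)
```

The decode step removes the backslashes only, which matches the remark that the title will still have the quotes afterwards.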

Plum12

Thanks bro!

"to fix category.."
category.modify {cleanup(style=name)} did not help, but index_category.modify {cleanup(style=name)} works perfectly.

"I assume the 16 rating already has the +(ex 16+)": I checked before writing the question, but maybe not carefully enough.

Your entries have fixed it!

=====================================================

index_showsplit.modify {cleanup(style=jsondecode)} no idea

One more question: how do I add

thematic_categories?

Blackbear199

thematic_categories
I have no idea what you mean by this.

Plum12

view-source:https://tv.wp.pl/kanal/polsat

thematic_categories":["fantasy/SF","thriller"],

I want to replace this line in the pl.wp.ini file, or add one more line next to it:

index_category.scrub {single|"genre":"||",|}

Blackbear199

Change
index_category.scrub {single|"genre":"||",|}
to
index_category.scrub {regex||(?:"genre":"\|"thematic_categories":\[")(.*?)(?:",\|"\])||}
index_category.modify {replace|","|\|}

Blackbear199

Oops, don't do that, as I see an issue.
Leave your original category scrub as it was and just add another one on the next line:
index_category.scrub {single(separator="","")|"thematic_categories":["||"]|"]}
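
Roughly, this takes the text between "thematic_categories":[" and "] and splits it on ",". A Python sketch of that idea, using the thematic_categories fragment quoted earlier in the thread, might look like this. It is an approximation for illustration only; scrub_list is a hypothetical name and the "genre" value in the sample is made up.

```python
from typing import List


def scrub_list(text: str, start: str, end: str, separator: str) -> List[str]:
    """Take the text between `start` and `end`, then split it on `separator`."""
    begin = text.find(start)
    if begin == -1:
        return []
    begin += len(start)
    stop = text.find(end, begin)
    if stop == -1:
        return []
    return text[begin:stop].split(separator)


# The "genre" value is hypothetical; the thematic_categories part is from the thread.
sample = '"genre":"film","thematic_categories":["fantasy/SF","thriller"],'
categories = scrub_list(sample, '"thematic_categories":["', '"]', '","')
print(categories)
```

Keeping this as a second line next to the original "genre" scrub means both sources of category values are handled separately.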

Blackbear199

By the way, this:
index_category.scrub {single|"genre":"||",|}
should be
index_category.scrub {single|"genre":"||",|",}
It will make grabbing slightly faster.

Plum12

Wow,
I would never have come up with that.

I think there is nothing more to add:
http://www.wklejto.pl/799659

Thanks again!

Blackbear199

I also made a typo in the scrub line in post 21, so copy it again.

Plum12

OK

