Bit's BLOG - Bit의 블로그 공간입니다.

[Python] 크롤링을 이용해 웹툰 제목과 회차정보 가져오기

2018년 12월 07일2018년 12월 07일 bit Leave a comment

마음의 소리 웹툰의 회차정보와 제목 가져오기

from bs4 import BeautifulSoup
import requests

def get_html(url) :
    _html = ""
    resp = requests.get(url)
    if resp.status_code == 200:
        _html = resp.text
    return _html
    
URL = "http://comic.naver.com/webtoon/list.nhn?titleId=20853&weekday=tue&page=1"
html = get_html(URL)
soup = BeautifulSoup(html, 'html.parser')

webtoon_list = list()
webtoon_area = soup.find("table", {"class": "viewList"}).find_all("td", {"class": "title"}) #파싱한 URL에서 table 태크에서 class 가 viewList 인 html 객체를 가져오고, 그중에서도 td 채크이면서 class 가 title 인 객체를 가져옴

for webtoon_index in webtoon_area:
		info_soup = webtoon_index.find("a") #a태크만 가져옴
		_url = info_soup["href"] #url 가져오기 위해 href속성의 내용만 가져옴
		_text = info_soup.text.split(".") #html코드에서 텍스트만 가져오고 . 구분자로 나눔 (회차와 제목)
		_title  = ""
		_num = _text[0]
		if len(_text) > 1:
			_title = _text[1].strip() #_title에 제목을 넣기 위해서
			
		webtoon_list.append((_num, _title, _url, ))

for print_ in webtoon_list:
    print(print_)

from bs4 import BeautifulSoup

import requests

def get_html(url) :

_html = ""

resp = requests.get(url)

if resp.status_code == 200:

_html = resp.text

return _html

URL = "http://comic.naver.com/webtoon/list.nhn?titleId=20853&weekday=tue&page=1"

html = get_html(URL)

soup = BeautifulSoup(html, 'html.parser')

webtoon_list = list()

webtoon_area = soup.find("table", {"class": "viewList"}).find_all("td", {"class": "title"}) #파싱한 URL에서 table 태크에서 class 가 viewList 인 html 객체를 가져오고, 그중에서도 td 채크이면서 class 가 title 인 객체를 가져옴

for webtoon_index in webtoon_area:

info_soup = webtoon_index.find("a") #a태크만 가져옴

_url = info_soup["href"] #url 가져오기 위해 href속성의 내용만 가져옴

_text = info_soup.text.split(".") #html코드에서 텍스트만 가져오고 . 구분자로 나눔 (회차와 제목)

_title = ""

_num = _text[0]

if len(_text) > 1:

_title = _text[1].strip() #_title에 제목을 넣기 위해서

webtoon_list.append((_num, _title, _url, ))

for print_ in webtoon_list:

print(print_)

검정 바탕 터미널에서 디렉토리 표시 색상을 가독성 좋게 설정하기

2018년 07월 23일2018년 07월 23일 bit Leave a comment

리눅스 설치 후 터미널 접속하여 명령을 실행 해 보면, 디렉토리가 파란색으로 표시된다. (배포판의 버전, ls 색상 설정에 따라 다를 수 있습니다.) 파란색 자체가 안좋은건 아니지만 대부분 터미널 배경색상이 검정색이다 보니 파란색 폰트 가독성이 그리 좋지 않습니다. 아래 그림을 참고해보면 확실히 검정 바탕에 파란색 폰트는 가독성이 좋지 못합니다. /etc/DIR_COLOR 파일을 편집기로 열어서 DIR항목을 찾아본다. 기본 설정값인 01;34를 […]

워드프레스에 나눔고딕 폰트 적용하기(WP Google Fonts 플러그인 이용)

2018년 07월 03일2018년 07월 05일 bit Leave a comment

워드프레스 테마는 주로 영문 위주로 만들어졌기 때문에 한글 폰트를 예쁘게 표현하려면 별도의 조치가 필요합니다. CSS를 직접 수정하여 사용 할 수도 있지만, 테마마다 CSS를 수정하고 관리하기 힘들기 때문에 플러그인을 이용하여 편리하게 폰트를 설정합니다. 1. WP Google Fonts 플러그인을 설치하여 나눔고딕 폰트 설정 Custom CSS 에는 아래 내용을 복사하여 넣습니다.

@import url(“http://fonts.googleapis.com/earlyaccess/nanumgothic.css” ) ;
body, h1, h2, h3, h4, h5, h6, li, p { font-family:”Nanum Gothic”,”NanumGothic” !important ; }

1 2	@import url(“http://fonts.googleapis.com/earlyaccess/nanumgothic.css” ) ; body, h1, h2, h3, h4, h5, h6, li, p { font-family:”Nanum Gothic”,”NanumGothic” !important ; }

2. 글쓰기 편집 화면에서 나눔고딕을 기본 […]

워드프레스 컨텐츠 디렉토리(wp-content)로 이동할 수 없습니다.

2018년 07월 03일2018년 07월 04일 bit Leave a comment

워드프레스에서 FTP를 이용하여 테마, 플러그인 등을 설치/업데이트 시 컨텐츠 디렉토리(wp-content)로 이동 할 수 없거나, 찾을 수 없다는 오류 문구가 나오는 경우 wp-config.php 위 파일을 열어 아래 코드를 찾는다.

require_once(ABSPATH . 'wp-settings.php');

1	require_once(ABSPATH . 'wp-settings.php');

위 코드 밑에 아래 코드를 삽입한다.

if(is_admin()) {
    add_filter('filesystem_method', create_function('$a', 'return "direct";' ));
    define( 'FS_CHMOD_DIR', 0751 );
}

if(is_admin()) {

add_filter('filesystem_method', create_function('$a', 'return "direct";' ));

define( 'FS_CHMOD_DIR', 0751 );

}

워드프레스에서 FTP 정보를 요구 할 경우

2018년 07월 03일2018년 07월 05일 bit

워드프레스 테마, 플러그인 설치 및 업그레이드 시 FTP 정보를 물어보는 팝업이 나올 때 아래와 같이 설정하면 된다. user, 암호, Host 등은 자신의 워드프레스가 설치된 서버의 정보를 입력하면 된다. wp-config.php

/** Synology FTP Setting */
define( 'FS_METHOD', 'ssh2' );
define( 'FTP_USER', 'username' );
define( 'FTP_PASS', 'password' );
define( 'FTP_HOST', 'blog.hostname.com:22' );

/** Synology FTP Setting */

define( 'FS_METHOD', 'ssh2' );

define( 'FTP_USER', 'username' );

define( 'FTP_PASS', 'password' );

define( 'FTP_HOST', 'blog.hostname.com:22' );

위와 같이 내용을 추가하고 파일을 저장 한 후 테마, 플러그인을 설치하면 FTP 정보 입력 팝업이 나오지 않게 된다.

Bit 블로그 시작합니다.

2018년 07월 03일2018년 07월 03일 bit Leave a comment

시작합니다.