Overview

Dataset statistics

Number of variables5
Number of observations78
Missing cells4
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory41.7 B

Variable types

Unsupported3
Categorical1
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15464/S/1/datasetView.do

Alerts

서울책방 추천도서 목록(20.04.) has 1 (1.3%) missing valuesMissing
Unnamed: 2 has 1 (1.3%) missing valuesMissing
Unnamed: 3 has 1 (1.3%) missing valuesMissing
Unnamed: 4 has 1 (1.3%) missing valuesMissing
서울책방 추천도서 목록(20.04.) is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-30 01:01:24.852733
Analysis finished2024-04-30 01:01:26.692103
Duration1.84 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

서울책방 추천도서 목록(20.04.)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size756.0 B

Unnamed: 1
Categorical

Distinct7
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size756.0 B
역사/사료
33 
일반행정
18 
문화/관광
14 
연구/논문
통계
 
2
Other values (2)
 
2

Length

Max length5
Median length5
Mean length4.6410256
Min length2

Unique

Unique2 ?
Unique (%)2.6%

Sample

1st row<NA>
2nd row분류
3rd row문화/관광
4th row문화/관광
5th row문화/관광

Common Values

ValueCountFrequency (%)
역사/사료 33
42.3%
일반행정 18
23.1%
문화/관광 14
17.9%
연구/논문 9
 
11.5%
통계 2
 
2.6%
<NA> 1
 
1.3%
분류 1
 
1.3%

Length

2024-04-30T10:01:26.793416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T10:01:26.930572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
역사/사료 33
42.3%
일반행정 18
23.1%
문화/관광 14
17.9%
연구/논문 9
 
11.5%
통계 2
 
2.6%
na 1
 
1.3%
분류 1
 
1.3%

Unnamed: 2
Text

MISSING 

Distinct77
Distinct (%)100.0%
Missing1
Missing (%)1.3%
Memory size756.0 B
2024-04-30T10:01:27.160991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length27
Mean length17.428571
Min length3

Characters and Unicode

Total characters1342
Distinct characters268
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)100.0%

Sample

1st row상품명
2nd row의금부 금오계첩
3rd row서울, 테마 산책길4
4th row책방산책 서울No.2
5th row책방산책-서울편
ValueCountFrequency (%)
서울 9
 
3.8%
서울2천년사 8
 
3.4%
2015 7
 
3.0%
서울의 7
 
3.0%
일제강점기 5
 
2.1%
2천년사 3
 
1.3%
서울통계연보 2
 
0.8%
그여자 2
 
0.8%
자료집 2
 
0.8%
한성부 2
 
0.8%
Other values (183) 190
80.2%
2024-04-30T10:01:27.515560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
 
11.9%
81
 
6.0%
63
 
4.7%
2 35
 
2.6%
35
 
2.6%
28
 
2.1%
) 25
 
1.9%
( 25
 
1.9%
23
 
1.7%
0 21
 
1.6%
Other values (258) 846
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 921
68.6%
Space Separator 160
 
11.9%
Decimal Number 142
 
10.6%
Other Punctuation 31
 
2.3%
Lowercase Letter 27
 
2.0%
Close Punctuation 25
 
1.9%
Open Punctuation 25
 
1.9%
Uppercase Letter 6
 
0.4%
Dash Punctuation 4
 
0.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
8.8%
63
 
6.8%
35
 
3.8%
28
 
3.0%
23
 
2.5%
18
 
2.0%
16
 
1.7%
14
 
1.5%
13
 
1.4%
13
 
1.4%
Other values (219) 617
67.0%
Lowercase Letter
ValueCountFrequency (%)
o 4
14.8%
n 3
11.1%
l 3
11.1%
e 3
11.1%
a 3
11.1%
y 2
7.4%
g 2
7.4%
t 2
7.4%
h 1
 
3.7%
i 1
 
3.7%
Other values (3) 3
11.1%
Decimal Number
ValueCountFrequency (%)
2 35
24.6%
0 21
14.8%
1 21
14.8%
5 12
 
8.5%
9 11
 
7.7%
7 10
 
7.0%
3 10
 
7.0%
6 9
 
6.3%
4 7
 
4.9%
8 6
 
4.2%
Other Punctuation
ValueCountFrequency (%)
: 18
58.1%
, 7
 
22.6%
. 3
 
9.7%
% 1
 
3.2%
& 1
 
3.2%
! 1
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
N 1
16.7%
W 1
16.7%
C 1
16.7%
H 1
16.7%
Space Separator
ValueCountFrequency (%)
160
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 908
67.7%
Common 388
28.9%
Latin 33
 
2.5%
Han 13
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
8.9%
63
 
6.9%
35
 
3.9%
28
 
3.1%
23
 
2.5%
18
 
2.0%
16
 
1.8%
14
 
1.5%
13
 
1.4%
13
 
1.4%
Other values (212) 604
66.5%
Common
ValueCountFrequency (%)
160
41.2%
2 35
 
9.0%
) 25
 
6.4%
( 25
 
6.4%
0 21
 
5.4%
1 21
 
5.4%
: 18
 
4.6%
5 12
 
3.1%
9 11
 
2.8%
7 10
 
2.6%
Other values (11) 50
 
12.9%
Latin
ValueCountFrequency (%)
o 4
12.1%
n 3
 
9.1%
l 3
 
9.1%
e 3
 
9.1%
a 3
 
9.1%
S 2
 
6.1%
y 2
 
6.1%
g 2
 
6.1%
t 2
 
6.1%
N 1
 
3.0%
Other values (8) 8
24.2%
Han
ValueCountFrequency (%)
2
15.4%
2
15.4%
2
15.4%
2
15.4%
2
15.4%
2
15.4%
1
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 908
67.7%
ASCII 421
31.4%
CJK 13
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
38.0%
2 35
 
8.3%
) 25
 
5.9%
( 25
 
5.9%
0 21
 
5.0%
1 21
 
5.0%
: 18
 
4.3%
5 12
 
2.9%
9 11
 
2.6%
7 10
 
2.4%
Other values (29) 83
19.7%
Hangul
ValueCountFrequency (%)
81
 
8.9%
63
 
6.9%
35
 
3.9%
28
 
3.1%
23
 
2.5%
18
 
2.0%
16
 
1.8%
14
 
1.5%
13
 
1.4%
13
 
1.4%
Other values (212) 604
66.5%
CJK
ValueCountFrequency (%)
2
15.4%
2
15.4%
2
15.4%
2
15.4%
2
15.4%
2
15.4%
1
7.7%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size756.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size756.0 B

Correlations

2024-04-30T10:01:27.598269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2
Unnamed: 11.0001.000
Unnamed: 21.0001.000

Missing values

2024-04-30T10:01:26.290790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T10:01:26.454467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T10:01:26.604421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서울책방 추천도서 목록(20.04.)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
0NaN<NA><NA>NaNNaN
1상품번호분류상품명판매가격등록일자
211258문화/관광의금부 금오계첩140002020-01-16 00:00:00
310135문화/관광서울, 테마 산책길430002019-01-23 00:00:00
49295문화/관광책방산책 서울No.280002018-03-22 00:00:00
58438문화/관광책방산책-서울편80002017-03-21 00:00:00
67812문화/관광사연있는 나무 이야기(개정판)50002016-08-14 00:00:00
77694문화/관광애들아, 숲으로 가자!50002016-07-08 00:00:00
87472문화/관광리-플레이: 4개의 플랫폼&17번의 이벤트(유휴공간 건축 프로젝트)190002016-04-28 00:00:00
97295문화/관광서울, 테마 산책길120002016-03-09 00:00:00
서울책방 추천도서 목록(20.04.)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
687373일반행정2015서울정원박람회백서50002016-04-06 00:00:00
697372일반행정2015 자치회관 우수사례집40002016-04-04 00:00:00
707313일반행정2015 갈등관리백서(상생의힘3)50002016-03-18 00:00:00
717113일반행정나는 서울시민이다70002016-01-13 00:00:00
727112일반행정서울의 도시 실험:서울도시건축국제비엔날레의시작70002016-01-13 00:00:00
737095일반행정서울브랜드이야기50002016-01-13 00:00:00
747094일반행정2016미래전문가가 말하는 서울의미래100002016-01-13 00:00:00
757093일반행정2015 소통백서50002016-01-13 00:00:00
769055통계서울통계연보 2017150002017-12-08 00:00:00
779035통계서울통계연보 2016150002017-12-01 00:00:00