Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells2
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory54.4 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description샘플 데이터
Author경기콘텐츠진흥원
URLhttps://www.bigdata-region.kr/#/dataset/976e026a-980e-4d2b-8d9d-8f42fb07ce39

Alerts

서점분류명 has constant value ""Constant
우편번호 has 2 (6.7%) missing valuesMissing
지역서점번호 has unique valuesUnique
서점명 has unique valuesUnique
주소 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:19:06.450780
Analysis finished2023-12-10 14:19:07.535268
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역서점번호
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:19:07.619596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2023-12-10T23:19:07.803550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%

서점명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:19:08.060525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length4
Mean length4.7333333
Min length3

Characters and Unicode

Total characters142
Distinct characters70
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row장영실서점
2nd row정글북
3rd row정왕문고
4th row정우사
5th row주엽서점
ValueCountFrequency (%)
장영실서점 1
 
3.3%
정글북 1
 
3.3%
에이스북(리딩북 1
 
3.3%
미스터버티고 1
 
3.3%
우리서점송내 1
 
3.3%
교석서점 1
 
3.3%
작은책방기역 1
 
3.3%
희영서점 1
 
3.3%
홍익서점 1
 
3.3%
호원문고 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:19:08.671347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
8.5%
12
 
8.5%
12
 
8.5%
11
 
7.7%
5
 
3.5%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (60) 74
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138
97.2%
Open Punctuation 2
 
1.4%
Close Punctuation 2
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
8.7%
12
 
8.7%
12
 
8.7%
11
 
8.0%
5
 
3.6%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (58) 70
50.7%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138
97.2%
Common 4
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
8.7%
12
 
8.7%
12
 
8.7%
11
 
8.0%
5
 
3.6%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (58) 70
50.7%
Common
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138
97.2%
ASCII 4
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
8.7%
12
 
8.7%
12
 
8.7%
11
 
8.0%
5
 
3.6%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (58) 70
50.7%
ASCII
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%

우편번호
Real number (ℝ)

MISSING 

Distinct28
Distinct (%)100.0%
Missing2
Missing (%)6.7%
Infinite0
Infinite (%)0.0%
Mean12402.143
Minimum10306
Maximum18593
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:19:08.855259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10306
5-th percentile10358.85
Q110740.75
median11956.5
Q313591.25
95-th percentile15863.55
Maximum18593
Range8287
Interquartile range (IQR)2850.5

Descriptive statistics

Standard deviation2060.5666
Coefficient of variation (CV)0.16614602
Kurtosis1.6230234
Mean12402.143
Median Absolute Deviation (MAD)1481.5
Skewness1.2469534
Sum347260
Variance4245934.9
MonotonicityNot monotonic
2023-12-10T23:19:09.016566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
10366 1
 
3.3%
12452 1
 
3.3%
10355 1
 
3.3%
10450 1
 
3.3%
11346 1
 
3.3%
11033 1
 
3.3%
13604 1
 
3.3%
13182 1
 
3.3%
13136 1
 
3.3%
11704 1
 
3.3%
Other values (18) 18
60.0%
(Missing) 2
 
6.7%
ValueCountFrequency (%)
10306 1
3.3%
10355 1
3.3%
10366 1
3.3%
10386 1
3.3%
10417 1
3.3%
10450 1
3.3%
10500 1
3.3%
10821 1
3.3%
10929 1
3.3%
11033 1
3.3%
ValueCountFrequency (%)
18593 1
3.3%
16293 1
3.3%
15066 1
3.3%
15040 1
3.3%
14019 1
3.3%
13627 1
3.3%
13604 1
3.3%
13587 1
3.3%
13182 1
3.3%
13136 1
3.3%

서점분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
서점
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서점
2nd row서점
3rd row서점
4th row서점
5th row서점

Common Values

ValueCountFrequency (%)
서점 30
100.0%

Length

2023-12-10T23:19:09.221546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:19:09.358388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서점 30
100.0%

주소
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:19:09.682989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length24
Mean length21.533333
Min length14

Characters and Unicode

Total characters646
Distinct characters107
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row경기도 성남시 분당구 돌마로90번길 4
2nd row경기도 고양시 일산서구 중앙로 1406
3rd row경기도 시흥시 정왕대로 64
4th row경기도 안양시 만안구 병목안로130번길 20
5th row경기도 고양시 일산서구 강성로214번길 116-4/ 1층 1층
ValueCountFrequency (%)
경기도 30
 
19.7%
고양시 8
 
5.3%
성남시 5
 
3.3%
일산동구 4
 
2.6%
1층 4
 
2.6%
분당구 3
 
2.0%
일산서구 3
 
2.0%
파주시 2
 
1.3%
중앙로 2
 
1.3%
남양주시 2
 
1.3%
Other values (87) 89
58.6%
2023-12-10T23:19:10.236801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
122
18.9%
31
 
4.8%
30
 
4.6%
30
 
4.6%
1 30
 
4.6%
30
 
4.6%
24
 
3.7%
16
 
2.5%
2 16
 
2.5%
13
 
2.0%
Other values (97) 304
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 406
62.8%
Space Separator 122
 
18.9%
Decimal Number 112
 
17.3%
Dash Punctuation 3
 
0.5%
Other Punctuation 2
 
0.3%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
7.6%
30
 
7.4%
30
 
7.4%
30
 
7.4%
24
 
5.9%
16
 
3.9%
13
 
3.2%
13
 
3.2%
12
 
3.0%
11
 
2.7%
Other values (82) 196
48.3%
Decimal Number
ValueCountFrequency (%)
1 30
26.8%
2 16
14.3%
3 13
11.6%
6 11
 
9.8%
4 10
 
8.9%
0 9
 
8.0%
8 7
 
6.2%
5 6
 
5.4%
7 5
 
4.5%
9 5
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
/ 1
50.0%
Space Separator
ValueCountFrequency (%)
122
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 406
62.8%
Common 239
37.0%
Latin 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
7.6%
30
 
7.4%
30
 
7.4%
30
 
7.4%
24
 
5.9%
16
 
3.9%
13
 
3.2%
13
 
3.2%
12
 
3.0%
11
 
2.7%
Other values (82) 196
48.3%
Common
ValueCountFrequency (%)
122
51.0%
1 30
 
12.6%
2 16
 
6.7%
3 13
 
5.4%
6 11
 
4.6%
4 10
 
4.2%
0 9
 
3.8%
8 7
 
2.9%
5 6
 
2.5%
7 5
 
2.1%
Other values (4) 10
 
4.2%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 406
62.8%
ASCII 240
37.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
122
50.8%
1 30
 
12.5%
2 16
 
6.7%
3 13
 
5.4%
6 11
 
4.6%
4 10
 
4.2%
0 9
 
3.8%
8 7
 
2.9%
5 6
 
2.5%
7 5
 
2.1%
Other values (5) 11
 
4.6%
Hangul
ValueCountFrequency (%)
31
 
7.6%
30
 
7.4%
30
 
7.4%
30
 
7.4%
24
 
5.9%
16
 
3.9%
13
 
3.2%
13
 
3.2%
12
 
3.0%
11
 
2.7%
Other values (82) 196
48.3%

전화번호
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:19:10.516646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.066667
Min length12

Characters and Unicode

Total characters362
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row031-716-2030
2nd row031-922-5000
3rd row031-432-7900
4th row031-466-4977
5th row031-915-8489
ValueCountFrequency (%)
031-716-2030 1
 
3.3%
031-922-5000 1
 
3.3%
031-976-3205 1
 
3.3%
031-849-6605 1
 
3.3%
031-862-2929 1
 
3.3%
031-832-2405 1
 
3.3%
031-715-2556 1
 
3.3%
031-735-6237 1
 
3.3%
031-743-4983 1
 
3.3%
031-829-1879 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:19:10.996497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 61
16.9%
- 60
16.6%
3 51
14.1%
1 40
11.0%
7 28
7.7%
9 28
7.7%
5 23
 
6.4%
2 19
 
5.2%
4 19
 
5.2%
6 17
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 302
83.4%
Dash Punctuation 60
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 61
20.2%
3 51
16.9%
1 40
13.2%
7 28
9.3%
9 28
9.3%
5 23
 
7.6%
2 19
 
6.3%
4 19
 
6.3%
6 17
 
5.6%
8 16
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 362
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 61
16.9%
- 60
16.6%
3 51
14.1%
1 40
11.0%
7 28
7.7%
9 28
7.7%
5 23
 
6.4%
2 19
 
5.2%
4 19
 
5.2%
6 17
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 362
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 61
16.9%
- 60
16.6%
3 51
14.1%
1 40
11.0%
7 28
7.7%
9 28
7.7%
5 23
 
6.4%
2 19
 
5.2%
4 19
 
5.2%
6 17
 
4.7%

Interactions

2023-12-10T23:19:07.170052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:19:06.684239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:19:07.254339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:19:07.093232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:19:11.144400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역서점번호서점명우편번호주소전화번호
지역서점번호1.0001.0000.3841.0001.000
서점명1.0001.0001.0001.0001.000
우편번호0.3841.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
2023-12-10T23:19:11.291463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역서점번호우편번호
지역서점번호1.000-0.134
우편번호-0.1341.000

Missing values

2023-12-10T23:19:07.356669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:19:07.483490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역서점번호서점명우편번호서점분류명주소전화번호
01장영실서점13627서점경기도 성남시 분당구 돌마로90번길 4031-716-2030
12정글북10386서점경기도 고양시 일산서구 중앙로 1406031-922-5000
23정왕문고15040서점경기도 시흥시 정왕대로 64031-432-7900
34정우사14019서점경기도 안양시 만안구 병목안로130번길 20031-466-4977
45주엽서점<NA>서점경기도 고양시 일산서구 강성로214번길 116-4/ 1층 1층031-915-8489
56중앙문구서적10821서점경기도 파주시 문산읍 문향로67번길 16031-953-9668
67지산문고10417서점경기도 고양시 일산동구 일산로 238031-903-0462
78진학서점16293서점경기도 수원시 장안구 수일로 229031-243-0307
89초원서점10306서점경기도 고양시 일산동구 숲속마을로 40031-907-7345
910커피책방11492서점경기도 양주시 고읍남로191번길 86-7031-844-7890
지역서점번호서점명우편번호서점분류명주소전화번호
2021현대서점10929서점경기도 파주시 명동길 53031-941-1351
2122호원문고11704서점경기도 의정부시 호원동 425번지 한승아파트 128.129호 1층031-829-1879
2223홍익서점13136서점경기도 성남시 수정구 산성대로 523031-743-4983
2324희영서점13182서점경기도 성남시 중원구 광명로 370031-735-6237
2425작은책방기역13604서점경기도 성남시 분당구 불곡남로21번길 1031-715-2556
2526교석서점11033서점경기도 연천군 전곡읍 전곡로164번길 14031-832-2405
2627우리서점송내11346서점경기도 동두천시 지행로 63031-862-2929
2728미스터버티고10450서점경기도 고양시 일산동구 강송로 33031-849-6605
2829에이스북(리딩북)10355서점경기도 고양시 일산동구 중산로 105031-976-3205
2930청평서적12452서점경기도 가평군 청평면 여울길 14070-7815-3316