Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells1
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory43.3 B

Variable types

Categorical2
Numeric1
Text2

Alerts

extr_goods_updt_de has constant value ""Constant
extr_goods_cd has unique valuesUnique

Reproduction

Analysis started2023-12-10 09:51:44.850417
Analysis finished2023-12-10 09:51:46.168689
Duration1.32 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

area_nm
Categorical

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강원 영월군
30 
강원 양양군
23 
강원 원주시
20 
강원 고성군
10 
강원 강릉시
Other values (4)
11 

Length

Max length6
Median length6
Mean length5.99
Min length5

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row강원 강릉시
2nd row충북 충주시
3rd row강원 강릉시
4th row강원 강릉시
5th row강원 강릉시

Common Values

ValueCountFrequency (%)
강원 영월군 30
30.0%
강원 양양군 23
23.0%
강원 원주시 20
20.0%
강원 고성군 10
 
10.0%
강원 강릉시 6
 
6.0%
강원 속초시 4
 
4.0%
충북 충주시 3
 
3.0%
강원 양구군 3
 
3.0%
강원 양양 1
 
1.0%

Length

2023-12-10T18:51:46.384763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:51:46.589102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원 97
48.5%
영월군 30
 
15.0%
양양군 23
 
11.5%
원주시 20
 
10.0%
고성군 10
 
5.0%
강릉시 6
 
3.0%
속초시 4
 
2.0%
충북 3
 
1.5%
충주시 3
 
1.5%
양구군 3
 
1.5%

extr_goods_cd
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48166.62
Minimum972
Maximum67572
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T18:51:46.834702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum972
5-th percentile39346
Q145665
median46479
Q352067.25
95-th percentile59285.9
Maximum67572
Range66600
Interquartile range (IQR)6402.25

Descriptive statistics

Standard deviation7802.8557
Coefficient of variation (CV)0.16199716
Kurtosis13.020599
Mean48166.62
Median Absolute Deviation (MAD)2969.5
Skewness-1.7728343
Sum4816662
Variance60884557
MonotonicityNot monotonic
2023-12-10T18:51:47.070409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
972 1
 
1.0%
46061 1
 
1.0%
54772 1
 
1.0%
54540 1
 
1.0%
54244 1
 
1.0%
53201 1
 
1.0%
52068 1
 
1.0%
52067 1
 
1.0%
46778 1
 
1.0%
46261 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
972 1
1.0%
36069 1
1.0%
36113 1
1.0%
36390 1
1.0%
39327 1
1.0%
39347 1
1.0%
39372 1
1.0%
39523 1
1.0%
39603 1
1.0%
42182 1
1.0%
ValueCountFrequency (%)
67572 1
1.0%
66807 1
1.0%
66796 1
1.0%
60104 1
1.0%
59569 1
1.0%
59271 1
1.0%
58661 1
1.0%
58444 1
1.0%
57806 1
1.0%
57458 1
1.0%
Distinct87
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T18:51:47.449090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length26
Mean length21.75
Min length11

Characters and Unicode

Total characters2175
Distinct characters206
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)77.0%

Sample

1st row[강원 강릉] 아라나비토종 짚라인 왕복권
2nd row[충북 충주] 충주짚라인 12월
3rd row[강원 강릉] 알로하서프
4th row[강원 강릉] 청시행 비치 & 서핑강습
5th row[강원 강릉] 청시행 비치 & 서핑강습 ★핫딜
ValueCountFrequency (%)
강원 84
 
18.6%
영월 30
 
6.6%
양양 21
 
4.6%
원주 20
 
4.4%
동강 13
 
2.9%
오크밸리 10
 
2.2%
고성 10
 
2.2%
래프팅 8
 
1.8%
7
 
1.5%
hkc 6
 
1.3%
Other values (153) 243
53.8%
2023-12-10T18:51:48.101699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
352
 
16.2%
146
 
6.7%
113
 
5.2%
[ 99
 
4.6%
] 99
 
4.6%
65
 
3.0%
54
 
2.5%
39
 
1.8%
38
 
1.7%
37
 
1.7%
Other values (196) 1133
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1437
66.1%
Space Separator 352
 
16.2%
Open Punctuation 99
 
4.6%
Close Punctuation 99
 
4.6%
Uppercase Letter 73
 
3.4%
Decimal Number 35
 
1.6%
Other Punctuation 32
 
1.5%
Other Symbol 16
 
0.7%
Math Symbol 14
 
0.6%
Lowercase Letter 10
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
10.2%
113
 
7.9%
65
 
4.5%
54
 
3.8%
39
 
2.7%
38
 
2.6%
37
 
2.6%
34
 
2.4%
33
 
2.3%
32
 
2.2%
Other values (159) 846
58.9%
Uppercase Letter
ValueCountFrequency (%)
K 18
24.7%
P 11
15.1%
G 11
15.1%
H 6
 
8.2%
C 6
 
8.2%
B 5
 
6.8%
S 4
 
5.5%
N 3
 
4.1%
T 2
 
2.7%
O 2
 
2.7%
Other values (5) 5
 
6.8%
Decimal Number
ValueCountFrequency (%)
1 9
25.7%
2 8
22.9%
8 5
14.3%
3 5
14.3%
0 4
11.4%
9 2
 
5.7%
4 1
 
2.9%
6 1
 
2.9%
Other Punctuation
ValueCountFrequency (%)
& 16
50.0%
/ 13
40.6%
: 2
 
6.2%
! 1
 
3.1%
Lowercase Letter
ValueCountFrequency (%)
g 3
30.0%
p 3
30.0%
k 3
30.0%
x 1
 
10.0%
Space Separator
ValueCountFrequency (%)
352
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 99
100.0%
Close Punctuation
ValueCountFrequency (%)
] 99
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Math Symbol
ValueCountFrequency (%)
+ 14
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1437
66.1%
Common 655
30.1%
Latin 83
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
10.2%
113
 
7.9%
65
 
4.5%
54
 
3.8%
39
 
2.7%
38
 
2.6%
37
 
2.6%
34
 
2.4%
33
 
2.3%
32
 
2.2%
Other values (159) 846
58.9%
Latin
ValueCountFrequency (%)
K 18
21.7%
P 11
13.3%
G 11
13.3%
H 6
 
7.2%
C 6
 
7.2%
B 5
 
6.0%
S 4
 
4.8%
g 3
 
3.6%
N 3
 
3.6%
p 3
 
3.6%
Other values (9) 13
15.7%
Common
ValueCountFrequency (%)
352
53.7%
[ 99
 
15.1%
] 99
 
15.1%
& 16
 
2.4%
16
 
2.4%
+ 14
 
2.1%
/ 13
 
2.0%
1 9
 
1.4%
_ 8
 
1.2%
2 8
 
1.2%
Other values (8) 21
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1437
66.1%
ASCII 722
33.2%
Misc Symbols 16
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
352
48.8%
[ 99
 
13.7%
] 99
 
13.7%
K 18
 
2.5%
& 16
 
2.2%
+ 14
 
1.9%
/ 13
 
1.8%
P 11
 
1.5%
G 11
 
1.5%
1 9
 
1.2%
Other values (26) 80
 
11.1%
Hangul
ValueCountFrequency (%)
146
 
10.2%
113
 
7.9%
65
 
4.5%
54
 
3.8%
39
 
2.7%
38
 
2.6%
37
 
2.6%
34
 
2.4%
33
 
2.3%
32
 
2.2%
Other values (159) 846
58.9%
Misc Symbols
ValueCountFrequency (%)
16
100.0%
Distinct53
Distinct (%)53.5%
Missing1
Missing (%)1.0%
Memory size932.0 B
2023-12-10T18:51:48.561369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length19.313131
Min length14

Characters and Unicode

Total characters1912
Distinct characters101
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)28.3%

Sample

1st row강원 강릉시 남항진동 1-4
2nd row충북 충주시 노은면 우성1길 191
3rd row강원 강릉시 옥계면 헌화로 221-4
4th row강원 강릉시 주문진읍 주문북로 222-30
5th row강원 강릉시 주문진읍 주문북로 222-30
ValueCountFrequency (%)
강원 96
19.5%
영월군 30
 
6.1%
영월읍 28
 
5.7%
양양군 23
 
4.7%
지정면 20
 
4.1%
원주시 20
 
4.1%
동강로 15
 
3.0%
현북면 11
 
2.2%
고성군 10
 
2.0%
죽왕면 9
 
1.8%
Other values (100) 230
46.7%
2023-12-10T18:51:49.188945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
393
20.6%
119
 
6.2%
116
 
6.1%
1 72
 
3.8%
66
 
3.5%
60
 
3.1%
59
 
3.1%
59
 
3.1%
2 55
 
2.9%
54
 
2.8%
Other values (91) 859
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1155
60.4%
Space Separator 393
 
20.6%
Decimal Number 331
 
17.3%
Dash Punctuation 33
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
10.3%
116
 
10.0%
66
 
5.7%
60
 
5.2%
59
 
5.1%
59
 
5.1%
54
 
4.7%
47
 
4.1%
44
 
3.8%
35
 
3.0%
Other values (79) 496
42.9%
Decimal Number
ValueCountFrequency (%)
1 72
21.8%
2 55
16.6%
6 49
14.8%
3 33
10.0%
8 30
9.1%
0 27
 
8.2%
9 26
 
7.9%
7 17
 
5.1%
5 13
 
3.9%
4 9
 
2.7%
Space Separator
ValueCountFrequency (%)
393
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1155
60.4%
Common 757
39.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
10.3%
116
 
10.0%
66
 
5.7%
60
 
5.2%
59
 
5.1%
59
 
5.1%
54
 
4.7%
47
 
4.1%
44
 
3.8%
35
 
3.0%
Other values (79) 496
42.9%
Common
ValueCountFrequency (%)
393
51.9%
1 72
 
9.5%
2 55
 
7.3%
6 49
 
6.5%
3 33
 
4.4%
- 33
 
4.4%
8 30
 
4.0%
0 27
 
3.6%
9 26
 
3.4%
7 17
 
2.2%
Other values (2) 22
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1155
60.4%
ASCII 757
39.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
393
51.9%
1 72
 
9.5%
2 55
 
7.3%
6 49
 
6.5%
3 33
 
4.4%
- 33
 
4.4%
8 30
 
4.0%
0 27
 
3.6%
9 26
 
3.4%
7 17
 
2.2%
Other values (2) 22
 
2.9%
Hangul
ValueCountFrequency (%)
119
 
10.3%
116
 
10.0%
66
 
5.7%
60
 
5.2%
59
 
5.1%
59
 
5.1%
54
 
4.7%
47
 
4.1%
44
 
3.8%
35
 
3.0%
Other values (79) 496
42.9%

extr_goods_updt_de
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20211130
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20211130
2nd row20211130
3rd row20211130
4th row20211130
5th row20211130

Common Values

ValueCountFrequency (%)
20211130 100
100.0%

Length

2023-12-10T18:51:49.424197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:51:49.591919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20211130 100
100.0%

Interactions

2023-12-10T18:51:45.643837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T18:51:49.706400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
area_nmextr_goods_cdextr_goods_nmextr_goods_addr
area_nm1.0000.6391.0001.000
extr_goods_cd0.6391.0000.0000.738
extr_goods_nm1.0000.0001.0000.999
extr_goods_addr1.0000.7380.9991.000
2023-12-10T18:51:49.876736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
extr_goods_cdarea_nm
extr_goods_cd1.0000.404
area_nm0.4041.000

Missing values

2023-12-10T18:51:45.914294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:51:46.106122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

area_nmextr_goods_cdextr_goods_nmextr_goods_addrextr_goods_updt_de
0강원 강릉시972[강원 강릉] 아라나비토종 짚라인 왕복권강원 강릉시 남항진동 1-420211130
1충북 충주시49161[충북 충주] 충주짚라인 12월충북 충주시 노은면 우성1길 19120211130
2강원 강릉시44702[강원 강릉] 알로하서프강원 강릉시 옥계면 헌화로 221-420211130
3강원 강릉시46381[강원 강릉] 청시행 비치 & 서핑강습강원 강릉시 주문진읍 주문북로 222-3020211130
4강원 강릉시46512[강원 강릉] 청시행 비치 & 서핑강습 ★핫딜강원 강릉시 주문진읍 주문북로 222-3020211130
5강원 강릉시46568[강원 강릉] 청시행 비치 & 서핑강원 강릉시 주문진읍 주문북로 222-3020211130
6강원 강릉시46569[강원 강릉] 청시행 비치 & 서핑파티강원 강릉시 주문진읍 주문북로 222-3020211130
7충북 충주시50739[충북 충주] 충주 짚라인 : 1월충북 충주시 노은면 우성1길 19120211130
8강원 고성군42480[강원 고성] 오션투유리조트 전동바이크강원 고성군 죽왕면 삼포해변길 920211130
9강원 고성군44648[강원 고성] 오션투유리조트 전동바이크강원 고성군 죽왕면 삼포해변길 920211130
area_nmextr_goods_cdextr_goods_nmextr_goods_addrextr_goods_updt_de
90강원 원주시48655[강원 원주] 오크밸리 얼리버드 리프트+렌탈PKG강원 원주시 지정면 오크밸리1길 6620211130
91강원 원주시49414[강원 원주] 강윤정프로샵★렌탈샵_오크밸리강원 원주시 지정면 구재로 21620211130
92강원 원주시49998[강원 원주] 오크밸리 리조트 SNB 렌탈&강습권강원 원주시 지정면 구재로 26520211130
93강원 원주시50253[강원 원주] 한솔오크밸리 휴레포츠 스키 렌탈/강습강원 원주시 지정면 판대리 558-220211130
94강원 원주시50269[강원 원주] 오크밸리 SNB 렌탈샵 강습권 패키지강원 원주시 지정면 구재로 26520211130
95강원 원주시50312[강원 원주] 오크밸리 SNB 렌탈샵 장비렌탈강원 원주시 지정면 구재로 26520211130
96강원 원주시50461[강원 원주] 19/20 얼리버드 오크밸리 리프트+스키렌탈pkg강원 원주시 지정면 오크밸리1길 6620211130
97강원 원주시50465[강원 원주] 오크밸리 에스스키 렌탈/강습강원 원주시 지정면 구재로 4620211130
98강원 원주시51151[강원 원주] 19/20 오크밸리 야놀자 특가★강원 원주시 지정면 오크밸리1길 6620211130
99강원 원주시51830[강원 원주] 오크밸리 리프트/리프트+단품PKG강원 원주시 지정면 오크밸리1길 6620211130