Overview

Dataset statistics

Number of variables6
Number of observations404
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.1 KiB
Average record size in memory48.3 B

Variable types

Categorical2
DateTime1
Text3

Dataset

Description충청남도 논산시 미용업에 대한 공공데이터입니다. 해당데이터는 업소명, 행정동, 주소, 전화번호 정보를 제공하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=389&beforeMenuCd=DOM_000000201001001000&publicdatapk=15054218

Alerts

행정구역 is highly imbalanced (53.5%)Imbalance

Reproduction

Analysis started2024-01-09 19:54:55.025109
Analysis finished2024-01-09 19:54:55.650057
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct15
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
미용업
155 
일반미용업
118 
피부미용업
62 
네일미용업
20 
종합미용업
17 
Other values (10)
32 

Length

Max length23
Median length5
Mean length5.0272277
Min length3

Unique

Unique2 ?
Unique (%)0.5%

Sample

1st row미용업
2nd row미용업
3rd row미용업
4th row미용업
5th row미용업

Common Values

ValueCountFrequency (%)
미용업 155
38.4%
일반미용업 118
29.2%
피부미용업 62
 
15.3%
네일미용업 20
 
5.0%
종합미용업 17
 
4.2%
네일미용업, 화장ㆍ분장 미용업 8
 
2.0%
일반미용업, 피부미용업 5
 
1.2%
피부미용업, 네일미용업 5
 
1.2%
일반미용업, 화장ㆍ분장 미용업 4
 
1.0%
일반미용업, 네일미용업 2
 
0.5%
Other values (5) 8
 
2.0%

Length

2024-01-10T04:54:55.738345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미용업 174
38.0%
일반미용업 132
28.8%
피부미용업 76
16.6%
네일미용업 40
 
8.7%
화장ㆍ분장 19
 
4.1%
종합미용업 17
 
3.7%
Distinct382
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum1966-07-01 00:00:00
Maximum2023-10-18 00:00:00
2024-01-10T04:54:55.917356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:54:56.098950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct399
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-01-10T04:54:56.406132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length5.480198
Min length2

Characters and Unicode

Total characters2214
Distinct characters385
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique394 ?
Unique (%)97.5%

Sample

1st row에덴미용원
2nd row스타미용실
3rd row꽃가마
4th row숙녀
5th row성모
ValueCountFrequency (%)
에스테틱 3
 
0.7%
논산점 3
 
0.7%
미용실 3
 
0.7%
논산 2
 
0.4%
오늘 2
 
0.4%
2
 
0.4%
헤어샵 2
 
0.4%
헤어 2
 
0.4%
헤어아트 2
 
0.4%
우리미용실 2
 
0.4%
Other values (426) 430
94.9%
2024-01-10T04:54:56.847055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
 
6.8%
147
 
6.6%
98
 
4.4%
72
 
3.3%
71
 
3.2%
58
 
2.6%
49
 
2.2%
39
 
1.8%
37
 
1.7%
35
 
1.6%
Other values (375) 1457
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1985
89.7%
Lowercase Letter 84
 
3.8%
Space Separator 49
 
2.2%
Uppercase Letter 34
 
1.5%
Open Punctuation 17
 
0.8%
Close Punctuation 17
 
0.8%
Other Punctuation 16
 
0.7%
Decimal Number 11
 
0.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
151
 
7.6%
147
 
7.4%
98
 
4.9%
72
 
3.6%
71
 
3.6%
58
 
2.9%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
Other values (327) 1242
62.6%
Lowercase Letter
ValueCountFrequency (%)
n 10
11.9%
e 10
11.9%
a 9
10.7%
o 8
9.5%
s 7
8.3%
i 7
8.3%
l 5
 
6.0%
h 5
 
6.0%
r 3
 
3.6%
g 3
 
3.6%
Other values (9) 17
20.2%
Uppercase Letter
ValueCountFrequency (%)
S 7
20.6%
O 4
11.8%
B 3
8.8%
J 3
8.8%
N 3
8.8%
M 3
8.8%
H 2
 
5.9%
T 2
 
5.9%
L 2
 
5.9%
Y 2
 
5.9%
Other values (3) 3
8.8%
Decimal Number
ValueCountFrequency (%)
0 4
36.4%
3 2
18.2%
4 1
 
9.1%
9 1
 
9.1%
8 1
 
9.1%
1 1
 
9.1%
5 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
# 4
25.0%
, 4
25.0%
& 4
25.0%
. 2
12.5%
' 2
12.5%
Space Separator
ValueCountFrequency (%)
49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1983
89.6%
Latin 118
 
5.3%
Common 111
 
5.0%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
151
 
7.6%
147
 
7.4%
98
 
4.9%
72
 
3.6%
71
 
3.6%
58
 
2.9%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
Other values (325) 1240
62.5%
Latin
ValueCountFrequency (%)
n 10
 
8.5%
e 10
 
8.5%
a 9
 
7.6%
o 8
 
6.8%
s 7
 
5.9%
S 7
 
5.9%
i 7
 
5.9%
l 5
 
4.2%
h 5
 
4.2%
O 4
 
3.4%
Other values (22) 46
39.0%
Common
ValueCountFrequency (%)
49
44.1%
( 17
 
15.3%
) 17
 
15.3%
# 4
 
3.6%
, 4
 
3.6%
0 4
 
3.6%
& 4
 
3.6%
. 2
 
1.8%
' 2
 
1.8%
3 2
 
1.8%
Other values (6) 6
 
5.4%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1983
89.6%
ASCII 229
 
10.3%
CJK 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
151
 
7.6%
147
 
7.4%
98
 
4.9%
72
 
3.6%
71
 
3.6%
58
 
2.9%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
Other values (325) 1240
62.5%
ASCII
ValueCountFrequency (%)
49
21.4%
( 17
 
7.4%
) 17
 
7.4%
n 10
 
4.4%
e 10
 
4.4%
a 9
 
3.9%
o 8
 
3.5%
s 7
 
3.1%
S 7
 
3.1%
i 7
 
3.1%
Other values (38) 88
38.4%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

행정구역
Categorical

IMBALANCE 

Distinct13
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
취암동
275 
연무읍
43 
강경읍
37 
부창동
 
22
양촌면
 
7
Other values (8)
 
20

Length

Max length4
Median length3
Mean length3.0049505
Min length3

Unique

Unique2 ?
Unique (%)0.5%

Sample

1st row연무읍
2nd row연산면
3rd row취암동
4th row취암동
5th row부창동

Common Values

ValueCountFrequency (%)
취암동 275
68.1%
연무읍 43
 
10.6%
강경읍 37
 
9.2%
부창동 22
 
5.4%
양촌면 7
 
1.7%
연산면 6
 
1.5%
성동면 3
 
0.7%
광석면 3
 
0.7%
가야곡면 2
 
0.5%
상월면 2
 
0.5%
Other values (3) 4
 
1.0%

Length

2024-01-10T04:54:57.030057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취암동 275
68.1%
연무읍 43
 
10.6%
강경읍 37
 
9.2%
부창동 22
 
5.4%
양촌면 7
 
1.7%
연산면 6
 
1.5%
성동면 3
 
0.7%
광석면 3
 
0.7%
가야곡면 2
 
0.5%
상월면 2
 
0.5%
Other values (3) 4
 
1.0%

주소
Text

Distinct392
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-01-10T04:54:57.384368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length41.5
Mean length27.378713
Min length18

Characters and Unicode

Total characters11061
Distinct characters162
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique382 ?
Unique (%)94.6%

Sample

1st row충청남도 논산시 연무읍 연무로178번길 2-1
2nd row충청남도 논산시 연산면 황산벌로 1534
3rd row충청남도 논산시 해월로 229 (화지동)
4th row충청남도 논산시 대화로70번길 9-4 (화지동)
5th row충청남도 논산시 중앙로505번길 10 (대교동)
ValueCountFrequency (%)
충청남도 404
17.6%
논산시 404
17.6%
취암동 122
 
5.3%
1층 88
 
3.8%
내동 75
 
3.3%
화지동 47
 
2.1%
연무읍 43
 
1.9%
중앙로 42
 
1.8%
강경읍 37
 
1.6%
시민로 33
 
1.4%
Other values (445) 997
43.5%
2024-01-10T04:54:57.865785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1888
 
17.1%
1 522
 
4.7%
498
 
4.5%
450
 
4.1%
422
 
3.8%
407
 
3.7%
406
 
3.7%
404
 
3.7%
404
 
3.7%
398
 
3.6%
Other values (152) 5262
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6189
56.0%
Decimal Number 2032
 
18.4%
Space Separator 1888
 
17.1%
Open Punctuation 299
 
2.7%
Close Punctuation 299
 
2.7%
Other Punctuation 210
 
1.9%
Dash Punctuation 142
 
1.3%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
498
 
8.0%
450
 
7.3%
422
 
6.8%
407
 
6.6%
406
 
6.6%
404
 
6.5%
404
 
6.5%
398
 
6.4%
326
 
5.3%
213
 
3.4%
Other values (135) 2261
36.5%
Decimal Number
ValueCountFrequency (%)
1 522
25.7%
2 293
14.4%
3 226
11.1%
0 202
 
9.9%
4 192
 
9.4%
8 158
 
7.8%
9 136
 
6.7%
5 107
 
5.3%
7 100
 
4.9%
6 96
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
G 1
50.0%
Space Separator
ValueCountFrequency (%)
1888
100.0%
Open Punctuation
ValueCountFrequency (%)
( 299
100.0%
Close Punctuation
ValueCountFrequency (%)
) 299
100.0%
Other Punctuation
ValueCountFrequency (%)
, 210
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 142
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6189
56.0%
Common 4870
44.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
498
 
8.0%
450
 
7.3%
422
 
6.8%
407
 
6.6%
406
 
6.6%
404
 
6.5%
404
 
6.5%
398
 
6.4%
326
 
5.3%
213
 
3.4%
Other values (135) 2261
36.5%
Common
ValueCountFrequency (%)
1888
38.8%
1 522
 
10.7%
( 299
 
6.1%
) 299
 
6.1%
2 293
 
6.0%
3 226
 
4.6%
, 210
 
4.3%
0 202
 
4.1%
4 192
 
3.9%
8 158
 
3.2%
Other values (5) 581
 
11.9%
Latin
ValueCountFrequency (%)
I 1
50.0%
G 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6189
56.0%
ASCII 4872
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1888
38.8%
1 522
 
10.7%
( 299
 
6.1%
) 299
 
6.1%
2 293
 
6.0%
3 226
 
4.6%
, 210
 
4.3%
0 202
 
4.1%
4 192
 
3.9%
8 158
 
3.2%
Other values (7) 583
 
12.0%
Hangul
ValueCountFrequency (%)
498
 
8.0%
450
 
7.3%
422
 
6.8%
407
 
6.6%
406
 
6.6%
404
 
6.5%
404
 
6.5%
398
 
6.4%
326
 
5.3%
213
 
3.4%
Other values (135) 2261
36.5%
Distinct243
Distinct (%)60.3%
Missing1
Missing (%)0.2%
Memory size3.3 KiB
2024-01-10T04:54:58.271950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length11.168734
Min length7

Characters and Unicode

Total characters4501
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique242 ?
Unique (%)60.0%

Sample

1st row041-741-7890
2nd row 041-735-0006
3rd row데이터 부존재
4th row041-735-3089
5th row041 -733 -0450
ValueCountFrequency (%)
041 229
25.4%
데이터 161
17.9%
부존재 161
17.9%
735 30
 
3.3%
733 16
 
1.8%
736 14
 
1.6%
734 14
 
1.6%
732 11
 
1.2%
745 11
 
1.2%
742 7
 
0.8%
Other values (244) 247
27.4%
2024-01-10T04:54:58.875088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
627
13.9%
- 484
10.8%
4 413
 
9.2%
0 374
 
8.3%
7 345
 
7.7%
1 343
 
7.6%
3 290
 
6.4%
5 208
 
4.6%
161
 
3.6%
161
 
3.6%
Other values (8) 1095
24.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2424
53.9%
Other Letter 966
 
21.5%
Space Separator 627
 
13.9%
Dash Punctuation 484
 
10.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 413
17.0%
0 374
15.4%
7 345
14.2%
1 343
14.2%
3 290
12.0%
5 208
8.6%
2 148
 
6.1%
6 118
 
4.9%
8 102
 
4.2%
9 83
 
3.4%
Other Letter
ValueCountFrequency (%)
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%
Space Separator
ValueCountFrequency (%)
627
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 484
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3535
78.5%
Hangul 966
 
21.5%

Most frequent character per script

Common
ValueCountFrequency (%)
627
17.7%
- 484
13.7%
4 413
11.7%
0 374
10.6%
7 345
9.8%
1 343
9.7%
3 290
8.2%
5 208
 
5.9%
2 148
 
4.2%
6 118
 
3.3%
Other values (2) 185
 
5.2%
Hangul
ValueCountFrequency (%)
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3535
78.5%
Hangul 966
 
21.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
627
17.7%
- 484
13.7%
4 413
11.7%
0 374
10.6%
7 345
9.8%
1 343
9.7%
3 290
8.2%
5 208
 
5.9%
2 148
 
4.2%
6 118
 
3.3%
Other values (2) 185
 
5.2%
Hangul
ValueCountFrequency (%)
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%
161
16.7%

Correlations

2024-01-10T04:54:58.973104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명행정구역
업종명1.0000.000
행정구역0.0001.000
2024-01-10T04:54:59.058629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정구역업종명
행정구역1.0000.000
업종명0.0001.000
2024-01-10T04:54:59.139341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명행정구역
업종명1.0000.000
행정구역0.0001.000

Missing values

2024-01-10T04:54:55.490752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:54:55.596786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명신고일자업소명행정구역주소전화번호
0미용업1966-07-01에덴미용원연무읍충청남도 논산시 연무읍 연무로178번길 2-1041-741-7890
1미용업1967-05-12스타미용실연산면충청남도 논산시 연산면 황산벌로 1534041-735-0006
2미용업1968-10-25꽃가마취암동충청남도 논산시 해월로 229 (화지동)데이터 부존재
3미용업1970-12-23숙녀취암동충청남도 논산시 대화로70번길 9-4 (화지동)041-735-3089
4미용업1972-11-08성모부창동충청남도 논산시 중앙로505번길 10 (대교동)041 -733 -0450
5미용업1974-01-21중앙취암동충청남도 논산시 해월로 170 (반월동)041 -735 -8227
6미용업1974-06-08귀희강경읍충청남도 논산시 강경읍 대흥로 13041 -745 -0482
7미용업1976-08-07은지헤어라인강경읍충청남도 논산시 강경읍 대흥로 14041 -745 -7790
8미용업1978-04-15수정미용실취암동충청남도 논산시 중앙로 498-10 (화지동)041- 735-6826
9미용업1979-06-05은희취암동충청남도 논산시 해월로 222 (반월동)041 -735 -2668
업종명신고일자업소명행정구역주소전화번호
394피부미용업2023-08-18Hana Spa(하나스파)취암동충청남도 논산시 대화로70번길 21, 2층 (화지동)데이터 부존재
395피부미용업2023-08-30나루에스테틱취암동충청남도 논산시 중앙로 420, 1층 (취암동)데이터 부존재
396일반미용업2023-09-04오늘 헤어취암동충청남도 논산시 해월로 175 (재)기독교대한감리회유지재단 1층 (반월동)데이터 부존재
397피부미용업2023-09-14더예쁨슈가링왁싱취암동충청남도 논산시 시민로 177, 2층 (내동)데이터 부존재
398화장ㆍ분장 미용업2023-09-14더예쁨 속눈썹취암동충청남도 논산시 시민로 177, 2층 (내동)데이터 부존재
399네일미용업2023-09-25네일플랜취암동충청남도 논산시 중앙로 216, 3층 (내동)데이터 부존재
400피부미용업2023-10-04올가(드림)뷰티취암동충청남도 논산시 시민로258번길 22, 제이파크아파트상가 105호 (내동)데이터 부존재
401피부미용업2023-10-06트윙클아이취암동충청남도 논산시 중앙로 204, 1층 (내동)데이터 부존재
402네일미용업, 화장ㆍ분장 미용업2023-10-17잇두 논산점취암동충청남도 논산시 중앙로384번길 34, 1층 (취암동)데이터 부존재
403피부미용업2023-10-18거꾸로 가는 시간 논산점취암동충청남도 논산시 중앙로 356, 1층 (취암동)데이터 부존재