Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells25349
Missing cells (%)19.5%
Duplicate rows314
Duplicate rows (%)3.1%
Total size in memory1.1 MiB
Average record size in memory114.0 B

Variable types

Text4
Categorical6
Numeric1
Unsupported1
DateTime1

Dataset

Description당진시 금연시설물 정보(금연구역구분, 위반신고전화번호, 금연시설명, 과태료,금연시설물주소, 관리기관명)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=398&beforeMenuCd=DOM_000000201001001000&publicdatapk=15042403

Alerts

금연구역범위상세 has constant value ""Constant
시도명 has constant value ""Constant
시군구명 has constant value ""Constant
금연구역지정근거명 has constant value ""Constant
위반신고전화번호 has constant value ""Constant
관리기관명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 314 (3.1%) duplicate rowsDuplicates
금연구역면적 has 7405 (74.1%) missing valuesMissing
위반과태료 has 10000 (100.0%) missing valuesMissing
소재지도로명주소 has 635 (6.3%) missing valuesMissing
소재지지번주소 has 7309 (73.1%) missing valuesMissing
금연구역면적 is highly skewed (γ1 = 43.63945475)Skewed
위반과태료 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 21:36:30.604738
Analysis finished2024-01-09 21:36:32.113517
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7814
Distinct (%)78.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T06:36:32.291428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length27
Mean length7.676
Min length1

Characters and Unicode

Total characters76760
Distinct characters938
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5629 ?
Unique (%)56.3%

Sample

1st row송산초등학교
2nd row이서방치킨
3rd row만민식당
4th row풀잎어린이집(71거3356)
5th row순성중명아파트 놀이터
ValueCountFrequency (%)
놀이터 113
 
1.0%
gs25 65
 
0.6%
씨유 53
 
0.5%
놀이시설 40
 
0.3%
당구장 33
 
0.3%
1 32
 
0.3%
잡화상 31
 
0.3%
주식회사 31
 
0.3%
담배소매업소 27
 
0.2%
어린이집 26
 
0.2%
Other values (7963) 11183
96.1%
2024-01-10T06:36:32.644286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 2086
 
2.7%
) 2084
 
2.7%
1862
 
2.4%
1764
 
2.3%
1653
 
2.2%
1214
 
1.6%
1184
 
1.5%
1144
 
1.5%
1087
 
1.4%
1073
 
1.4%
Other values (928) 61609
80.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64131
83.5%
Decimal Number 4472
 
5.8%
Open Punctuation 2086
 
2.7%
Close Punctuation 2084
 
2.7%
Space Separator 1653
 
2.2%
Uppercase Letter 1189
 
1.5%
Lowercase Letter 563
 
0.7%
Connector Punctuation 206
 
0.3%
Other Punctuation 162
 
0.2%
Dash Punctuation 153
 
0.2%
Other values (3) 61
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1862
 
2.9%
1764
 
2.8%
1214
 
1.9%
1184
 
1.8%
1144
 
1.8%
1087
 
1.7%
1073
 
1.7%
1060
 
1.7%
918
 
1.4%
859
 
1.3%
Other values (845) 51966
81.0%
Uppercase Letter
ValueCountFrequency (%)
C 204
17.2%
G 180
15.1%
S 171
14.4%
P 120
10.1%
A 51
 
4.3%
B 50
 
4.2%
O 46
 
3.9%
E 39
 
3.3%
U 34
 
2.9%
K 32
 
2.7%
Other values (16) 262
22.0%
Lowercase Letter
ValueCountFrequency (%)
m 188
33.4%
e 64
 
11.4%
o 40
 
7.1%
n 35
 
6.2%
a 31
 
5.5%
c 28
 
5.0%
r 19
 
3.4%
t 17
 
3.0%
f 17
 
3.0%
i 17
 
3.0%
Other values (15) 107
19.0%
Other Punctuation
ValueCountFrequency (%)
. 65
40.1%
, 40
24.7%
& 29
17.9%
· 5
 
3.1%
? 5
 
3.1%
! 5
 
3.1%
% 4
 
2.5%
# 4
 
2.5%
2
 
1.2%
: 1
 
0.6%
Other values (2) 2
 
1.2%
Decimal Number
ValueCountFrequency (%)
1 818
18.3%
2 655
14.6%
7 645
14.4%
0 577
12.9%
5 415
9.3%
3 369
8.3%
4 279
 
6.2%
9 256
 
5.7%
6 247
 
5.5%
8 211
 
4.7%
Other Symbol
ValueCountFrequency (%)
54
96.4%
° 2
 
3.6%
Math Symbol
ValueCountFrequency (%)
~ 3
75.0%
+ 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 2086
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2084
100.0%
Space Separator
ValueCountFrequency (%)
1653
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 153
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64179
83.6%
Common 10823
 
14.1%
Latin 1752
 
2.3%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1862
 
2.9%
1764
 
2.7%
1214
 
1.9%
1184
 
1.8%
1144
 
1.8%
1087
 
1.7%
1073
 
1.7%
1060
 
1.7%
918
 
1.4%
859
 
1.3%
Other values (841) 52014
81.0%
Latin
ValueCountFrequency (%)
C 204
 
11.6%
m 188
 
10.7%
G 180
 
10.3%
S 171
 
9.8%
P 120
 
6.8%
e 64
 
3.7%
A 51
 
2.9%
B 50
 
2.9%
O 46
 
2.6%
o 40
 
2.3%
Other values (41) 638
36.4%
Common
ValueCountFrequency (%)
( 2086
19.3%
) 2084
19.3%
1653
15.3%
1 818
 
7.6%
2 655
 
6.1%
7 645
 
6.0%
0 577
 
5.3%
5 415
 
3.8%
3 369
 
3.4%
4 279
 
2.6%
Other values (21) 1242
11.5%
Han
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64125
83.5%
ASCII 12566
 
16.4%
None 63
 
0.1%
CJK 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 2086
16.6%
) 2084
16.6%
1653
13.2%
1 818
 
6.5%
2 655
 
5.2%
7 645
 
5.1%
0 577
 
4.6%
5 415
 
3.3%
3 369
 
2.9%
4 279
 
2.2%
Other values (69) 2985
23.8%
Hangul
ValueCountFrequency (%)
1862
 
2.9%
1764
 
2.8%
1214
 
1.9%
1184
 
1.8%
1144
 
1.8%
1087
 
1.7%
1073
 
1.7%
1060
 
1.7%
918
 
1.4%
859
 
1.3%
Other values (840) 51960
81.0%
None
ValueCountFrequency (%)
54
85.7%
· 5
 
7.9%
2
 
3.2%
° 2
 
3.2%
CJK
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

금연구역범위상세
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
건물 및 영엄장
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건물 및 영엄장
2nd row건물 및 영엄장
3rd row건물 및 영엄장
4th row건물 및 영엄장
5th row건물 및 영엄장

Common Values

ValueCountFrequency (%)
건물 및 영엄장 10000
100.0%

Length

2024-01-10T06:36:32.748207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:32.813408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건물 10000
33.3%
10000
33.3%
영엄장 10000
33.3%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 10000
100.0%

Length

2024-01-10T06:36:32.884342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:32.949523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
당진시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row당진시
2nd row당진시
3rd row당진시
4th row당진시
5th row당진시

Common Values

ValueCountFrequency (%)
당진시 10000
100.0%

Length

2024-01-10T06:36:33.020036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:33.086807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
당진시 10000
100.0%
Distinct58
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T06:36:33.197643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length6.0131
Min length2

Characters and Unicode

Total characters60131
Distinct characters123
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row초등학교
2nd row음식점
3rd row음식점
4th row어린이운송용 승합차
5th row어린이놀이시설
ValueCountFrequency (%)
음식점 4472
32.5%
1096
 
8.0%
사무용건축물 1095
 
8.0%
공장 1095
 
8.0%
복합건축물 1095
 
8.0%
담배소매업소 989
 
7.2%
버스정류장 428
 
3.1%
버스정류소 391
 
2.8%
어린이놀이시설 362
 
2.6%
어린이운송용 250
 
1.8%
Other values (57) 2468
18.0%
2024-01-10T06:36:33.460620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4486
 
7.5%
4472
 
7.4%
4472
 
7.4%
3741
 
6.2%
2832
 
4.7%
2332
 
3.9%
2190
 
3.6%
2190
 
3.6%
1558
 
2.6%
1431
 
2.4%
Other values (113) 30427
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54610
90.8%
Space Separator 3741
 
6.2%
Other Punctuation 1291
 
2.1%
Decimal Number 268
 
0.4%
Lowercase Letter 134
 
0.2%
Uppercase Letter 87
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4486
 
8.2%
4472
 
8.2%
4472
 
8.2%
2832
 
5.2%
2332
 
4.3%
2190
 
4.0%
2190
 
4.0%
1558
 
2.9%
1431
 
2.6%
1400
 
2.6%
Other values (105) 27247
49.9%
Uppercase Letter
ValueCountFrequency (%)
G 29
33.3%
L 29
33.3%
P 29
33.3%
Decimal Number
ValueCountFrequency (%)
1 134
50.0%
0 134
50.0%
Space Separator
ValueCountFrequency (%)
3741
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1291
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 134
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54610
90.8%
Common 5300
 
8.8%
Latin 221
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4486
 
8.2%
4472
 
8.2%
4472
 
8.2%
2832
 
5.2%
2332
 
4.3%
2190
 
4.0%
2190
 
4.0%
1558
 
2.9%
1431
 
2.6%
1400
 
2.6%
Other values (105) 27247
49.9%
Common
ValueCountFrequency (%)
3741
70.6%
, 1291
 
24.4%
1 134
 
2.5%
0 134
 
2.5%
Latin
ValueCountFrequency (%)
m 134
60.6%
G 29
 
13.1%
L 29
 
13.1%
P 29
 
13.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54610
90.8%
ASCII 5521
 
9.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4486
 
8.2%
4472
 
8.2%
4472
 
8.2%
2832
 
5.2%
2332
 
4.3%
2190
 
4.0%
2190
 
4.0%
1558
 
2.9%
1431
 
2.6%
1400
 
2.6%
Other values (105) 27247
49.9%
ASCII
ValueCountFrequency (%)
3741
67.8%
, 1291
 
23.4%
1 134
 
2.4%
0 134
 
2.4%
m 134
 
2.4%
G 29
 
0.5%
L 29
 
0.5%
P 29
 
0.5%

금연구역지정근거명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국민건강증진법 제9조
10000 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국민건강증진법 제9조
2nd row국민건강증진법 제9조
3rd row국민건강증진법 제9조
4th row국민건강증진법 제9조
5th row국민건강증진법 제9조

Common Values

ValueCountFrequency (%)
국민건강증진법 제9조 10000
100.0%

Length

2024-01-10T06:36:33.568183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:33.640779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국민건강증진법 10000
50.0%
제9조 10000
50.0%

금연구역면적
Real number (ℝ)

MISSING  SKEWED 

Distinct2202
Distinct (%)84.9%
Missing7405
Missing (%)74.1%
Infinite0
Infinite (%)0.0%
Mean2639.8985
Minimum0
Maximum2581987
Zeros65
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:36:33.722822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile23.442
Q154.6
median100
Q3240.72
95-th percentile5364.763
Maximum2581987
Range2581987
Interquartile range (IQR)186.12

Descriptive statistics

Standard deviation53971.229
Coefficient of variation (CV)20.444433
Kurtosis2035.1137
Mean2639.8985
Median Absolute Deviation (MAD)57.1
Skewness43.639455
Sum6850536.7
Variance2.9128935 × 109
MonotonicityNot monotonic
2024-01-10T06:36:33.832415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 65
 
0.7%
50.0 7
 
0.1%
72.0 7
 
0.1%
66.0 6
 
0.1%
84.0 6
 
0.1%
59.4 6
 
0.1%
33.0 6
 
0.1%
34.0 5
 
0.1%
52.0 5
 
0.1%
135.0 5
 
0.1%
Other values (2192) 2477
 
24.8%
(Missing) 7405
74.1%
ValueCountFrequency (%)
0.0 65
0.7%
6.6 1
 
< 0.1%
7.68 1
 
< 0.1%
8.55 1
 
< 0.1%
10.15 1
 
< 0.1%
10.2 1
 
< 0.1%
11.78 1
 
< 0.1%
12.0 2
 
< 0.1%
12.53 1
 
< 0.1%
13.5 1
 
< 0.1%
ValueCountFrequency (%)
2581987.034 1
< 0.1%
834695.36 1
< 0.1%
325809.0 1
< 0.1%
207246.49 1
< 0.1%
134389.0 1
< 0.1%
105739.68 1
< 0.1%
50978.22 1
< 0.1%
49022.61 1
< 0.1%
48855.19 1
< 0.1%
42031.26 1
< 0.1%

위반과태료
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

위반신고전화번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
041-360-6053
10000 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row041-360-6053
2nd row041-360-6053
3rd row041-360-6053
4th row041-360-6053
5th row041-360-6053

Common Values

ValueCountFrequency (%)
041-360-6053 10000
100.0%

Length

2024-01-10T06:36:33.932449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:34.001550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
041-360-6053 10000
100.0%
Distinct5898
Distinct (%)63.0%
Missing635
Missing (%)6.3%
Memory size156.2 KiB
2024-01-10T06:36:34.264768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length53
Mean length25.302403
Min length11

Characters and Unicode

Total characters236957
Distinct characters527
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3521 ?
Unique (%)37.6%

Sample

1st row충청남도 당진시 ??송산면?상거길?19-1
2nd row충청남도 당진시 송악읍 반촌로 267
3rd row충청남도 당진시 석문면 대호만로 2277
4th row충청남도 당진시 계성2길 51 벽산아파트 102동 102호
5th row충청남도 당진시 순성로 453-30(순성면, 순성 중명아파트)(봉소리 58)
ValueCountFrequency (%)
충청남도 9365
 
18.5%
당진시 9365
 
18.5%
송악읍 1687
 
3.3%
읍내동 1359
 
2.7%
신평면 1051
 
2.1%
합덕읍 730
 
1.4%
석문면 684
 
1.4%
송산면 555
 
1.1%
당진중앙2로 409
 
0.8%
1층 341
 
0.7%
Other values (5005) 25053
49.5%
2024-01-10T06:36:34.688580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41233
 
17.4%
10879
 
4.6%
10503
 
4.4%
10094
 
4.3%
9824
 
4.1%
9689
 
4.1%
9533
 
4.0%
9372
 
4.0%
1 9282
 
3.9%
2 6101
 
2.6%
Other values (517) 110447
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 141395
59.7%
Space Separator 41233
 
17.4%
Decimal Number 38818
 
16.4%
Open Punctuation 4537
 
1.9%
Close Punctuation 4531
 
1.9%
Dash Punctuation 3840
 
1.6%
Other Punctuation 2347
 
1.0%
Uppercase Letter 205
 
0.1%
Math Symbol 28
 
< 0.1%
Lowercase Letter 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10879
 
7.7%
10503
 
7.4%
10094
 
7.1%
9824
 
6.9%
9689
 
6.9%
9533
 
6.7%
9372
 
6.6%
5695
 
4.0%
4052
 
2.9%
3946
 
2.8%
Other values (463) 57808
40.9%
Uppercase Letter
ValueCountFrequency (%)
B 37
18.0%
A 36
17.6%
C 21
10.2%
L 20
9.8%
G 15
7.3%
F 9
 
4.4%
I 9
 
4.4%
P 8
 
3.9%
H 8
 
3.9%
E 8
 
3.9%
Other values (9) 34
16.6%
Decimal Number
ValueCountFrequency (%)
1 9282
23.9%
2 6101
15.7%
3 4694
12.1%
0 3262
 
8.4%
4 3108
 
8.0%
7 2766
 
7.1%
5 2722
 
7.0%
6 2570
 
6.6%
8 2299
 
5.9%
9 2014
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 1690
72.0%
. 373
 
15.9%
? 244
 
10.4%
: 11
 
0.5%
@ 9
 
0.4%
8
 
0.3%
· 6
 
0.3%
/ 4
 
0.2%
& 2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
a 10
47.6%
e 6
28.6%
c 2
 
9.5%
n 1
 
4.8%
d 1
 
4.8%
b 1
 
4.8%
Math Symbol
ValueCountFrequency (%)
~ 24
85.7%
< 2
 
7.1%
> 2
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 4535
> 99.9%
[ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4529
> 99.9%
] 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
41233
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3840
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 141395
59.7%
Common 95336
40.2%
Latin 226
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10879
 
7.7%
10503
 
7.4%
10094
 
7.1%
9824
 
6.9%
9689
 
6.9%
9533
 
6.7%
9372
 
6.6%
5695
 
4.0%
4052
 
2.9%
3946
 
2.8%
Other values (463) 57808
40.9%
Common
ValueCountFrequency (%)
41233
43.3%
1 9282
 
9.7%
2 6101
 
6.4%
3 4694
 
4.9%
( 4535
 
4.8%
) 4529
 
4.8%
- 3840
 
4.0%
0 3262
 
3.4%
4 3108
 
3.3%
7 2766
 
2.9%
Other values (19) 11986
 
12.6%
Latin
ValueCountFrequency (%)
B 37
16.4%
A 36
15.9%
C 21
 
9.3%
L 20
 
8.8%
G 15
 
6.6%
a 10
 
4.4%
F 9
 
4.0%
I 9
 
4.0%
P 8
 
3.5%
H 8
 
3.5%
Other values (15) 53
23.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 141392
59.7%
ASCII 95548
40.3%
None 14
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41233
43.2%
1 9282
 
9.7%
2 6101
 
6.4%
3 4694
 
4.9%
( 4535
 
4.7%
) 4529
 
4.7%
- 3840
 
4.0%
0 3262
 
3.4%
4 3108
 
3.3%
7 2766
 
2.9%
Other values (42) 12198
 
12.8%
Hangul
ValueCountFrequency (%)
10879
 
7.7%
10503
 
7.4%
10094
 
7.1%
9824
 
6.9%
9689
 
6.9%
9533
 
6.7%
9372
 
6.6%
5695
 
4.0%
4052
 
2.9%
3946
 
2.8%
Other values (461) 57805
40.9%
None
ValueCountFrequency (%)
8
57.1%
· 6
42.9%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%

소재지지번주소
Text

MISSING 

Distinct2389
Distinct (%)88.8%
Missing7309
Missing (%)73.1%
Memory size156.2 KiB
2024-01-10T06:36:34.982393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length24.693794
Min length13

Characters and Unicode

Total characters66451
Distinct characters340
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2158 ?
Unique (%)80.2%

Sample

1st row충청남도 당진시 신평면 상오리 260번지 3호
2nd row충청남도 당진시 대덕동 1465번지
3rd row충청남도 당진시 석문면 교로리 844-9 비치타운
4th row충청남도 당진시 채운동 355-3
5th row충청남도 당진시 순성면 봉소리 437번지 5호
ValueCountFrequency (%)
충청남도 2691
19.0%
당진시 2691
19.0%
읍내동 509
 
3.6%
송악읍 447
 
3.2%
신평면 305
 
2.2%
석문면 216
 
1.5%
합덕읍 196
 
1.4%
1호 192
 
1.4%
복운리 169
 
1.2%
송산면 165
 
1.2%
Other values (1980) 6596
46.5%
2024-01-10T06:36:35.379037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15299
23.0%
2968
 
4.5%
2809
 
4.2%
2793
 
4.2%
2793
 
4.2%
2792
 
4.2%
2703
 
4.1%
2694
 
4.1%
1 2386
 
3.6%
1988
 
3.0%
Other values (330) 27226
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39282
59.1%
Space Separator 15299
 
23.0%
Decimal Number 11173
 
16.8%
Dash Punctuation 553
 
0.8%
Uppercase Letter 47
 
0.1%
Other Punctuation 30
 
< 0.1%
Close Punctuation 29
 
< 0.1%
Open Punctuation 29
 
< 0.1%
Lowercase Letter 5
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2968
 
7.6%
2809
 
7.2%
2793
 
7.1%
2793
 
7.1%
2792
 
7.1%
2703
 
6.9%
2694
 
6.9%
1988
 
5.1%
1851
 
4.7%
1716
 
4.4%
Other values (293) 14175
36.1%
Uppercase Letter
ValueCountFrequency (%)
C 10
21.3%
K 7
14.9%
B 5
10.6%
T 5
10.6%
A 4
 
8.5%
E 3
 
6.4%
G 3
 
6.4%
F 3
 
6.4%
H 1
 
2.1%
I 1
 
2.1%
Other values (5) 5
10.6%
Decimal Number
ValueCountFrequency (%)
1 2386
21.4%
2 1490
13.3%
3 1183
10.6%
5 1111
9.9%
6 1072
9.6%
4 1022
9.1%
9 886
 
7.9%
0 732
 
6.6%
8 712
 
6.4%
7 579
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 21
70.0%
. 5
 
16.7%
/ 2
 
6.7%
@ 2
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
e 3
60.0%
n 1
 
20.0%
o 1
 
20.0%
Space Separator
ValueCountFrequency (%)
15299
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 553
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39282
59.1%
Common 27117
40.8%
Latin 52
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2968
 
7.6%
2809
 
7.2%
2793
 
7.1%
2793
 
7.1%
2792
 
7.1%
2703
 
6.9%
2694
 
6.9%
1988
 
5.1%
1851
 
4.7%
1716
 
4.4%
Other values (293) 14175
36.1%
Common
ValueCountFrequency (%)
15299
56.4%
1 2386
 
8.8%
2 1490
 
5.5%
3 1183
 
4.4%
5 1111
 
4.1%
6 1072
 
4.0%
4 1022
 
3.8%
9 886
 
3.3%
0 732
 
2.7%
8 712
 
2.6%
Other values (9) 1224
 
4.5%
Latin
ValueCountFrequency (%)
C 10
19.2%
K 7
13.5%
B 5
9.6%
T 5
9.6%
A 4
 
7.7%
E 3
 
5.8%
G 3
 
5.8%
e 3
 
5.8%
F 3
 
5.8%
H 1
 
1.9%
Other values (8) 8
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39282
59.1%
ASCII 27169
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15299
56.3%
1 2386
 
8.8%
2 1490
 
5.5%
3 1183
 
4.4%
5 1111
 
4.1%
6 1072
 
3.9%
4 1022
 
3.8%
9 886
 
3.3%
0 732
 
2.7%
8 712
 
2.6%
Other values (27) 1276
 
4.7%
Hangul
ValueCountFrequency (%)
2968
 
7.6%
2809
 
7.2%
2793
 
7.1%
2793
 
7.1%
2792
 
7.1%
2703
 
6.9%
2694
 
6.9%
1988
 
5.1%
1851
 
4.7%
1716
 
4.4%
Other values (293) 14175
36.1%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
당진시보건소
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row당진시보건소
2nd row당진시보건소
3rd row당진시보건소
4th row당진시보건소
5th row당진시보건소

Common Values

ValueCountFrequency (%)
당진시보건소 10000
100.0%

Length

2024-01-10T06:36:35.485715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:36:35.550840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
당진시보건소 10000
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-12-09 00:00:00
Maximum2019-12-09 00:00:00
2024-01-10T06:36:35.608613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:36:35.675088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T06:36:31.699268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:36:35.724291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금연구역구분금연구역면적
금연구역구분1.0000.000
금연구역면적0.0001.000

Missing values

2024-01-10T06:36:31.804904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:36:31.950535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T06:36:32.058602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

금연구역명금연구역범위상세시도명시군구명금연구역구분금연구역지정근거명금연구역면적위반과태료위반신고전화번호소재지도로명주소소재지지번주소관리기관명데이터기준일자
5710송산초등학교건물 및 영엄장충청남도당진시초등학교국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 ??송산면?상거길?19-1<NA>당진시보건소2019-12-09
5245이서방치킨건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조28.7<NA>041-360-6053충청남도 당진시 송악읍 반촌로 267<NA>당진시보건소2019-12-09
3504만민식당건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조113.95<NA>041-360-6053충청남도 당진시 석문면 대호만로 2277<NA>당진시보건소2019-12-09
8442풀잎어린이집(71거3356)건물 및 영엄장충청남도당진시어린이운송용 승합차국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 계성2길 51 벽산아파트 102동 102호<NA>당진시보건소2019-12-09
2160순성중명아파트 놀이터건물 및 영엄장충청남도당진시어린이놀이시설국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 순성로 453-30(순성면, 순성 중명아파트)(봉소리 58)<NA>당진시보건소2019-12-09
10644꼬꼬닭개장건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 신평면 덕평로 1188충청남도 당진시 신평면 상오리 260번지 3호당진시보건소2019-12-09
5151본죽기지시리점건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조33.0<NA>041-360-6053충청남도 당진시 송악읍 반촌로 98-1<NA>당진시보건소2019-12-09
10125동경참치(먹거리길)건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 먹거리길 42-41 (대덕동)충청남도 당진시 대덕동 1465번지당진시보건소2019-12-09
8340EIE국제어학원(74마9208)건물 및 영엄장충청남도당진시어린이운송용 승합차국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 밤절로42-127 1층 (원당동)<NA>당진시보건소2019-12-09
1706복합건축물-61건물 및 영엄장충청남도당진시사무용건축물, 공장 및 복합건축물국민건강증진법 제9조2334.0<NA>041-360-6053충청남도 당진시 석문면 왜목길 35(비치타운)충청남도 당진시 석문면 교로리 844-9 비치타운당진시보건소2019-12-09
금연구역명금연구역범위상세시도명시군구명금연구역구분금연구역지정근거명금연구역면적위반과태료위반신고전화번호소재지도로명주소소재지지번주소관리기관명데이터기준일자
4591자연산미꾸라지건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조54.48<NA>041-360-6053충청남도 당진시 정미면 염솔로 418-1<NA>당진시보건소2019-12-09
8848해나루 어린이집(경계10m)건물 및 영엄장충청남도당진시어린이집경계10m국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 남부로 8<NA>당진시보건소2019-12-09
8450아이캔어린이집(79구3171)건물 및 영엄장충청남도당진시어린이운송용 승합차국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 송악읍 정곡로 20-19<NA>당진시보건소2019-12-09
4656주인마음건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조52.0<NA>041-360-6053충청남도 당진시 합덕읍 덕평로 377<NA>당진시보건소2019-12-09
13985브레인상사건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 북문길 82 (읍내동)<NA>당진시보건소2019-12-09
7748당진 사랑마루건물 및 영엄장충청남도당진시사회복지시설국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 신평면 거산3거리길 74-11, 103동 201호 (당진신평코아루아파트)<NA>당진시보건소2019-12-09
8040송산세안아파트 놀이터건물 및 영엄장충청남도당진시어린이놀이시설국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 당산1로 563(송산면, 세안근로복지아파트)(매곡리 330)<NA>당진시보건소2019-12-09
6726스타덤PC방 당진본점건물 및 영엄장충청남도당진시게임제공업소국민건강증진법 제9조<NA><NA>041-360-6053충청남도 당진시 당진중앙2로 133, 2층 202~203호 (읍내동)<NA>당진시보건소2019-12-09
3116여제호프건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조146.53<NA>041-360-6053충청남도 당진시 신평면 신평시장2길 8<NA>당진시보건소2019-12-09
2742동경일식건물 및 영엄장충청남도당진시음식점국민건강증진법 제9조232.1<NA>041-360-6053충청남도 당진시 당진중앙2로 236 (읍내동)<NA>당진시보건소2019-12-09

Duplicate rows

Most frequently occurring

금연구역명금연구역범위상세시도명시군구명금연구역구분금연구역지정근거명금연구역면적위반신고전화번호소재지도로명주소소재지지번주소관리기관명데이터기준일자# duplicates
0(주)아산개발건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 송악읍 서해로 6622<NA>당진시보건소2019-12-092
1(주)코리아세븐 당진한진점건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 송악읍 부곡공단로 350-5<NA>당진시보건소2019-12-092
2(주)코리아세븐 신성대인성관점건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 정미면 대학로 1<NA>당진시보건소2019-12-092
311호 천사어린이공원건물 및 영엄장충청남도당진시어린이놀이시설국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 우강면 송산리<NA>당진시보건소2019-12-092
4153 당구장건물 및 영엄장충청남도당진시실내체육시설국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 대호지면 대호로 18, 1층<NA>당진시보건소2019-12-092
5365 골프존건물 및 영엄장충청남도당진시실내체육시설국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 신평면 신평길 100<NA>당진시보건소2019-12-092
6365할인마트건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 송산면 송산로 817<NA>당진시보건소2019-12-092
7CU 당진전망대점건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 신평면 삽교천3길 93. F동 153호<NA>당진시보건소2019-12-092
8GS25 당진송산건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 송산면 틀모시로 1122<NA>당진시보건소2019-12-092
9GS25 당진채운점건물 및 영엄장충청남도당진시담배소매업소국민건강증진법 제9조<NA>041-360-6053충청남도 당진시 서부로 246 (채운동)<NA>당진시보건소2019-12-092