Overview

Dataset statistics

Number of variables: 14
Number of observations: 10000
Missing cells: 18451
Missing cells (%): 13.2%
Duplicate rows: 471
Duplicate rows (%): 4.7%
Total size in memory: 1.2 MiB
Average record size in memory: 124.0 B
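
The percentages above follow directly from the raw counts. The sketch below recomputes them, assuming the missing-cell percentage is taken over all cells (rows times columns) and the duplicate percentage over all rows. The function names are illustrative and are not part of ydata-profiling.

```python
def percent_missing_cells(missing_cells: int, n_rows: int, n_columns: int) -> float:
    """Share of missing cells among all cells, in percent."""
    total_cells = n_rows * n_columns
    if total_cells == 0:
        return 0.0
    return 100.0 * missing_cells / total_cells


def percent_duplicate_rows(duplicate_rows: int, n_rows: int) -> float:
    """Share of duplicate rows among all rows, in percent."""
    if n_rows == 0:
        return 0.0
    return 100.0 * duplicate_rows / n_rows


# Counts taken from the dataset statistics in this report.
print(round(percent_missing_cells(18451, 10000, 14), 1))  # 13.2
print(round(percent_duplicate_rows(471, 10000), 1))       # 4.7
```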

Variable types

Categorical: 3
Text: 7
Numeric: 3
Unsupported: 1

Dataset

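Description: Provides data on food sanitation businesses in Gangnam-gu, including business name, business category, business type, location, administrative disposition date, administrative disposition details, legal basis, and detection type.
URL: https://www.data.go.kr/data/15075958/fileData.do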

Alerts

행정처분상태 has constant value "처분확정" [Constant]
Dataset has 471 (4.7%) duplicate rows [Duplicates]
지도점검일자 is highly overall correlated with 위반일자 [High correlation]
위반일자 is highly overall correlated with 지도점검일자 [High correlation]
업종명 is highly imbalanced (52.6%) [Imbalance]
적발구분 is highly imbalanced (50.5%) [Imbalance]
위반내역분류 has 10000 (100.0%) missing values [Missing]
처분기간 has 8390 (83.9%) missing values [Missing]
지도점검일자 is highly skewed (γ1 = -79.66730454) [Skewed]
위반일자 is highly skewed (γ1 = -65.66091583) [Skewed]
위반내역분류 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2023-12-12 22:11:20.416694
Analysis finished: 2023-12-12 22:11:24.451605
Duration: 4.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

IMBALANCE 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

일반음식점 6313
단란주점 1174
유흥주점영업 1034
휴게음식점 336
건강기능식품일반판매업 246
Other values (13) 897

Length

Max length: 13
Median length: 5
Mean length: 5.3734
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 단란주점
5th row: 일반음식점

Common Values

Value | Count | Frequency (%)
일반음식점 | 6313 | 63.1%
단란주점 | 1174 | 11.7%
유흥주점영업 | 1034 | 10.3%
휴게음식점 | 336 | 3.4%
건강기능식품일반판매업 | 246 | 2.5%
유통전문판매업 | 246 | 2.5%
식품제조가공업 | 208 | 2.1%
즉석판매제조가공업 | 192 | 1.9%
식품소분업 | 56 | 0.6%
식품등 수입판매업 | 56 | 0.6%
Other values (8) | 139 | 1.4%
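
The 52.6% imbalance reported for 업종명 is consistent with a score of one minus the normalized Shannon entropy of the category counts. The sketch below works under that assumption and is not a confirmed copy of the library's implementation. The ten largest counts come from the table above. The report does not show how the remaining 139 rows split across the other 8 categories, so that split is hypothetical and the result is only approximate.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log2(k), where H is the Shannon entropy (base 2) of the
    class distribution and k is the number of non-empty classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    counts = [c for c in counts if c > 0]
    n_classes = len(counts)
    if n_classes <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts)
    return 1.0 - entropy / math.log2(n_classes)


# Ten largest counts from the Common Values table for 업종명.
known_counts = [6313, 1174, 1034, 336, 246, 246, 208, 192, 56, 56]
# Hypothetical split of the remaining 139 rows over the other 8 categories.
assumed_rest = [18, 18, 18, 17, 17, 17, 17, 17]

score = imbalance_score(known_counts + assumed_rest)
print(f"{score:.3f}")  # close to the reported 0.526, given the assumed split
```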

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
일반음식점 | 6313 | 62.8%
단란주점 | 1174 | 11.7%
유흥주점영업 | 1034 | 10.3%
휴게음식점 | 336 | 3.3%
건강기능식품일반판매업 | 246 | 2.4%
유통전문판매업 | 246 | 2.4%
식품제조가공업 | 208 | 2.1%
즉석판매제조가공업 | 192 | 1.9%
수입판매업 | 56 | 0.6%
식품등 | 56 | 0.6%
Other values (9) | 195 | 1.9%
Distinct: 68
Distinct (%): 0.7%
Missing: 20
Missing (%): 0.2%
Memory size: 156.2 KiB

Length

Max length: 15
Median length: 14
Mean length: 3.5184369
Min length: 2

Characters and Unicode

Total characters: 35114
Distinct characters: 142
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
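
As a rough illustration of the category counts in this part of the report, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It returns two-letter codes (`Lo` for Other Letter, `Ps` for Open Punctuation, `Pe` for Close Punctuation, `Po` for Other Punctuation, `Zs` for Space Separator) rather than the long names shown in the report. The helper name `category_counts` is illustrative. Script and block properties are not covered, because the standard library does not expose them.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# One of the sample rows shown for this variable.
print(category_counts(["탕류(보신용)"]))
# Counter({'Lo': 5, 'Ps': 1, 'Pe': 1})
```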

Unique

Unique: 6
Unique (%): 0.1%

Sample

1st row: 한식
2nd row: 일식
3rd row: 탕류(보신용)
4th row: 단란주점
5th row: 한식
ValueCountFrequency (%)
경양식 2585
25.6%
한식 2283
22.7%
단란주점 1174
11.6%
룸살롱 781
 
7.7%
분식 445
 
4.4%
일식 285
 
2.8%
유통전문판매업 246
 
2.4%
식품제조가공업 207
 
2.1%
즉석판매제조가공업 192
 
1.9%
중국식 170
 
1.7%
Other values (59) 1711
17.0%

Most occurring characters

ValueCountFrequency (%)
6257
17.8%
2585
 
7.4%
2585
 
7.4%
2283
 
6.5%
1338
 
3.8%
1234
 
3.5%
1212
 
3.5%
1174
 
3.3%
1052
 
3.0%
873
 
2.5%
Other values (132) 14521
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34266
97.6%
Close Punctuation 296
 
0.8%
Open Punctuation 296
 
0.8%
Other Punctuation 157
 
0.4%
Space Separator 99
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6257
18.3%
2585
 
7.5%
2585
 
7.5%
2283
 
6.7%
1338
 
3.9%
1234
 
3.6%
1212
 
3.5%
1174
 
3.4%
1052
 
3.1%
873
 
2.5%
Other values (127) 13673
39.9%
Other Punctuation
ValueCountFrequency (%)
/ 152
96.8%
, 5
 
3.2%
Close Punctuation
ValueCountFrequency (%)
) 296
100.0%
Open Punctuation
ValueCountFrequency (%)
( 296
100.0%
Space Separator
ValueCountFrequency (%)
99
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34266
97.6%
Common 848
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6257
18.3%
2585
 
7.5%
2585
 
7.5%
2283
 
6.7%
1338
 
3.9%
1234
 
3.6%
1212
 
3.5%
1174
 
3.4%
1052
 
3.1%
873
 
2.5%
Other values (127) 13673
39.9%
Common
ValueCountFrequency (%)
) 296
34.9%
( 296
34.9%
/ 152
17.9%
99
 
11.7%
, 5
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34266
97.6%
ASCII 848
 
2.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6257
18.3%
2585
 
7.5%
2585
 
7.5%
2283
 
6.7%
1338
 
3.9%
1234
 
3.6%
1212
 
3.5%
1174
 
3.4%
1052
 
3.1%
873
 
2.5%
Other values (127) 13673
39.9%
ASCII
ValueCountFrequency (%)
) 296
34.9%
( 296
34.9%
/ 152
17.9%
99
 
11.7%
, 5
 
0.6%
Distinct: 6084
Distinct (%): 60.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length38
Median length34
Mean length4.5324
Min length1

Characters and Unicode

Total characters45324
Distinct characters986
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4308 ?
Unique (%)43.1%

Sample

1st row뚝심한우암소직판장
2nd row카와세미
3rd row민속주점(보물점방)
4th row
5th row을지로골뱅이와진짜생맥주
ValueCountFrequency (%)
주식회사 51
 
0.5%
28
 
0.3%
에구찌 26
 
0.2%
25
 
0.2%
23
 
0.2%
스시히로바 21
 
0.2%
비스트로 20
 
0.2%
센스 19
 
0.2%
영동식당 19
 
0.2%
19
 
0.2%
Other values (6349) 10584
97.7%

Most occurring characters

ValueCountFrequency (%)
1565
 
3.5%
1394
 
3.1%
) 993
 
2.2%
( 989
 
2.2%
954
 
2.1%
879
 
1.9%
837
 
1.8%
657
 
1.4%
612
 
1.4%
512
 
1.1%
Other values (976) 35932
79.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40393
89.1%
Close Punctuation 993
 
2.2%
Open Punctuation 989
 
2.2%
Space Separator 837
 
1.8%
Uppercase Letter 800
 
1.8%
Lowercase Letter 773
 
1.7%
Decimal Number 465
 
1.0%
Other Punctuation 67
 
0.1%
Dash Punctuation 5
 
< 0.1%
Letter Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1565
 
3.9%
1394
 
3.5%
954
 
2.4%
879
 
2.2%
657
 
1.6%
612
 
1.5%
512
 
1.3%
485
 
1.2%
479
 
1.2%
441
 
1.1%
Other values (901) 32415
80.2%
Uppercase Letter
ValueCountFrequency (%)
E 65
 
8.1%
O 61
 
7.6%
G 55
 
6.9%
S 54
 
6.8%
B 52
 
6.5%
W 48
 
6.0%
A 48
 
6.0%
I 47
 
5.9%
L 42
 
5.2%
C 40
 
5.0%
Other values (16) 288
36.0%
Lowercase Letter
ValueCountFrequency (%)
o 100
12.9%
e 86
11.1%
i 74
9.6%
t 69
8.9%
n 66
8.5%
a 55
 
7.1%
r 54
 
7.0%
s 48
 
6.2%
u 41
 
5.3%
l 36
 
4.7%
Other values (14) 144
18.6%
Decimal Number
ValueCountFrequency (%)
2 95
20.4%
1 86
18.5%
0 71
15.3%
5 41
8.8%
8 37
 
8.0%
3 33
 
7.1%
4 31
 
6.7%
9 28
 
6.0%
7 28
 
6.0%
6 15
 
3.2%
Other Punctuation
ValueCountFrequency (%)
& 22
32.8%
. 13
19.4%
10
14.9%
' 8
 
11.9%
? 6
 
9.0%
, 5
 
7.5%
; 1
 
1.5%
1
 
1.5%
: 1
 
1.5%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 993
100.0%
Open Punctuation
ValueCountFrequency (%)
( 989
100.0%
Space Separator
ValueCountFrequency (%)
837
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40393
89.1%
Common 3356
 
7.4%
Latin 1575
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1565
 
3.9%
1394
 
3.5%
954
 
2.4%
879
 
2.2%
657
 
1.6%
612
 
1.5%
512
 
1.3%
485
 
1.2%
479
 
1.2%
441
 
1.1%
Other values (901) 32415
80.2%
Latin
ValueCountFrequency (%)
o 100
 
6.3%
e 86
 
5.5%
i 74
 
4.7%
t 69
 
4.4%
n 66
 
4.2%
E 65
 
4.1%
O 61
 
3.9%
a 55
 
3.5%
G 55
 
3.5%
S 54
 
3.4%
Other values (42) 890
56.5%
Common
ValueCountFrequency (%)
) 993
29.6%
( 989
29.5%
837
24.9%
2 95
 
2.8%
1 86
 
2.6%
0 71
 
2.1%
5 41
 
1.2%
8 37
 
1.1%
3 33
 
1.0%
4 31
 
0.9%
Other values (13) 143
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40391
89.1%
ASCII 4918
 
10.9%
None 11
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1565
 
3.9%
1394
 
3.5%
954
 
2.4%
879
 
2.2%
657
 
1.6%
612
 
1.5%
512
 
1.3%
485
 
1.2%
479
 
1.2%
441
 
1.1%
Other values (900) 32413
80.2%
ASCII
ValueCountFrequency (%)
) 993
20.2%
( 989
20.1%
837
17.0%
o 100
 
2.0%
2 95
 
1.9%
1 86
 
1.7%
e 86
 
1.7%
i 74
 
1.5%
0 71
 
1.4%
t 69
 
1.4%
Other values (61) 1518
30.9%
None
ValueCountFrequency (%)
10
90.9%
1
 
9.1%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct: 5768
Distinct (%): 57.7%
Missing: 2
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length70
Median length62
Mean length28.45179
Min length20

Characters and Unicode

Total characters284461
Distinct characters426
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3834 ?
Unique (%)38.3%

Sample

1st row서울특별시 강남구 도곡동 542번지 1호 1층
2nd row서울특별시 강남구 신사동 651번지 8호 하늬솔빌딩
3rd row서울특별시 강남구 역삼동 747번지 24호 지하1층
4th row서울특별시 강남구 역삼동 823번지 8호
5th row서울특별시 강남구 대치동 988번지 2호 지상1층
Value | Count | Frequency (%)
서울특별시 | 9998 | 17.9%
강남구 | 9998 | 17.9%
역삼동 | 2560 | 4.6%
논현동 | 1913 | 3.4%
지하1층 | 1636 | 2.9%
신사동 | 1546 | 2.8%
삼성동 | 1132 | 2.0%
대치동 | 953 | 1.7%
0호 | 914 | 1.6%
청담동 | 861 | 1.5%
Other values (2324) | 24266 | 43.5%
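
The table above counts individual words rather than whole address strings. A minimal sketch of such a count follows, assuming words are obtained by splitting on whitespace. That tokenization rule is an assumption, and the helper name `word_counts` is illustrative. The input is the five sample rows listed for this variable.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens across all strings in `values`."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


# The five sample rows shown for this variable.
sample_addresses = [
    "서울특별시 강남구 도곡동 542번지 1호 1층",
    "서울특별시 강남구 신사동 651번지 8호 하늬솔빌딩",
    "서울특별시 강남구 역삼동 747번지 24호 지하1층",
    "서울특별시 강남구 역삼동 823번지 8호",
    "서울특별시 강남구 대치동 988번지 2호 지상1층",
]

for word, count in word_counts(sample_addresses).most_common(3):
    print(word, count)
```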

Most occurring characters

ValueCountFrequency (%)
70382
24.7%
14334
 
5.0%
1 13095
 
4.6%
10330
 
3.6%
10143
 
3.6%
10139
 
3.6%
10130
 
3.6%
10063
 
3.5%
10041
 
3.5%
10031
 
3.5%
Other values (416) 115773
40.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 160719
56.5%
Space Separator 70382
24.7%
Decimal Number 51540
 
18.1%
Other Punctuation 924
 
0.3%
Dash Punctuation 243
 
0.1%
Uppercase Letter 233
 
0.1%
Open Punctuation 176
 
0.1%
Close Punctuation 169
 
0.1%
Math Symbol 60
 
< 0.1%
Lowercase Letter 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14334
 
8.9%
10330
 
6.4%
10143
 
6.3%
10139
 
6.3%
10130
 
6.3%
10063
 
6.3%
10041
 
6.2%
10031
 
6.2%
10016
 
6.2%
10008
 
6.2%
Other values (365) 55484
34.5%
Uppercase Letter
ValueCountFrequency (%)
B 83
35.6%
A 19
 
8.2%
S 14
 
6.0%
F 12
 
5.2%
C 12
 
5.2%
T 11
 
4.7%
D 9
 
3.9%
E 8
 
3.4%
L 8
 
3.4%
K 7
 
3.0%
Other values (12) 50
21.5%
Decimal Number
ValueCountFrequency (%)
1 13095
25.4%
2 6629
12.9%
6 4968
 
9.6%
3 4281
 
8.3%
0 4045
 
7.8%
5 3901
 
7.6%
4 3883
 
7.5%
7 3819
 
7.4%
8 3646
 
7.1%
9 3273
 
6.4%
Lowercase Letter
ValueCountFrequency (%)
e 3
23.1%
b 2
15.4%
a 2
15.4%
n 2
15.4%
t 1
 
7.7%
g 1
 
7.7%
m 1
 
7.7%
o 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 706
76.4%
. 204
 
22.1%
/ 10
 
1.1%
& 3
 
0.3%
; 1
 
0.1%
Space Separator
ValueCountFrequency (%)
70382
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 243
100.0%
Open Punctuation
ValueCountFrequency (%)
( 176
100.0%
Close Punctuation
ValueCountFrequency (%)
) 169
100.0%
Math Symbol
ValueCountFrequency (%)
~ 60
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 160719
56.5%
Common 123496
43.4%
Latin 246
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14334
 
8.9%
10330
 
6.4%
10143
 
6.3%
10139
 
6.3%
10130
 
6.3%
10063
 
6.3%
10041
 
6.2%
10031
 
6.2%
10016
 
6.2%
10008
 
6.2%
Other values (365) 55484
34.5%
Latin
ValueCountFrequency (%)
B 83
33.7%
A 19
 
7.7%
S 14
 
5.7%
F 12
 
4.9%
C 12
 
4.9%
T 11
 
4.5%
D 9
 
3.7%
E 8
 
3.3%
L 8
 
3.3%
K 7
 
2.8%
Other values (20) 63
25.6%
Common
ValueCountFrequency (%)
70382
57.0%
1 13095
 
10.6%
2 6629
 
5.4%
6 4968
 
4.0%
3 4281
 
3.5%
0 4045
 
3.3%
5 3901
 
3.2%
4 3883
 
3.1%
7 3819
 
3.1%
8 3646
 
3.0%
Other values (11) 4847
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 160716
56.5%
ASCII 123742
43.5%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70382
56.9%
1 13095
 
10.6%
2 6629
 
5.4%
6 4968
 
4.0%
3 4281
 
3.5%
0 4045
 
3.3%
5 3901
 
3.2%
4 3883
 
3.1%
7 3819
 
3.1%
8 3646
 
2.9%
Other values (41) 5093
 
4.1%
Hangul
ValueCountFrequency (%)
14334
 
8.9%
10330
 
6.4%
10143
 
6.3%
10139
 
6.3%
10130
 
6.3%
10063
 
6.3%
10041
 
6.2%
10031
 
6.2%
10016
 
6.2%
10008
 
6.2%
Other values (364) 55481
34.5%
Compat Jamo
ValueCountFrequency (%)
3
100.0%

지도점검일자
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 4138
Distinct (%): 41.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20085229
Minimum: 2005011
Maximum: 20220211
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2005011
5-th percentile: 19970199
Q1: 20040522
median: 20090104
Q3: 20141028
95-th percentile: 20200401
Maximum: 20220211
Range: 18215200
Interquartile range (IQR): 100506.25

Descriptive statistics

Standard deviation: 195055.48
Coefficient of variation (CV): 0.0097113893
Kurtosis: 7384.3498
Mean: 20085229
Median Absolute Deviation (MAD): 50406
Skewness: -79.667305
Sum: 2.0085229 × 10^11
Variance: 3.8046641 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
20050113 | 438 | 4.4%
20070911 | 154 | 1.5%
20190101 | 88 | 0.9%
20200101 | 85 | 0.9%
20161231 | 78 | 0.8%
20061212 | 68 | 0.7%
20070912 | 65 | 0.7%
20210101 | 64 | 0.6%
20190705 | 49 | 0.5%
20210401 | 47 | 0.5%
Other values (4128) | 8864 | 88.6%

Value | Count | Frequency (%)
2005011 | 1 | < 0.1%
19851207 | 1 | < 0.1%
19860726 | 3 | < 0.1%
19861010 | 2 | < 0.1%
19861216 | 1 | < 0.1%
19870327 | 1 | < 0.1%
19870510 | 1 | < 0.1%
19870515 | 1 | < 0.1%
19880301 | 1 | < 0.1%
19880626 | 1 | < 0.1%

Value | Count | Frequency (%)
20220211 | 2 | < 0.1%
20220210 | 1 | < 0.1%
20220205 | 1 | < 0.1%
20220128 | 2 | < 0.1%
20220120 | 1 | < 0.1%
20220119 | 3 | < 0.1%
20220118 | 3 | < 0.1%
20220114 | 3 | < 0.1%
20220107 | 4 | < 0.1%
20220104 | 3 | < 0.1%
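
The values of 지도점검일자 look like dates stored as YYYYMMDD integers, and the minimum, 2005011, has only seven digits. The sketch below assumes that encoding and shows one way to convert such integers to dates while flagging malformed entries. The function name `parse_yyyymmdd` is illustrative.

```python
from datetime import date, datetime
from typing import Optional


def parse_yyyymmdd(value: int) -> Optional[date]:
    """Convert an integer such as 20220211 to a date.
    Returns None when the value is not a valid 8-digit YYYYMMDD date."""
    text = str(value)
    if len(text) != 8:
        return None
    try:
        return datetime.strptime(text, "%Y%m%d").date()
    except ValueError:
        return None


print(parse_yyyymmdd(20220211))  # 2022-02-11
print(parse_yyyymmdd(2005011))   # None (seven digits, malformed)
```

Parsing the column this way before profiling would let it be treated as a date rather than a real number, which may also make the skewness and correlation alerts easier to interpret.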

행정처분상태
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

처분확정 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 처분확정
2nd row: 처분확정
3rd row: 처분확정
4th row: 처분확정
5th row: 처분확정

Common Values

Value | Count | Frequency (%)
처분확정 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
처분확정 | 10000 | 100.0%
Distinct: 5422
Distinct (%): 54.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length183
Median length138
Mean length20.1302
Min length2

Characters and Unicode

Total characters201302
Distinct characters333
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4490 ?
Unique (%)44.9%

Sample

1st row과태료50만원(2013.02.14완납)
2nd row과태료부과15만원_자진납부12만원
3rd row영업정지 1월갈음 과징금 10,800,000원 부과/시설개수명령(즉시)
4th row과태료30만원 부과 및 시정명령
5th row영업소폐쇄(직권폐업-12.29일자)
ValueCountFrequency (%)
영업소폐쇄 837
 
4.8%
영업정지 657
 
3.8%
시정명령 553
 
3.2%
437
 
2.5%
과태료 343
 
2.0%
과징금 335
 
1.9%
자진납부 319
 
1.8%
갈음 312
 
1.8%
과태료부과 255
 
1.5%
231
 
1.3%
Other values (6602) 13049
75.3%

Most occurring characters

ValueCountFrequency (%)
0 17336
 
8.6%
. 16688
 
8.3%
1 15731
 
7.8%
2 11952
 
5.9%
) 7467
 
3.7%
( 7460
 
3.7%
7357
 
3.7%
5960
 
3.0%
5958
 
3.0%
5821
 
2.9%
Other values (323) 99572
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89433
44.4%
Decimal Number 67360
33.5%
Other Punctuation 19256
 
9.6%
Close Punctuation 7477
 
3.7%
Open Punctuation 7471
 
3.7%
Space Separator 7357
 
3.7%
Math Symbol 2067
 
1.0%
Dash Punctuation 835
 
0.4%
Lowercase Letter 19
 
< 0.1%
Modifier Symbol 13
 
< 0.1%
Other values (2) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5960
 
6.7%
5958
 
6.7%
5821
 
6.5%
5500
 
6.1%
5259
 
5.9%
4006
 
4.5%
3751
 
4.2%
3588
 
4.0%
3137
 
3.5%
3106
 
3.5%
Other values (271) 43347
48.5%
Lowercase Letter
ValueCountFrequency (%)
r 2
 
10.5%
d 2
 
10.5%
h 2
 
10.5%
l 2
 
10.5%
m 1
 
5.3%
u 1
 
5.3%
o 1
 
5.3%
s 1
 
5.3%
w 1
 
5.3%
p 1
 
5.3%
Other values (5) 5
26.3%
Other Punctuation
ValueCountFrequency (%)
. 16688
86.7%
, 1685
 
8.8%
: 267
 
1.4%
/ 206
 
1.1%
156
 
0.8%
? 142
 
0.7%
% 88
 
0.5%
' 15
 
0.1%
* 5
 
< 0.1%
3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 17336
25.7%
1 15731
23.4%
2 11952
17.7%
5 4045
 
6.0%
3 3922
 
5.8%
9 3314
 
4.9%
7 2959
 
4.4%
4 2801
 
4.2%
8 2672
 
4.0%
6 2628
 
3.9%
Math Symbol
ValueCountFrequency (%)
~ 1772
85.7%
262
 
12.7%
+ 18
 
0.9%
× 9
 
0.4%
> 4
 
0.2%
< 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 7467
99.9%
] 10
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 7460
99.9%
[ 11
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
N 2
66.7%
O 1
33.3%
Space Separator
ValueCountFrequency (%)
7357
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 835
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 13
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 111847
55.6%
Hangul 89433
44.4%
Latin 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5960
 
6.7%
5958
 
6.7%
5821
 
6.5%
5500
 
6.1%
5259
 
5.9%
4006
 
4.5%
3751
 
4.2%
3588
 
4.0%
3137
 
3.5%
3106
 
3.5%
Other values (271) 43347
48.5%
Common
ValueCountFrequency (%)
0 17336
15.5%
. 16688
14.9%
1 15731
14.1%
2 11952
10.7%
) 7467
6.7%
( 7460
6.7%
7357
6.6%
5 4045
 
3.6%
3 3922
 
3.5%
9 3314
 
3.0%
Other values (25) 16575
14.8%
Latin
ValueCountFrequency (%)
r 2
 
9.1%
N 2
 
9.1%
d 2
 
9.1%
h 2
 
9.1%
l 2
 
9.1%
m 1
 
4.5%
O 1
 
4.5%
u 1
 
4.5%
o 1
 
4.5%
s 1
 
4.5%
Other values (7) 7
31.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 111439
55.4%
Hangul 89421
44.4%
Arrows 262
 
0.1%
Punctuation 156
 
0.1%
None 12
 
< 0.1%
Compat Jamo 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17336
15.6%
. 16688
15.0%
1 15731
14.1%
2 11952
10.7%
) 7467
6.7%
( 7460
6.7%
7357
6.6%
5 4045
 
3.6%
3 3922
 
3.5%
9 3314
 
3.0%
Other values (38) 16167
14.5%
Hangul
ValueCountFrequency (%)
5960
 
6.7%
5958
 
6.7%
5821
 
6.5%
5500
 
6.2%
5259
 
5.9%
4006
 
4.5%
3751
 
4.2%
3588
 
4.0%
3137
 
3.5%
3106
 
3.5%
Other values (265) 43335
48.5%
Arrows
ValueCountFrequency (%)
262
100.0%
Punctuation
ValueCountFrequency (%)
156
100.0%
None
ValueCountFrequency (%)
× 9
75.0%
3
 
25.0%
Compat Jamo
ValueCountFrequency (%)
6
50.0%
2
 
16.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%

위반내역분류
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB
Distinct: 553
Distinct (%): 5.6%
Missing: 39
Missing (%): 0.4%
Memory size: 156.2 KiB

Length

Max length36
Median length31
Mean length10.098083
Min length1

Characters and Unicode

Total characters100587
Distinct characters125
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique347 ?
Unique (%)3.5%

Sample

1st row식품위생법
2nd row법 제82조제2항
3rd row식품위생법 제57조,제58조,제65조
4th row식품위생법
5th row식품위생법 제58조
ValueCountFrequency (%)
식품위생법 6375
27.7%
4677
20.3%
1877
 
8.1%
제75조 1832
 
8.0%
제71조 1572
 
6.8%
제58조 1253
 
5.4%
제74조 927
 
4.0%
제101조제2항제1호 422
 
1.8%
58조 252
 
1.1%
제55조 178
 
0.8%
Other values (447) 3670
15.9%

Most occurring characters

ValueCountFrequency (%)
13131
13.1%
11660
11.6%
10472
10.4%
9383
9.3%
6901
 
6.9%
6885
 
6.8%
6845
 
6.8%
6783
 
6.7%
7 5448
 
5.4%
1 4827
 
4.8%
Other values (115) 18252
18.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63990
63.6%
Decimal Number 21979
 
21.9%
Space Separator 13131
 
13.1%
Other Punctuation 1459
 
1.5%
Dash Punctuation 13
 
< 0.1%
Uppercase Letter 5
 
< 0.1%
Math Symbol 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11660
18.2%
10472
16.4%
9383
14.7%
6901
10.8%
6885
10.8%
6845
10.7%
6783
10.6%
1878
 
2.9%
908
 
1.4%
787
 
1.2%
Other values (90) 1488
 
2.3%
Decimal Number
ValueCountFrequency (%)
7 5448
24.8%
1 4827
22.0%
5 4544
20.7%
8 2168
 
9.9%
2 1642
 
7.5%
4 1368
 
6.2%
0 879
 
4.0%
3 524
 
2.4%
6 511
 
2.3%
9 68
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 1443
98.9%
? 13
 
0.9%
. 2
 
0.1%
; 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
F 2
40.0%
K 1
20.0%
D 1
20.0%
S 1
20.0%
Space Separator
ValueCountFrequency (%)
13131
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63990
63.6%
Common 36592
36.4%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11660
18.2%
10472
16.4%
9383
14.7%
6901
10.8%
6885
10.8%
6845
10.7%
6783
10.6%
1878
 
2.9%
908
 
1.4%
787
 
1.2%
Other values (90) 1488
 
2.3%
Common
ValueCountFrequency (%)
13131
35.9%
7 5448
14.9%
1 4827
 
13.2%
5 4544
 
12.4%
8 2168
 
5.9%
2 1642
 
4.5%
, 1443
 
3.9%
4 1368
 
3.7%
0 879
 
2.4%
3 524
 
1.4%
Other values (11) 618
 
1.7%
Latin
ValueCountFrequency (%)
F 2
40.0%
K 1
20.0%
D 1
20.0%
S 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63990
63.6%
ASCII 36597
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13131
35.9%
7 5448
14.9%
1 4827
 
13.2%
5 4544
 
12.4%
8 2168
 
5.9%
2 1642
 
4.5%
, 1443
 
3.9%
4 1368
 
3.7%
0 879
 
2.4%
3 524
 
1.4%
Other values (15) 623
 
1.7%
Hangul
ValueCountFrequency (%)
11660
18.2%
10472
16.4%
9383
14.7%
6901
10.8%
6885
10.8%
6845
10.7%
6783
10.6%
1878
 
2.9%
908
 
1.4%
787
 
1.2%
Other values (90) 1488
 
2.3%

위반일자
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 4260
Distinct (%): 42.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20084396
Minimum: 2000126
Maximum: 20220803
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2000126
5-th percentile: 19970228
Q1: 20040525
median: 20090110
Q3: 20141029
95-th percentile: 20200401
Maximum: 20220803
Range: 18220677
Interquartile range (IQR): 100504

Descriptive statistics

Standard deviation: 219189.81
Coefficient of variation (CV): 0.010913438
Kurtosis: 5066.3557
Mean: 20084396
Median Absolute Deviation (MAD): 50397.5
Skewness: -65.660916
Sum: 2.0084396 × 10^11
Variance: 4.8044171 × 10^10
Monotonicity: Not monotonic
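
A strongly negative skewness such as the one above can come from very few values lying far below the rest, for example the seven-digit minimum 2000126. The sketch below shows the effect with the moment-based skewness m3 / m2^1.5. This is an illustrative estimator and may differ from the bias-adjusted one a profiling tool uses, and the sample data is made up for the demonstration.

```python
def skewness(values):
    """Moment-based (population) skewness: m3 / m2**1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        return 0.0
    return m3 / m2 ** 1.5


# Symmetric data has zero skewness.
print(skewness([1, 2, 3, 4, 5]))  # 0.0

# Hypothetical sample: 99 well-formed values plus one truncated value.
with_outlier = [20090110] * 99 + [2000126]
print(skewness(with_outlier) < 0)  # True
```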
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
20050113 | 438 | 4.4%
20070910 | 151 | 1.5%
20190101 | 116 | 1.2%
20200101 | 82 | 0.8%
20070911 | 79 | 0.8%
20161231 | 78 | 0.8%
20061212 | 67 | 0.7%
20210101 | 54 | 0.5%
20190705 | 46 | 0.5%
20070124 | 46 | 0.5%
Other values (4250) | 8843 | 88.4%

Value | Count | Frequency (%)
2000126 | 1 | < 0.1%
10090722 | 1 | < 0.1%
19550522 | 1 | < 0.1%
19851207 | 1 | < 0.1%
19860726 | 2 | < 0.1%
19860826 | 1 | < 0.1%
19861010 | 2 | < 0.1%
19861216 | 1 | < 0.1%
19870327 | 1 | < 0.1%
19870510 | 1 | < 0.1%

Value | Count | Frequency (%)
20220803 | 1 | < 0.1%
20220211 | 2 | < 0.1%
20220210 | 1 | < 0.1%
20220205 | 1 | < 0.1%
20220128 | 2 | < 0.1%
20220120 | 1 | < 0.1%
20220119 | 3 | < 0.1%
20220118 | 3 | < 0.1%
20220114 | 3 | < 0.1%
20220111 | 1 | < 0.1%
Distinct: 4097
Distinct (%): 41.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length191
Median length123
Mean length15.3539
Min length1

Characters and Unicode

Total characters153539
Distinct characters728
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3099 ?
Unique (%)31.0%

Sample

1st row건강진단미필 3명
2nd row재난배상책임보험 가입 의무기간 위반(30일 이하)
3rd row일반음식점 영업장에 손님이 이용할 수 있는 자막용영상장치 설치/일반음식점 영업자가 손님이 노래부르도록 허용
4th row허가증업소에 미보관 상호변경신고 미실시(외부간판에 표기된 상호가 다름)
5th row시설물 멸실
ValueCountFrequency (%)
827
 
3.0%
무단폐업 789
 
2.9%
시설물멸실 506
 
1.8%
미필 465
 
1.7%
설치 409
 
1.5%
영업장 384
 
1.4%
유흥접객원고용 372
 
1.3%
위생교육 357
 
1.3%
멸실 332
 
1.2%
종업원 325
 
1.2%
Other values (5040) 22854
82.7%

Most occurring characters

ValueCountFrequency (%)
17921
 
11.7%
5818
 
3.8%
3539
 
2.3%
3360
 
2.2%
3329
 
2.2%
2696
 
1.8%
) 2626
 
1.7%
( 2624
 
1.7%
2594
 
1.7%
2458
 
1.6%
Other values (718) 106574
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 118673
77.3%
Space Separator 17921
 
11.7%
Decimal Number 7702
 
5.0%
Other Punctuation 3311
 
2.2%
Close Punctuation 2631
 
1.7%
Open Punctuation 2630
 
1.7%
Dash Punctuation 222
 
0.1%
Uppercase Letter 169
 
0.1%
Lowercase Letter 141
 
0.1%
Math Symbol 73
 
< 0.1%
Other values (5) 66
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5818
 
4.9%
3539
 
3.0%
3360
 
2.8%
3329
 
2.8%
2696
 
2.3%
2594
 
2.2%
2458
 
2.1%
2362
 
2.0%
2110
 
1.8%
2098
 
1.8%
Other values (625) 88309
74.4%
Uppercase Letter
ValueCountFrequency (%)
R 31
18.3%
S 20
11.8%
E 18
10.7%
N 18
10.7%
A 13
7.7%
D 12
 
7.1%
I 11
 
6.5%
C 8
 
4.7%
G 8
 
4.7%
H 7
 
4.1%
Other values (10) 23
13.6%
Lowercase Letter
ValueCountFrequency (%)
g 51
36.2%
m 19
 
13.5%
o 16
 
11.3%
b 10
 
7.1%
l 8
 
5.7%
e 5
 
3.5%
d 4
 
2.8%
a 4
 
2.8%
y 4
 
2.8%
n 3
 
2.1%
Other values (9) 17
 
12.1%
Other Punctuation
ValueCountFrequency (%)
, 1336
40.4%
/ 886
26.8%
. 845
25.5%
: 128
 
3.9%
? 67
 
2.0%
' 12
 
0.4%
% 10
 
0.3%
8
 
0.2%
* 8
 
0.2%
# 4
 
0.1%
Other values (3) 7
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 1870
24.3%
2 1850
24.0%
0 1223
15.9%
3 653
 
8.5%
9 452
 
5.9%
6 433
 
5.6%
4 370
 
4.8%
8 329
 
4.3%
5 305
 
4.0%
7 217
 
2.8%
Math Symbol
ValueCountFrequency (%)
+ 20
27.4%
~ 19
26.0%
> 13
17.8%
= 11
15.1%
< 5
 
6.8%
3
 
4.1%
× 2
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 2624
99.8%
[ 3
 
0.1%
1
 
< 0.1%
{ 1
 
< 0.1%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
26
86.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Close Punctuation
ValueCountFrequency (%)
) 2626
99.8%
] 3
 
0.1%
1
 
< 0.1%
} 1
 
< 0.1%
Other Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
² 1
33.3%
Final Punctuation
ValueCountFrequency (%)
10
66.7%
5
33.3%
Initial Punctuation
ValueCountFrequency (%)
5
50.0%
5
50.0%
Space Separator
ValueCountFrequency (%)
17921
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 222
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 118671
77.3%
Common 34555
 
22.5%
Latin 310
 
0.2%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5818
 
4.9%
3539
 
3.0%
3360
 
2.8%
3329
 
2.8%
2696
 
2.3%
2594
 
2.2%
2458
 
2.1%
2362
 
2.0%
2110
 
1.8%
2098
 
1.8%
Other values (623) 88307
74.4%
Common
ValueCountFrequency (%)
17921
51.9%
) 2626
 
7.6%
( 2624
 
7.6%
1 1870
 
5.4%
2 1850
 
5.4%
, 1336
 
3.9%
0 1223
 
3.5%
/ 886
 
2.6%
. 845
 
2.4%
3 653
 
1.9%
Other values (43) 2721
 
7.9%
Latin
ValueCountFrequency (%)
g 51
16.5%
R 31
 
10.0%
S 20
 
6.5%
m 19
 
6.1%
E 18
 
5.8%
N 18
 
5.8%
o 16
 
5.2%
A 13
 
4.2%
D 12
 
3.9%
I 11
 
3.5%
Other values (29) 101
32.6%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 118666
77.3%
ASCII 34789
 
22.7%
Punctuation 27
 
< 0.1%
CJK Compat 26
 
< 0.1%
None 16
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Arrows 3
 
< 0.1%
CJK 3
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17921
51.5%
) 2626
 
7.5%
( 2624
 
7.5%
1 1870
 
5.4%
2 1850
 
5.3%
, 1336
 
3.8%
0 1223
 
3.5%
/ 886
 
2.5%
. 845
 
2.4%
3 653
 
1.9%
Other values (63) 2955
 
8.5%
Hangul
ValueCountFrequency (%)
5818
 
4.9%
3539
 
3.0%
3360
 
2.8%
3329
 
2.8%
2696
 
2.3%
2594
 
2.2%
2458
 
2.1%
2362
 
2.0%
2110
 
1.8%
2098
 
1.8%
Other values (618) 88302
74.4%
CJK Compat
ValueCountFrequency (%)
26
100.0%
Punctuation
ValueCountFrequency (%)
10
37.0%
5
18.5%
5
18.5%
5
18.5%
2
 
7.4%
None
ValueCountFrequency (%)
8
50.0%
× 2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
² 1
 
6.2%
1
 
6.2%
Arrows
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Distinct: 5422
Distinct (%): 54.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length183
Median length138
Mean length20.1302
Min length2

Characters and Unicode

Total characters201302
Distinct characters333
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4490 ?
Unique (%)44.9%

Sample

1st row과태료50만원(2013.02.14완납)
2nd row과태료부과15만원_자진납부12만원
3rd row영업정지 1월갈음 과징금 10,800,000원 부과/시설개수명령(즉시)
4th row과태료30만원 부과 및 시정명령
5th row영업소폐쇄(직권폐업-12.29일자)
ValueCountFrequency (%)
영업소폐쇄 837
 
4.8%
영업정지 657
 
3.8%
시정명령 553
 
3.2%
437
 
2.5%
과태료 343
 
2.0%
과징금 335
 
1.9%
자진납부 319
 
1.8%
갈음 312
 
1.8%
과태료부과 255
 
1.5%
231
 
1.3%
Other values (6602) 13049
75.3%

Most occurring characters

ValueCountFrequency (%)
0 17336
 
8.6%
. 16688
 
8.3%
1 15731
 
7.8%
2 11952
 
5.9%
) 7467
 
3.7%
( 7460
 
3.7%
7357
 
3.7%
5960
 
3.0%
5958
 
3.0%
5821
 
2.9%
Other values (323) 99572
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89433
44.4%
Decimal Number 67360
33.5%
Other Punctuation 19256
 
9.6%
Close Punctuation 7477
 
3.7%
Open Punctuation 7471
 
3.7%
Space Separator 7357
 
3.7%
Math Symbol 2067
 
1.0%
Dash Punctuation 835
 
0.4%
Lowercase Letter 19
 
< 0.1%
Modifier Symbol 13
 
< 0.1%
Other values (2) 14
 
< 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 5960 | 6.7%
 | 5958 | 6.7%
 | 5821 | 6.5%
 | 5500 | 6.1%
 | 5259 | 5.9%
 | 4006 | 4.5%
 | 3751 | 4.2%
 | 3588 | 4.0%
 | 3137 | 3.5%
 | 3106 | 3.5%
Other values (271) | 43347 | 48.5%

Lowercase Letter
Value | Count | Frequency (%)
r | 2 | 10.5%
d | 2 | 10.5%
h | 2 | 10.5%
l | 2 | 10.5%
m | 1 | 5.3%
u | 1 | 5.3%
o | 1 | 5.3%
s | 1 | 5.3%
w | 1 | 5.3%
p | 1 | 5.3%
Other values (5) | 5 | 26.3%

Other Punctuation
Value | Count | Frequency (%)
. | 16688 | 86.7%
, | 1685 | 8.8%
: | 267 | 1.4%
/ | 206 | 1.1%
 | 156 | 0.8%
? | 142 | 0.7%
% | 88 | 0.5%
' | 15 | 0.1%
* | 5 | < 0.1%
 | 3 | < 0.1%

Decimal Number
Value | Count | Frequency (%)
0 | 17336 | 25.7%
1 | 15731 | 23.4%
2 | 11952 | 17.7%
5 | 4045 | 6.0%
3 | 3922 | 5.8%
9 | 3314 | 4.9%
7 | 2959 | 4.4%
4 | 2801 | 4.2%
8 | 2672 | 4.0%
6 | 2628 | 3.9%

Math Symbol
Value | Count | Frequency (%)
~ | 1772 | 85.7%
 | 262 | 12.7%
+ | 18 | 0.9%
× | 9 | 0.4%
> | 4 | 0.2%
< | 2 | 0.1%

Close Punctuation
Value | Count | Frequency (%)
) | 7467 | 99.9%
] | 10 | 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 7460 | 99.9%
[ | 11 | 0.1%

Uppercase Letter
Value | Count | Frequency (%)
N | 2 | 66.7%
O | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 7357 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 835 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 13 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 11 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 111847 | 55.6%
Hangul | 89433 | 44.4%
Latin | 22 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 5960 | 6.7%
 | 5958 | 6.7%
 | 5821 | 6.5%
 | 5500 | 6.1%
 | 5259 | 5.9%
 | 4006 | 4.5%
 | 3751 | 4.2%
 | 3588 | 4.0%
 | 3137 | 3.5%
 | 3106 | 3.5%
Other values (271) | 43347 | 48.5%

Common
Value | Count | Frequency (%)
0 | 17336 | 15.5%
. | 16688 | 14.9%
1 | 15731 | 14.1%
2 | 11952 | 10.7%
) | 7467 | 6.7%
( | 7460 | 6.7%
(space) | 7357 | 6.6%
5 | 4045 | 3.6%
3 | 3922 | 3.5%
9 | 3314 | 3.0%
Other values (25) | 16575 | 14.8%

Latin
Value | Count | Frequency (%)
r | 2 | 9.1%
N | 2 | 9.1%
d | 2 | 9.1%
h | 2 | 9.1%
l | 2 | 9.1%
m | 1 | 4.5%
O | 1 | 4.5%
u | 1 | 4.5%
o | 1 | 4.5%
s | 1 | 4.5%
Other values (7) | 7 | 31.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 111439 | 55.4%
Hangul | 89421 | 44.4%
Arrows | 262 | 0.1%
Punctuation | 156 | 0.1%
None | 12 | < 0.1%
Compat Jamo | 12 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 17336 | 15.6%
. | 16688 | 15.0%
1 | 15731 | 14.1%
2 | 11952 | 10.7%
) | 7467 | 6.7%
( | 7460 | 6.7%
(space) | 7357 | 6.6%
5 | 4045 | 3.6%
3 | 3922 | 3.5%
9 | 3314 | 3.0%
Other values (38) | 16167 | 14.5%

Hangul
Value | Count | Frequency (%)
 | 5960 | 6.7%
 | 5958 | 6.7%
 | 5821 | 6.5%
 | 5500 | 6.2%
 | 5259 | 5.9%
 | 4006 | 4.5%
 | 3751 | 4.2%
 | 3588 | 4.0%
 | 3137 | 3.5%
 | 3106 | 3.5%
Other values (265) | 43335 | 48.5%

Arrows
Value | Count | Frequency (%)
 | 262 | 100.0%

Punctuation
Value | Count | Frequency (%)
 | 156 | 100.0%

None
Value | Count | Frequency (%)
× | 9 | 75.0%
 | 3 | 25.0%

Compat Jamo
Value | Count | Frequency (%)
 | 6 | 50.0%
 | 2 | 16.7%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%
 | 1 | 8.3%

처분기간 (disposition period)
Real number (ℝ)

MISSING

Distinct: 30
Distinct (%): 1.9%
Missing: 8390
Missing (%): 83.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.119876
Minimum: 0
Maximum: 30
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 6
Q1: 8
median: 15
Q3: 15
95-th percentile: 21
Maximum: 30
Range: 30
Interquartile range (IQR): 7
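
The Range and IQR figures follow directly from the listed quantiles. A minimal Python sketch of that arithmetic is given here; the 1.5 × IQR Tukey fences are added only as an illustrative assumption, since the report itself does not compute them.

```python
# Quantiles copied from the report for 처분기간.
minimum, q1, median, q3, maximum = 0, 8, 15, 15, 30

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)

# Tukey fences: an illustrative convention, not something the report states.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, lower_fence, upper_fence)
```

Under that assumed convention, values above the upper fence would be candidates for a closer look rather than confirmed outliers.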

Descriptive statistics

Standard deviation: 4.8597396
Coefficient of variation (CV): 0.37041049
Kurtosis: 0.91543307
Mean: 13.119876
Median Absolute Deviation (MAD): 0
Skewness: 0.10286507
Sum: 21123
Variance: 23.617069
Monotonicity: Not monotonic
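
Several of these statistics can be cross-checked against one another. The sketch below assumes the usual definitions (mean = sum / non-missing count, CV = standard deviation / mean, variance = standard deviation squared) and takes the 10000 total observations from the report overview; the variable names are illustrative.

```python
n_total = 10000                 # observations in the dataset (report overview)
n_missing = 8390                # missing 처분기간 values
n_valid = n_total - n_missing   # 1610 non-missing values

total = 21123                   # Sum
std = 4.8597396                 # Standard deviation

mean = total / n_valid          # about 13.119876
cv = std / mean                 # about 0.37041049
variance = std ** 2             # about 23.617069

# The most common value, 15, is also the median and occurs 967 times.
# When more than half of the values equal the median, more than half of
# the absolute deviations are 0, so their median (the MAD) is 0.
share_at_median = 967 / n_valid
mad_is_zero = share_at_median > 0.5

print(round(mean, 6), round(cv, 6), round(variance, 4), mad_is_zero)
```

This also suggests why the MAD is reported as 0 even though the standard deviation is close to 5: roughly 60% of the non-missing values are exactly 15.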
Histogram with fixed size bins (bins=30)
Value | Count | Frequency (%)
15 | 967 | 9.7%
7 | 314 | 3.1%
10 | 56 | 0.6%
8 | 29 | 0.3%
22 | 28 | 0.3%
5 | 28 | 0.3%
3 | 28 | 0.3%
18 | 19 | 0.2%
29 | 17 | 0.2%
17 | 13 | 0.1%
Other values (20) | 111 | 1.1%
(Missing) | 8390 | 83.9%

Value | Count | Frequency (%)
0 | 2 | < 0.1%
1 | 8 | 0.1%
2 | 2 | < 0.1%
3 | 28 | 0.3%
4 | 12 | 0.1%
5 | 28 | 0.3%
6 | 8 | 0.1%
7 | 314 | 3.1%
8 | 29 | 0.3%
10 | 56 | 0.6%

Value | Count | Frequency (%)
30 | 1 | < 0.1%
29 | 17 | 0.2%
28 | 9 | 0.1%
27 | 2 | < 0.1%
26 | 1 | < 0.1%
25 | 9 | 0.1%
24 | 1 | < 0.1%
23 | 11 | 0.1%
22 | 28 | 0.3%
21 | 4 | < 0.1%

적발구분 (detection type)
Categorical

IMBALANCE

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
수시 | 5558
기타 | 4197
합동 | 183
일제 | 61
<NA> | 1

Length

Max length: 4
Median length: 2
Mean length: 2.0002
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 기타
2nd row: 수시
3rd row: 수시
4th row: 기타
5th row: 수시

Common Values

Value | Count | Frequency (%)
수시 (ad hoc) | 5558 | 55.6%
기타 (other) | 4197 | 42.0%
합동 (joint) | 183 | 1.8%
일제 (simultaneous sweep) | 61 | 0.6%
<NA> | 1 | < 0.1%
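
The IMBALANCE flag on 적발구분 can be related to these counts. The report does not state its formula, so the sketch below assumes the score is one minus the Shannon entropy of the category frequencies, normalized by the maximum entropy for the number of categories; `imbalance_score` is an illustrative name, not an API of the profiling tool.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of category counts.

    0 means perfectly balanced categories; values near 1 mean that
    a few categories dominate. Assumed formula, see the lead-in.
    """
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts from the Common Values table for 적발구분 (including the <NA> label).
counts = {"수시": 5558, "기타": 4197, "합동": 183, "일제": 61, "<NA>": 1}
score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

Under this assumption the score comes out at about 50.5%, which agrees with the imbalance figure listed for 적발구분 in the report's alerts.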

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
수시 | 5558 | 55.6%
기타 | 4197 | 42.0%
합동 | 183 | 1.8%
일제 | 61 | 0.6%
na | 1 | < 0.1%

Interactions


Correlations

Variable | 업종명 | 업태명 | 지도점검일자 | 위반일자 | 처분기간 | 적발구분
업종명 | 1.000 | 1.000 | NaN | 0.000 | 0.307 | 0.357
업태명 | 1.000 | 1.000 | NaN | 0.000 | 0.368 | 0.452
지도점검일자 | NaN | NaN | 1.000 | NaN | NaN | NaN
위반일자 | 0.000 | 0.000 | NaN | 1.000 | NaN | 0.000
처분기간 | 0.307 | 0.368 | NaN | NaN | 1.000 | 0.151
적발구분 | 0.357 | 0.452 | NaN | 0.000 | 0.151 | 1.000

Variable | 업종명 | 적발구분
업종명 | 1.000 | 0.202
적발구분 | 0.202 | 1.000

Variable | 지도점검일자 | 위반일자 | 처분기간 | 업종명 | 적발구분
지도점검일자 | 1.000 | 1.000 | -0.083 | 0.000 | 0.000
위반일자 | 1.000 | 1.000 | -0.084 | 0.000 | 0.000
처분기간 | -0.083 | -0.084 | 1.000 | 0.140 | 0.114
업종명 | 0.000 | 0.000 | 0.140 | 1.000 | 0.202
적발구분 | 0.000 | 0.000 | 0.114 | 0.202 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
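
As a rough illustration of nullity correlation, the sketch below computes the Pearson correlation between presence indicators (1 = value present, 0 = missing) of two columns. The rows and the column names `x`, `y`, `w`, `z` are hypothetical toy data, and the function is a plain-Python assumption about how such a measure can be computed, not the report's own implementation.

```python
import math

def nullity_correlation(rows, col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    Returns None when either column is always present or always missing,
    because a constant indicator has zero variance.
    """
    a = [0 if row[col_a] is None else 1 for row in rows]
    b = [0 if row[col_b] is None else 1 for row in rows]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy rows: x and y are missing together, w is present
# exactly where x is missing, and z is always missing.
rows = [
    {"x": 1, "y": "a", "w": None, "z": None},
    {"x": None, "y": None, "w": 5, "z": None},
    {"x": 3, "y": "c", "w": None, "z": None},
    {"x": None, "y": None, "w": 7, "z": None},
]

print(nullity_correlation(rows, "x", "y"))   # missing together
print(nullity_correlation(rows, "x", "w"))   # one present when the other is missing
print(nullity_correlation(rows, "x", "z"))   # undefined for a constant column
```

A column that is missing in every row, as the alerts report for 위반내역분류, would fall into the undefined case under this definition.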

Sample

Index | 업종명 | 업태명 | 업소명 | 소재지지번 | 지도점검일자 | 행정처분상태 | 처분명 | 위반내역분류 | 법적근거 | 위반일자 | 위반내용 | 처분내용 | 처분기간 | 적발구분
9484 | 일반음식점 | 한식 | 뚝심한우암소직판장 | 서울특별시 강남구 도곡동 542번지 1호 1층 | 20130118 | 처분확정 | 과태료50만원(2013.02.14완납) | <NA> | 식품위생법 | 20130118 | 건강진단미필 3명 | 과태료50만원(2013.02.14완납) | <NA> | 기타
10942 | 일반음식점 | 일식 | 카와세미 | 서울특별시 강남구 신사동 651번지 8호 하늬솔빌딩 | 20191203 | 처분확정 | 과태료부과15만원_자진납부12만원 | <NA> | 법 제82조제2항 | 20191203 | 재난배상책임보험 가입 의무기간 위반(30일 이하) | 과태료부과15만원_자진납부12만원 | <NA> | 수시
25869 | 일반음식점 | 탕류(보신용) | 민속주점(보물점방) | 서울특별시 강남구 역삼동 747번지 24호 지하1층 | 20070714 | 처분확정 | 영업정지 1월갈음 과징금 10,800,000원 부과/시설개수명령(즉시) | <NA> | 식품위생법 제57조,제58조,제65조 | 20070714 | 일반음식점 영업장에 손님이 이용할 수 있는 자막용영상장치 설치/일반음식점 영업자가 손님이 노래부르도록 허용 | 영업정지 1월갈음 과징금 10,800,000원 부과/시설개수명령(즉시) | <NA> | 수시
14441 | 단란주점 | 단란주점 |  | 서울특별시 강남구 역삼동 823번지 8호 | 20120426 | 처분확정 | 과태료30만원 부과 및 시정명령 | <NA> | 식품위생법 | 20120426 | 허가증업소에 미보관 상호변경신고 미실시(외부간판에 표기된 상호가 다름) | 과태료30만원 부과 및 시정명령 | <NA> | 기타
24675 | 일반음식점 | 한식 | 을지로골뱅이와진짜생맥주 | 서울특별시 강남구 대치동 988번지 2호 지상1층 | 20061212 | 처분확정 | 영업소폐쇄(직권폐업-12.29일자) | <NA> | 식품위생법 제58조 | 20061212 | 시설물 멸실 | 영업소폐쇄(직권폐업-12.29일자) | <NA> | 수시
18683 | 일반음식점 | 뷔페식 | 화운틴 | 서울특별시 강남구 역삼동 604번지 11호 1층 | 20130414 | 처분확정 | 영업소폐쇄(2013.06.22) | <NA> | 식품위생법 | 20130414 | 영업장소 이전신고 미이행 | 영업소폐쇄(2013.06.22) | <NA> | 기타
23321 | 일반음식점 | 경양식 | 스타플레이어 | 서울특별시 강남구 청담동 84번지 7호 지상1층 | 20050113 | 처분확정 | 영업소폐쇄 | <NA> | 식품위생법 제58조 | 20050113 | 무단폐업 및 시설물멸실 | 영업소폐쇄 | <NA> | 수시
24836 | 일반음식점 | 한식 | 미리네 맛집 | 서울특별시 강남구 일원동 659번지 6호 지상1층 | 20140305 | 처분확정 | 과태료20만원(2014.3.22납부) | <NA> | 식품위생법 제71조 및 제75조 | 20140305 | 위생모미착용 | 과태료20만원(2014.3.22납부) | <NA> | 기타
10956 | 일반음식점 | 경양식 | 엘든(ELDEN) | 서울특별시 강남구 신사동 661번지 9호 | 20210110 | 처분확정 | 영업정지1개월(2021.5.28.~6.26.) | <NA> | 법 제71조 및 법 제75조 | 20210110 | 손님이 춤을 추는 것을 허용(1차2회) | 영업정지1개월(2021.5.28.~6.26.) | <NA> | 수시
25673 | 일반음식점 | 한식 | 진진바라 | 서울특별시 강남구 역삼동 708번지 20호 지하2층 B201,B202,B203 | 20090310 | 처분확정 | 시정명령(09.06.19한) | <NA> | 식품위생법 | 20090310 | 원산지 등 서류 미보관 | 시정명령(09.06.19한) | <NA> | 기타

Index | 업종명 | 업태명 | 업소명 | 소재지지번 | 지도점검일자 | 행정처분상태 | 처분명 | 위반내역분류 | 법적근거 | 위반일자 | 위반내용 | 처분내용 | 처분기간 | 적발구분
32533 | 단란주점 | 단란주점 | 지빠 | 서울특별시 강남구 삼성동 34번지 3호 지하1층 | 20020904 | 처분확정 | 영업정지3월(2003.1.30-4.29) | <NA> | 식품위생법제31조 | 20020904 | 유흥접객원고용 | 영업정지3월(2003.1.30-4.29) | <NA> | 수시
21105 | 일반음식점 | 분식 | 유가네칼국수 | 서울특별시 강남구 대치동 1008번지 3호 | 19981112 | 처분확정 | 시정명령 | <NA> | 식품위생법 | 19981212 | 허가증미게시 | 시정명령 | <NA> | 수시
4391 | 일반음식점 | 한식 | 에스엠 | 서울특별시 강남구 신사동 659번지 14호 지하1 | 19990620 | 처분확정 | 영업정지2월(99.7.30-9.29) | <NA> | 식품위생법 | 19990720 | 미성년자주류제공(99.5.14) | 영업정지2월(99.7.30-9.29) | <NA> | 수시
20645 | 일반음식점 | 분식 | 안주나라 | 서울특별시 강남구 대치동 897번지 3호 | 20021004 | 처분확정 | 영업정지2월(2002.12.5-2003.2.4) | <NA> | 식품위생법제31조 | 20021004 | 청소년주류제공 | 영업정지2월(2002.12.5-2003.2.4) | <NA> | 수시
6453 | 일반음식점 | 경양식 | 펍아일랜드 | 서울특별시 강남구 신사동 663번지 20호 지상2층 | 20120719 | 처분확정 | 영업소폐쇄(2012.09.10) | <NA> | 식품위생법 | 20120719 | 영업소 멸실 | 영업소폐쇄(2012.09.10) | <NA> | 기타
23588 | 일반음식점 | 일식 | 스시히로바 | 서울특별시 강남구 삼성동 70번지 0호 지상1,2층 | 20080520 | 처분확정 | 과태료부과30만원(2008.8.1한) | <NA> | 식품위생법 | 20080520 | 수족관물 부적합 판정 | 과태료부과30만원(2008.8.1한) | <NA> | 기타
16997 | 건강기능식품일반판매업 | 전자상거래(통신판매업) | (주)인티머스 | 서울특별시 강남구 대치동 968번지 6호 중부빌딩지상4층 | 20061128 | 처분확정 | 영업소폐쇄 | <NA> | 4 | 20061128 | 시설물멸실 | 영업소폐쇄 | <NA> | 수시
12959 | 단란주점 | 단란주점 | 에스더 | 서울특별시 강남구 역삼동 673번지 31호 | 19980725 | 처분확정 | 강남서유선통보8/26 | <NA> | 식품위생법 | 19980825 | 동석작배 | 강남서유선통보8/26 | <NA> | 수시
10010 | 일반음식점 | 경양식 | 르블랑 | 서울특별시 강남구 청담동 80번지 6호 지하1층 | 20210306 | 처분확정 | 시설개수명령(2022.2.10. 이행완료) ※담당자:이영준 | <NA> | 법 제71조, 법 제74조 및 법 제75조 | 20210306 | 영업장에 자동반주장치 설치 | 시설개수명령(2022.2.10. 이행완료) ※담당자:이영준 | <NA> | 기타
18070 | 일반음식점 | 경양식 | 엠투 | 서울특별시 강남구 논현동 88번지 7호 | 19950228 | 처분확정 | 시정지시 | <NA> | 식품위생법 | 19950331 | 보건증미소지(1차,2/7) | 시정지시 | <NA> | 수시
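
In the sample rows, 지도점검일자 and 위반일자 appear as eight-digit values that look like YYYYMMDD dates. Assuming that encoding holds for the column as a whole, a small sketch of converting them to real dates before doing any date arithmetic follows; `parse_yyyymmdd` is an illustrative helper name.

```python
from datetime import date, datetime

def parse_yyyymmdd(value):
    """Convert an integer or string such as 19981112 to a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

# Values taken from sample row 21105.
inspected = parse_yyyymmdd(19981112)   # 지도점검일자
violated = parse_yyyymmdd(19981212)    # 위반일자

lag_days = (violated - inspected).days
print(inspected, violated, lag_days)
```

Treating these columns as plain numbers makes the gap between 19981112 and 19981212 look like 100, whereas the calendar gap is 30 days, so parsing first seems the safer choice for any analysis of intervals.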

Duplicate rows

Most frequently occurring

Index | 업종명 | 업태명 | 업소명 | 소재지지번 | 지도점검일자 | 행정처분상태 | 처분명 | 법적근거 | 위반일자 | 위반내용 | 처분내용 | 처분기간 | 적발구분 | # duplicates
91 | 유통전문판매업 | 유통전문판매업 | 엘크로파낙스(주) | 서울특별시 강남구 역삼동 702번지 2호 삼성제일빌딩 | 20190618 | 처분확정 | 시정명령 | 식품 등의 표시?광고에 관한 법률 | 20190618 | 제8조3항 및 제8조4항 위반 | 시정명령 | <NA> | 기타 | 4
92 | 유통전문판매업 | 유통전문판매업 | 엘크로파낙스(주) | 서울특별시 강남구 역삼동 702번지 2호 삼성제일빌딩 | 20190618 | 처분확정 | 영업정지7일 | 법 제14조부터 제17조까지 | 20190618 | 거짓?과장된 표시 또는 광고 - 포장용기(캔)에 GINSENO SIDE Rg1+Rb1+Rg3 = 12mg/g”으로 표시하여, 원료 홍삼농축액의 진세노사이드 함량과 배합비율로 계산된 ‘엘크로’제품 1캔의 진세노사이드 함량인 “진세노사이드 Rg1+Rb1+Rg3 = 0.09mg/g”과 다르게 표시 | 영업정지7일 | 7 | 기타 | 4
206 | 일반음식점 | 경양식 | 슈가클럽 | 서울특별시 강남구 신사동 631번지 34호 지하1,2층 | 20080827 | 처분확정 | 시설개수명령및 시정명령 | 식품위생법 제 57조 | 20080827 | 단란주점 형태영업음향기기 설치 | 시설개수명령및 시정명령 | <NA> | 기타 | 4
217 | 일반음식점 | 경양식 | 안개자니 | 서울특별시 강남구 대치동 961번지 11호 지하1층 | 20050803 | 처분확정 | (변경처분)영업정지 1월(05.11.1 ~ 11.30) | 식품위생법 제58조 | 20050803 | 단란형태영업,자동영상반주기설치 | (변경처분)영업정지 1월(05.11.1 ~ 11.30) | <NA> | 수시 | 4
262 | 일반음식점 | 김밥(도시락) | 락캔롤 | 서울특별시 강남구 논현동 209번지 0호 지상1층108호 | 20080527 | 처분확정 | 영업정지7일(08.9.11~9.17)변경처분-과징금252만원 미납 | 식품위생법 | 20080527 | 영업장외 테이블영업 | 영업정지7일(08.9.11~9.17)변경처분-과징금252만원 미납 | 7 | 수시 | 4
358 | 일반음식점 | 한식 | 독도는우리땅 | 서울특별시 강남구 논현동 235번지 10호 지상1층 | 20040707 | 처분확정 | 영업정지15일(04.8.27-9.10) | 식품위생법 58조 | 20040707 | 영업장무단확장 | 영업정지15일(04.8.27-9.10) | 15 | 수시 | 4
359 | 일반음식점 | 한식 | 독도는우리땅 | 서울특별시 강남구 논현동 235번지 10호 지상1층 | 20040707 | 처분확정 | 영업정지7일 및 과징금 200만원부과(04.8.27-9.2) | 식품위생법 58조 | 20040707 | 영업장무단확장 | 영업정지7일 및 과징금 200만원부과(04.8.27-9.2) | 7 | 수시 | 4
445 | 일반음식점 | 호프/통닭 | 전주시원이콩나물국밥 | 서울특별시 강남구 삼성동 153번지 64호 지상1층 | 20070704 | 처분확정 | 과태료20만원 부과/시정명령(07.9.17까지) | 식품위생법 제31조, 제3조 | 20070704 | 신고된 상호와 간판 상이 표기/조리원 위생모 미착용 | 과태료20만원 부과/시정명령(07.9.17까지) | <NA> | 수시 | 4
3 | 건강기능식품유통전문판매업 | 건강기능식품유통전문판매업 | (주)스카이글로벌스 | 서울특별시 강남구 개포동 1229번지 11호 | 20190101 | 처분확정 | 과태20만원부과 | 법 제47조제1항제6호 | 20190101 | 2018년 위생교육 미이수 | 과태20만원부과 | <NA> | 기타 | 3
7 | 건강기능식품유통전문판매업 | 건강기능식품유통전문판매업 | 유한회사 스노우볼컴퍼니 | 서울특별시 강남구 역삼동 646번지 20호 | 20200629 | 처분확정 | 품목제조정지15일 | 식품 등의 표시?광고에 관한 법률 | 20200629 | 심의결과에 따르지 않은 광고(1차) | 품목제조정지15일 | 15 | 기타 | 3
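
A duplicate-row listing like this one can be approximated by counting identical records. The sketch below assumes a duplicate means every compared field matches exactly; the rows are shortened to three fields and their repeat counts are made up for illustration, so they do not reproduce the table's figures.

```python
from collections import Counter

# Hypothetical, shortened records: (업소명, 위반일자, 처분내용).
rows = [
    ("독도는우리땅", 20040707, "영업정지15일(04.8.27-9.10)"),
    ("독도는우리땅", 20040707, "영업정지15일(04.8.27-9.10)"),
    ("독도는우리땅", 20040707, "영업정지7일 및 과징금 200만원부과(04.8.27-9.2)"),
    ("슈가클럽", 20080827, "시설개수명령및 시정명령"),
    ("슈가클럽", 20080827, "시설개수명령및 시정명령"),
    ("슈가클럽", 20080827, "시설개수명령및 시정명령"),
]

counts = Counter(rows)

# Keep only records that occur more than once, most frequent first.
duplicated = [(row, n) for row, n in counts.most_common() if n > 1]

# Number of redundant copies that deduplication would drop.
extra_copies = sum(n - 1 for _, n in duplicated)

for row, n in duplicated:
    print(n, row)
print(extra_copies)
```

Note that two records for the same business and date still count as distinct here when their 처분내용 differs, which mirrors how rows 358 and 359 are listed separately.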