Overview

Dataset statistics

Number of variables17
Number of observations10000
Missing cells42840
Missing cells (%)25.2%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory1.4 MiB
Average record size in memory146.0 B

Variable types

Categorical4
DateTime6
Text5
Numeric2

Dataset

Description대전광역시 2021년 12월 기준 인허가업소 정보입니다. 행정안전부에서 제공하는 190여개 업소정보의 공통항목만 추출하여, 하나의 파일로 합친 자료입니다. 2021년 공공데이터 기업매칭지원사업으로 추진하였습니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15097221/fileData.do

Alerts

데이터갱신구분 has constant value ""Constant
데이터갱신일자 has constant value ""Constant
Dataset has 1 (< 0.1%) duplicate rowsDuplicates
영업상태명 is highly overall correlated with 상세영업상태명High correlation
상세영업상태명 is highly overall correlated with 개방자치단체명 and 1 other fieldsHigh correlation
개방자치단체명 is highly overall correlated with 상세영업상태명High correlation
상세영업상태명 is highly imbalanced (54.0%)Imbalance
인허가취소일자 has 9788 (97.9%) missing valuesMissing
폐업일자 has 3103 (31.0%) missing valuesMissing
휴업시작일자 has 9976 (99.8%) missing valuesMissing
휴업종료일자 has 9976 (99.8%) missing valuesMissing
재개업일자 has 9997 (> 99.9%) missing valuesMissing
좌표정보(x) has 895 (8.9%) zerosZeros
좌표정보(y) has 895 (8.9%) zerosZeros

Reproduction

Analysis started2023-12-12 09:33:02.051280
Analysis finished2023-12-12 09:33:05.605856
Duration3.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동구
4007 
대덕구
3014 
서구
2978 
대전광역시
 
1

Length

Max length5
Median length2
Mean length2.3017
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row대덕구
2nd row대덕구
3rd row동구
4th row서구
5th row동구

Common Values

ValueCountFrequency (%)
동구 4007
40.1%
대덕구 3014
30.1%
서구 2978
29.8%
대전광역시 1
 
< 0.1%

Length

2023-12-12T18:33:05.702899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:33:05.830210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동구 4007
40.1%
대덕구 3014
30.1%
서구 2978
29.8%
대전광역시 1
 
< 0.1%

데이터갱신구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
신규생성
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규생성
2nd row신규생성
3rd row신규생성
4th row신규생성
5th row신규생성

Common Values

ValueCountFrequency (%)
신규생성 10000
100.0%

Length

2023-12-12T18:33:05.975402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:33:06.086853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규생성 10000
100.0%

데이터갱신일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 00:00:00
Maximum2018-08-31 00:00:00
2023-12-12T18:33:06.186435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:06.291091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
Distinct97
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:33:06.559493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length5
Mean length5.8031
Min length2

Characters and Unicode

Total characters58031
Distinct characters159
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)0.1%

Sample

1st row동물병원
2nd row일반음식점
3rd row의원
4th row일반음식점
5th row즉석판매제조가공업
ValueCountFrequency (%)
일반음식점 2417
24.0%
담배소매업 1227
12.2%
통신판매업 850
 
8.4%
즉석판매제조가공업 702
 
7.0%
미용업 524
 
5.2%
휴게음식점 464
 
4.6%
식품자동판매기업 379
 
3.8%
건강기능식품일반판매업 347
 
3.4%
축산판매업 269
 
2.7%
방문판매업 208
 
2.1%
Other values (88) 2694
26.7%
2023-12-12T18:33:07.145303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6604
 
11.4%
4546
 
7.8%
4044
 
7.0%
3404
 
5.9%
2988
 
5.1%
2882
 
5.0%
2876
 
5.0%
2862
 
4.9%
1726
 
3.0%
1269
 
2.2%
Other values (149) 24830
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57514
99.1%
Open Punctuation 212
 
0.4%
Close Punctuation 212
 
0.4%
Space Separator 81
 
0.1%
Other Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6604
 
11.5%
4546
 
7.9%
4044
 
7.0%
3404
 
5.9%
2988
 
5.2%
2882
 
5.0%
2876
 
5.0%
2862
 
5.0%
1726
 
3.0%
1269
 
2.2%
Other values (144) 24313
42.3%
Other Punctuation
ValueCountFrequency (%)
· 7
58.3%
/ 5
41.7%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 212
100.0%
Space Separator
ValueCountFrequency (%)
81
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57514
99.1%
Common 517
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6604
 
11.5%
4546
 
7.9%
4044
 
7.0%
3404
 
5.9%
2988
 
5.2%
2882
 
5.0%
2876
 
5.0%
2862
 
5.0%
1726
 
3.0%
1269
 
2.2%
Other values (144) 24313
42.3%
Common
ValueCountFrequency (%)
( 212
41.0%
) 212
41.0%
81
 
15.7%
· 7
 
1.4%
/ 5
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57514
99.1%
ASCII 510
 
0.9%
None 7
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6604
 
11.5%
4546
 
7.9%
4044
 
7.0%
3404
 
5.9%
2988
 
5.2%
2882
 
5.0%
2876
 
5.0%
2862
 
5.0%
1726
 
3.0%
1269
 
2.2%
Other values (144) 24313
42.3%
ASCII
ValueCountFrequency (%)
( 212
41.6%
) 212
41.6%
81
 
15.9%
/ 5
 
1.0%
None
ValueCountFrequency (%)
· 7
100.0%
Distinct9288
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:33:07.494830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length6.0942
Min length1

Characters and Unicode

Total characters60942
Distinct characters1031
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8768 ?
Unique (%)87.7%

Sample

1st row둘리동물병원
2nd row일광식당
3rd row고려내과의원
4th row임페리얼식당
5th row삼영상회
ValueCountFrequency (%)
주식회사 62
 
0.6%
노래연습장 45
 
0.4%
식당 23
 
0.2%
주)참푸드 22
 
0.2%
gs25 20
 
0.2%
씨유 18
 
0.2%
pc방 16
 
0.1%
더원씨푸드 14
 
0.1%
14
 
0.1%
한아름농특산 14
 
0.1%
Other values (9730) 10748
97.7%
2023-12-12T18:33:08.053325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1549
 
2.5%
1248
 
2.0%
1241
 
2.0%
1011
 
1.7%
999
 
1.6%
948
 
1.6%
938
 
1.5%
859
 
1.4%
848
 
1.4%
823
 
1.4%
Other values (1021) 50478
82.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55895
91.7%
Uppercase Letter 1044
 
1.7%
Space Separator 999
 
1.6%
Close Punctuation 804
 
1.3%
Open Punctuation 791
 
1.3%
Decimal Number 743
 
1.2%
Lowercase Letter 535
 
0.9%
Other Punctuation 95
 
0.2%
Dash Punctuation 17
 
< 0.1%
Other Symbol 13
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1549
 
2.8%
1248
 
2.2%
1241
 
2.2%
1011
 
1.8%
948
 
1.7%
938
 
1.7%
859
 
1.5%
848
 
1.5%
823
 
1.5%
669
 
1.2%
Other values (938) 45761
81.9%
Uppercase Letter
ValueCountFrequency (%)
C 141
13.5%
S 127
12.2%
P 117
 
11.2%
G 101
 
9.7%
L 43
 
4.1%
I 41
 
3.9%
O 40
 
3.8%
A 40
 
3.8%
E 40
 
3.8%
T 39
 
3.7%
Other values (16) 315
30.2%
Lowercase Letter
ValueCountFrequency (%)
e 64
 
12.0%
o 58
 
10.8%
n 43
 
8.0%
a 41
 
7.7%
l 38
 
7.1%
i 33
 
6.2%
r 28
 
5.2%
m 25
 
4.7%
s 24
 
4.5%
c 21
 
3.9%
Other values (14) 160
29.9%
Other Punctuation
ValueCountFrequency (%)
. 48
50.5%
& 18
 
18.9%
, 17
 
17.9%
: 3
 
3.2%
* 3
 
3.2%
# 1
 
1.1%
1
 
1.1%
! 1
 
1.1%
/ 1
 
1.1%
· 1
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 236
31.8%
5 141
19.0%
0 105
14.1%
1 84
 
11.3%
4 60
 
8.1%
3 44
 
5.9%
7 29
 
3.9%
8 16
 
2.2%
9 15
 
2.0%
6 13
 
1.7%
Letter Number
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 803
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 790
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
999
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Other Symbol
ValueCountFrequency (%)
13
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55906
91.7%
Common 3451
 
5.7%
Latin 1583
 
2.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1549
 
2.8%
1248
 
2.2%
1241
 
2.2%
1011
 
1.8%
948
 
1.7%
938
 
1.7%
859
 
1.5%
848
 
1.5%
823
 
1.5%
669
 
1.2%
Other values (937) 45772
81.9%
Latin
ValueCountFrequency (%)
C 141
 
8.9%
S 127
 
8.0%
P 117
 
7.4%
G 101
 
6.4%
e 64
 
4.0%
o 58
 
3.7%
L 43
 
2.7%
n 43
 
2.7%
a 41
 
2.6%
I 41
 
2.6%
Other values (43) 807
51.0%
Common
ValueCountFrequency (%)
999
28.9%
) 803
23.3%
( 790
22.9%
2 236
 
6.8%
5 141
 
4.1%
0 105
 
3.0%
1 84
 
2.4%
4 60
 
1.7%
. 48
 
1.4%
3 44
 
1.3%
Other values (19) 141
 
4.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55892
91.7%
ASCII 5028
 
8.3%
None 15
 
< 0.1%
Number Forms 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1549
 
2.8%
1248
 
2.2%
1241
 
2.2%
1011
 
1.8%
948
 
1.7%
938
 
1.7%
859
 
1.5%
848
 
1.5%
823
 
1.5%
669
 
1.2%
Other values (935) 45758
81.9%
ASCII
ValueCountFrequency (%)
999
19.9%
) 803
16.0%
( 790
15.7%
2 236
 
4.7%
C 141
 
2.8%
5 141
 
2.8%
S 127
 
2.5%
P 117
 
2.3%
0 105
 
2.1%
G 101
 
2.0%
Other values (67) 1468
29.2%
None
ValueCountFrequency (%)
13
86.7%
1
 
6.7%
· 1
 
6.7%
Number Forms
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct6907
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:33:08.451863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length44
Mean length20.2213
Min length8

Characters and Unicode

Total characters202213
Distinct characters404
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6220 ?
Unique (%)62.2%

Sample

1st row대전광역시 대덕구 중리동 391-5번지
2nd row대전광역시 대덕구 상서동 411번지 1층
3rd row대전광역시 동구 성남2동 56번지5호
4th row대전광역시 서구 도마동 127-12번지
5th row대전광역시 동구 정동 ,
ValueCountFrequency (%)
대전광역시 9997
22.8%
동구 3882
 
8.8%
대덕구 2993
 
6.8%
서구 2986
 
6.8%
번지 1346
 
3.1%
1116
 
2.5%
1층 721
 
1.6%
가양동 625
 
1.4%
갈마동 590
 
1.3%
도마동 576
 
1.3%
Other values (5803) 19091
43.5%
2023-12-12T18:33:09.157395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36118
17.9%
14295
 
7.1%
13722
 
6.8%
10605
 
5.2%
10048
 
5.0%
10036
 
5.0%
10004
 
4.9%
10000
 
4.9%
7627
 
3.8%
1 7409
 
3.7%
Other values (394) 72349
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 130430
64.5%
Space Separator 36118
 
17.9%
Decimal Number 29077
 
14.4%
Dash Punctuation 4354
 
2.2%
Open Punctuation 791
 
0.4%
Close Punctuation 791
 
0.4%
Other Punctuation 435
 
0.2%
Uppercase Letter 206
 
0.1%
Math Symbol 7
 
< 0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14295
11.0%
13722
10.5%
10605
 
8.1%
10048
 
7.7%
10036
 
7.7%
10004
 
7.7%
10000
 
7.7%
7627
 
5.8%
7070
 
5.4%
3127
 
2.4%
Other values (352) 33896
26.0%
Uppercase Letter
ValueCountFrequency (%)
B 46
22.3%
G 41
19.9%
S 39
18.9%
A 39
18.9%
C 9
 
4.4%
T 7
 
3.4%
K 6
 
2.9%
F 3
 
1.5%
D 3
 
1.5%
O 2
 
1.0%
Other values (9) 11
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 7409
25.5%
2 3806
13.1%
3 3159
10.9%
4 2911
 
10.0%
0 2427
 
8.3%
5 2089
 
7.2%
6 2062
 
7.1%
7 1757
 
6.0%
8 1742
 
6.0%
9 1715
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 304
69.9%
@ 78
 
17.9%
/ 36
 
8.3%
. 15
 
3.4%
& 2
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
a 2
66.7%
c 1
33.3%
Space Separator
ValueCountFrequency (%)
36118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4354
100.0%
Open Punctuation
ValueCountFrequency (%)
( 791
100.0%
Close Punctuation
ValueCountFrequency (%)
) 791
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 130430
64.5%
Common 71574
35.4%
Latin 209
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14295
11.0%
13722
10.5%
10605
 
8.1%
10048
 
7.7%
10036
 
7.7%
10004
 
7.7%
10000
 
7.7%
7627
 
5.8%
7070
 
5.4%
3127
 
2.4%
Other values (352) 33896
26.0%
Common
ValueCountFrequency (%)
36118
50.5%
1 7409
 
10.4%
- 4354
 
6.1%
2 3806
 
5.3%
3 3159
 
4.4%
4 2911
 
4.1%
0 2427
 
3.4%
5 2089
 
2.9%
6 2062
 
2.9%
7 1757
 
2.5%
Other values (11) 5482
 
7.7%
Latin
ValueCountFrequency (%)
B 46
22.0%
G 41
19.6%
S 39
18.7%
A 39
18.7%
C 9
 
4.3%
T 7
 
3.3%
K 6
 
2.9%
F 3
 
1.4%
D 3
 
1.4%
O 2
 
1.0%
Other values (11) 14
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 130430
64.5%
ASCII 71783
35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
36118
50.3%
1 7409
 
10.3%
- 4354
 
6.1%
2 3806
 
5.3%
3 3159
 
4.4%
4 2911
 
4.1%
0 2427
 
3.4%
5 2089
 
2.9%
6 2062
 
2.9%
7 1757
 
2.4%
Other values (32) 5691
 
7.9%
Hangul
ValueCountFrequency (%)
14295
11.0%
13722
10.5%
10605
 
8.1%
10048
 
7.7%
10036
 
7.7%
10004
 
7.7%
10000
 
7.7%
7627
 
5.8%
7070
 
5.4%
3127
 
2.4%
Other values (352) 33896
26.0%
Distinct3438
Distinct (%)34.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:33:09.640066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length1
Mean length2.7927
Min length1

Characters and Unicode

Total characters27927
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2626 ?
Unique (%)26.3%

Sample

1st row50.81
2nd row40.15
3rd row0
4th row21.06
5th row44.8
ValueCountFrequency (%)
0 4995
50.0%
1 25
 
0.2%
33 23
 
0.2%
10 17
 
0.2%
18 16
 
0.2%
4 15
 
0.1%
32 15
 
0.1%
2 15
 
0.1%
26.4 14
 
0.1%
8 13
 
0.1%
Other values (3428) 4852
48.5%
2023-12-12T18:33:10.275852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6012
21.5%
. 4513
16.2%
2 2616
9.4%
1 2578
9.2%
3 2091
 
7.5%
4 1954
 
7.0%
5 1792
 
6.4%
6 1746
 
6.3%
8 1659
 
5.9%
7 1478
 
5.3%
Other values (2) 1488
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 23394
83.8%
Other Punctuation 4533
 
16.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6012
25.7%
2 2616
11.2%
1 2578
11.0%
3 2091
 
8.9%
4 1954
 
8.4%
5 1792
 
7.7%
6 1746
 
7.5%
8 1659
 
7.1%
7 1478
 
6.3%
9 1468
 
6.3%
Other Punctuation
ValueCountFrequency (%)
. 4513
99.6%
, 20
 
0.4%

Most occurring scripts

ValueCountFrequency (%)
Common 27927
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6012
21.5%
. 4513
16.2%
2 2616
9.4%
1 2578
9.2%
3 2091
 
7.5%
4 1954
 
7.0%
5 1792
 
6.4%
6 1746
 
6.3%
8 1659
 
5.9%
7 1478
 
5.3%
Other values (2) 1488
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27927
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6012
21.5%
. 4513
16.2%
2 2616
9.4%
1 2578
9.2%
3 2091
 
7.5%
4 1954
 
7.0%
5 1792
 
6.4%
6 1746
 
6.3%
8 1659
 
5.9%
7 1478
 
5.3%
Other values (2) 1488
 
5.3%
Distinct5340
Distinct (%)53.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:33:10.631873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9992
Min length6

Characters and Unicode

Total characters99992
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2701 ?
Unique (%)27.0%

Sample

1st row2007-10-18
2nd row2017-09-12
3rd row1993-02-20
4th row1997-05-19
5th row2008-02-28
ValueCountFrequency (%)
1994-12-01 21
 
0.2%
1995-01-05 19
 
0.2%
2004-06-18 16
 
0.2%
1998-12-15 14
 
0.1%
2003-10-22 9
 
0.1%
2012-11-16 9
 
0.1%
1998-12-18 8
 
0.1%
2001-12-28 8
 
0.1%
2002-12-03 8
 
0.1%
2009-07-15 8
 
0.1%
Other values (5330) 9880
98.8%
2023-12-12T18:33:11.153394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 25137
25.1%
- 19996
20.0%
1 15484
15.5%
2 14585
14.6%
9 6759
 
6.8%
3 3436
 
3.4%
8 3201
 
3.2%
4 2988
 
3.0%
7 2916
 
2.9%
6 2800
 
2.8%
Other values (4) 2690
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79984
80.0%
Dash Punctuation 19996
 
20.0%
Lowercase Letter 12
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25137
31.4%
1 15484
19.4%
2 14585
18.2%
9 6759
 
8.5%
3 3436
 
4.3%
8 3201
 
4.0%
4 2988
 
3.7%
7 2916
 
3.6%
6 2800
 
3.5%
5 2678
 
3.3%
Lowercase Letter
ValueCountFrequency (%)
l 8
66.7%
n 2
 
16.7%
u 2
 
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 19996
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99980
> 99.9%
Latin 12
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 25137
25.1%
- 19996
20.0%
1 15484
15.5%
2 14585
14.6%
9 6759
 
6.8%
3 3436
 
3.4%
8 3201
 
3.2%
4 2988
 
3.0%
7 2916
 
2.9%
6 2800
 
2.8%
Latin
ValueCountFrequency (%)
l 8
66.7%
n 2
 
16.7%
u 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99992
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25137
25.1%
- 19996
20.0%
1 15484
15.5%
2 14585
14.6%
9 6759
 
6.8%
3 3436
 
3.4%
8 3201
 
3.2%
4 2988
 
3.0%
7 2916
 
2.9%
6 2800
 
2.8%
Other values (4) 2690
 
2.7%

인허가취소일자
Date

MISSING 

Distinct127
Distinct (%)59.9%
Missing9788
Missing (%)97.9%
Memory size156.2 KiB
Minimum2000-01-14 00:00:00
Maximum2018-06-27 00:00:00
2023-12-12T18:33:11.318707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:11.488828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

폐업일자
Date

MISSING 

Distinct3356
Distinct (%)48.7%
Missing3103
Missing (%)31.0%
Memory size156.2 KiB
Minimum1995-12-29 00:00:00
Maximum2018-09-03 00:00:00
2023-12-12T18:33:11.679345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:11.823114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct23
Distinct (%)95.8%
Missing9976
Missing (%)99.8%
Memory size156.2 KiB
Minimum2003-05-19 00:00:00
Maximum2018-03-30 00:00:00
2023-12-12T18:33:11.978630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:12.412253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)

휴업종료일자
Date

MISSING 

Distinct24
Distinct (%)100.0%
Missing9976
Missing (%)99.8%
Memory size156.2 KiB
Minimum2004-05-18 00:00:00
Maximum2019-03-29 00:00:00
2023-12-12T18:33:12.611886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:12.819604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)

재개업일자
Date

MISSING 

Distinct3
Distinct (%)100.0%
Missing9997
Missing (%)> 99.9%
Memory size156.2 KiB
Minimum2006-02-13 00:00:00
Maximum2016-11-16 00:00:00
2023-12-12T18:33:13.014757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:13.179043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
6898 
영업/정상
2598 
취소/말소/만료/정지/중지
 
500
휴업
 
4

Length

Max length14
Median length2
Mean length3.3794
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row영업/정상
3rd row영업/정상
4th row폐업
5th row영업/정상

Common Values

ValueCountFrequency (%)
폐업 6898
69.0%
영업/정상 2598
 
26.0%
취소/말소/만료/정지/중지 500
 
5.0%
휴업 4
 
< 0.1%

Length

2023-12-12T18:33:13.351324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:33:13.501788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 6898
69.0%
영업/정상 2598
 
26.0%
취소/말소/만료/정지/중지 500
 
5.0%
휴업 4
 
< 0.1%

상세영업상태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
5569 
폐업처리
1297 
영업
1253 
정상영업
645 
영업중
 
548
Other values (22)
688 

Length

Max length8
Median length2
Mean length2.5482
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row폐업
2nd row영업
3rd row영업중
4th row폐업
5th row영업

Common Values

ValueCountFrequency (%)
폐업 5569
55.7%
폐업처리 1297
 
13.0%
영업 1253
 
12.5%
정상영업 645
 
6.5%
영업중 548
 
5.5%
직권말소 226
 
2.3%
직권취소 141
 
1.4%
정상 106
 
1.1%
지정취소 44
 
0.4%
등록취소 32
 
0.3%
Other values (17) 139
 
1.4%

Length

2023-12-12T18:33:13.655809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
폐업 5569
55.7%
폐업처리 1297
 
13.0%
영업 1253
 
12.5%
정상영업 645
 
6.5%
영업중 548
 
5.5%
직권말소 226
 
2.3%
직권취소 141
 
1.4%
정상 106
 
1.1%
지정취소 44
 
0.4%
등록취소 32
 
0.3%
Other values (17) 139
 
1.4%

좌표정보(x)
Real number (ℝ)

ZEROS 

Distinct5986
Distinct (%)59.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean216058.43
Minimum0
Maximum247647.58
Zeros895
Zeros (%)8.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:33:13.855456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1233815.96
median238076.67
Q3239329.4
95-th percentile241058.04
Maximum247647.58
Range247647.58
Interquartile range (IQR)5513.4391

Descriptive statistics

Standard deviation67807.558
Coefficient of variation (CV)0.31383898
Kurtosis6.2409657
Mean216058.43
Median Absolute Deviation (MAD)2113.8411
Skewness-2.8666065
Sum2.1605843 × 109
Variance4.5978649 × 109
MonotonicityNot monotonic
2023-12-12T18:33:14.085137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 895
 
8.9%
234901.1939 113
 
1.1%
234850.9449 59
 
0.6%
241064.7676 49
 
0.5%
239185.9009 40
 
0.4%
239596.3861 39
 
0.4%
233873.026 36
 
0.4%
239122.6384 34
 
0.3%
236019.8935 32
 
0.3%
233907.1178 29
 
0.3%
Other values (5976) 8674
86.7%
ValueCountFrequency (%)
0.0 895
8.9%
180363.59 1
 
< 0.1%
187325.764 1
 
< 0.1%
228281.8169 1
 
< 0.1%
228301.1349 1
 
< 0.1%
228809.5045 1
 
< 0.1%
228899.315 1
 
< 0.1%
229196.6951 1
 
< 0.1%
229324.5601 1
 
< 0.1%
229332.57 6
 
0.1%
ValueCountFrequency (%)
247647.5767 1
 
< 0.1%
246242.4175 1
 
< 0.1%
245081.354 1
 
< 0.1%
244624.4945 1
 
< 0.1%
244202.0828 1
 
< 0.1%
244152.2747 1
 
< 0.1%
243973.8127 1
 
< 0.1%
243917.3439 3
< 0.1%
243313.6517 1
 
< 0.1%
243201.532 1
 
< 0.1%

좌표정보(y)
Real number (ℝ)

ZEROS 

Distinct5984
Distinct (%)59.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean288082.19
Minimum0
Maximum463532.15
Zeros895
Zeros (%)8.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:33:14.339424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1313575.98
median315759.27
Q3317226.47
95-th percentile326786.66
Maximum463532.15
Range463532.15
Interquartile range (IQR)3650.4894

Descriptive statistics

Standard deviation90416.27
Coefficient of variation (CV)0.31385581
Kurtosis6.2409211
Mean288082.19
Median Absolute Deviation (MAD)1658.6211
Skewness-2.8652063
Sum2.8808219 × 109
Variance8.1751018 × 109
MonotonicityNot monotonic
2023-12-12T18:33:14.566170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 895
 
8.9%
315605.266 113
 
1.1%
317347.418 59
 
0.6%
311893.458 49
 
0.5%
317088.4126 40
 
0.4%
314379.0917 39
 
0.4%
317345.7572 36
 
0.4%
316705.6937 34
 
0.3%
318423.1792 32
 
0.3%
316901.2614 29
 
0.3%
Other values (5974) 8674
86.7%
ValueCountFrequency (%)
0.0 895
8.9%
300961.1738 1
 
< 0.1%
301134.3525 1
 
< 0.1%
302458.8731 1
 
< 0.1%
302471.5959 1
 
< 0.1%
302520.8864 1
 
< 0.1%
302559.2281 1
 
< 0.1%
302616.6748 1
 
< 0.1%
302668.9934 1
 
< 0.1%
302774.324 1
 
< 0.1%
ValueCountFrequency (%)
463532.15 1
< 0.1%
422755.9488 1
< 0.1%
348461.001 1
< 0.1%
330537.7066 1
< 0.1%
330402.621 1
< 0.1%
328822.0261 1
< 0.1%
328398.7423 1
< 0.1%
328388.502 1
< 0.1%
328273.6531 1
< 0.1%
328225.9164 1
< 0.1%

Interactions

2023-12-12T18:33:04.727332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:04.490449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:04.864981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:33:04.597514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:33:14.692268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개방자치단체명개방서비스명휴업시작일자휴업종료일자재개업일자영업상태명상세영업상태명좌표정보(x)좌표정보(y)
개방자치단체명1.0000.5501.0001.000NaN0.0790.8160.0560.550
개방서비스명0.5501.0001.0001.0001.0000.7310.9330.5590.514
휴업시작일자1.0001.0001.0001.000NaN1.0001.0000.0000.000
휴업종료일자1.0001.0001.0001.000NaN1.0001.0001.0001.000
재개업일자NaN1.000NaNNaN1.000NaN1.000NaNNaN
영업상태명0.0790.7311.0001.000NaN1.0001.0000.0850.165
상세영업상태명0.8160.9331.0001.0001.0001.0001.0000.3980.307
좌표정보(x)0.0560.5590.0001.000NaN0.0850.3981.0001.000
좌표정보(y)0.5500.5140.0001.000NaN0.1650.3071.0001.000
2023-12-12T18:33:14.860681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개방자치단체명영업상태명상세영업상태명
개방자치단체명1.0000.0310.583
영업상태명0.0311.0000.999
상세영업상태명0.5830.9991.000
2023-12-12T18:33:14.988915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
좌표정보(x)좌표정보(y)개방자치단체명영업상태명상세영업상태명
좌표정보(x)1.0000.2630.0530.0800.201
좌표정보(y)0.2631.0000.2410.0660.165
개방자치단체명0.0530.2411.0000.0310.583
영업상태명0.0800.0660.0311.0000.999
상세영업상태명0.2010.1650.5830.9991.000

Missing values

2023-12-12T18:33:05.023394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:33:05.301168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T18:33:05.499665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

개방자치단체명데이터갱신구분데이터갱신일자개방서비스명사업장명지번주소소재지면적인허가일자인허가취소일자폐업일자휴업시작일자휴업종료일자재개업일자영업상태명상세영업상태명좌표정보(x)좌표정보(y)
18891대덕구신규생성2018-08-31동물병원둘리동물병원대전광역시 대덕구 중리동 391-5번지50.812007-10-18<NA>2007-12-28<NA><NA><NA>폐업폐업238073.7724318064.061
6050대덕구신규생성2018-08-31일반음식점일광식당대전광역시 대덕구 상서동 411번지 1층40.152017-09-12<NA><NA><NA><NA><NA>영업/정상영업237942.2738324636.7212
30456동구신규생성2018-08-31의원고려내과의원대전광역시 동구 성남2동 56번지5호01993-02-20<NA><NA><NA><NA><NA>영업/정상영업중239123.6178315654.6319
61429서구신규생성2018-08-31일반음식점임페리얼식당대전광역시 서구 도마동 127-12번지21.061997-05-19<NA>2003-11-06<NA><NA><NA>폐업폐업233862.565312896.8518
41275동구신규생성2018-08-31즉석판매제조가공업삼영상회대전광역시 동구 정동 ,44.82008-02-28<NA><NA><NA><NA><NA>영업/정상영업238898.7543314491.088
57842서구신규생성2018-08-31일반음식점럭키분식대전광역시 서구 괴정동 50-60번지 1층44.022005-09-23<NA>2006-03-14<NA><NA><NA>폐업폐업234669.7971315369.3099
22663동구신규생성2018-08-31일반음식점풍전식당대전광역시 동구 가양동 423-3번지85.521992-11-24<NA>2007-01-16<NA><NA><NA>폐업폐업239737.5506316714.1965
66352서구신규생성2018-08-31일반음식점행오버대전광역시 서구 둔산동 1063번지181.182017-05-29<NA><NA><NA><NA><NA>영업/정상영업233713.163316682.6647
9263대덕구신규생성2018-08-31즉석판매제조가공업반찬투정대전광역시 대덕구 송촌동 번지 선비마을단지상가 호16.722008-04-10<NA>2009-09-24<NA><NA><NA>폐업폐업239761.4132318571.6026
447대덕구신규생성2018-08-31미용업진주미용실대전광역시 대덕구 대화동 251-2번지22.12002-10-31<NA>2009-05-04<NA><NA><NA>폐업폐업236771.7518318225.5642
개방자치단체명데이터갱신구분데이터갱신일자개방서비스명사업장명지번주소소재지면적인허가일자인허가취소일자폐업일자휴업시작일자휴업종료일자재개업일자영업상태명상세영업상태명좌표정보(x)좌표정보(y)
55658서구신규생성2018-08-31건강기능식품일반판매업다움생식대전지사대전광역시 서구 관저동 번지 층호02013-04-12<NA>2014-03-24<NA><NA><NA>폐업폐업230447.4033311004.8472
56903서구신규생성2018-08-31건강기능식품일반판매업한민상사대전광역시 서구 괴정동 케이지빌아파트 호02006-10-09<NA>2008-09-09<NA><NA><NA>폐업폐업233614.498315343.2981
39942동구신규생성2018-08-31노래연습장업핫쨈동전노래연습장대전광역시 동구 자양동 140-22번지 2층02016-05-27<NA><NA><NA><NA><NA>영업/정상영업중239994.5147314921.936
6183대덕구신규생성2018-08-31즉석판매제조가공업낙원떡집대전광역시 대덕구 석봉동30.31996-06-20<NA>2003-08-26<NA><NA><NA>폐업폐업238106.2839327644.1634
16174대덕구신규생성2018-08-31담배소매업감나무골식당대전광역시 대덕구 이현동 260번지 2호01999-02-08<NA>2002-09-30<NA><NA><NA>폐업폐업처리241655.4979323387.8105
12494대덕구신규생성2018-08-31식품소분업중도매인37번대전광역시 대덕구 오정동 외필지 건어물동(층)15.942006-11-09<NA><NA><NA><NA><NA>영업/정상영업236370.2501317501.7726
62121서구신규생성2018-08-31일반음식점경주마차대전광역시 서구 도마동 171-15번지15.111994-11-17<NA>2003-01-20<NA><NA><NA>폐업폐업233848.5427312562.158
8710대덕구신규생성2018-08-31일반음식점마쯔리식당대전광역시 대덕구 송촌동 467-2번지47.22002-08-09<NA>2008-10-20<NA><NA><NA>폐업폐업239154.6108318194.0116
55553서구신규생성2018-08-31통신판매업크림(cream)대전광역시 서구 관저동 번지 원앙마을 단지02018-08-24<NA><NA><NA><NA><NA>영업/정상정상영업230369.5712311456.7499
62165서구신규생성2018-08-31체력단련장업대영스포츠대전광역시 서구 도마동 172-15번지 도마라이프클리닉02005-12-23<NA>2018-07-27<NA><NA><NA>취소/말소/만료/정지/중지직권말소234014.3608312578.2273

Duplicate rows

Most frequently occurring

개방자치단체명데이터갱신구분데이터갱신일자개방서비스명사업장명지번주소소재지면적인허가일자인허가취소일자폐업일자휴업시작일자휴업종료일자재개업일자영업상태명상세영업상태명좌표정보(x)좌표정보(y)# duplicates
0서구신규생성2018-08-31담배소매업(주)코리아세븐갈마점대전광역시 서구 갈마동 1073호02001-10-08<NA>2004-10-28<NA><NA><NA>폐업폐업처리233545.6155316250.95712