Overview

Dataset statistics

Number of variables5
Number of observations493
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.9 KiB
Average record size in memory41.3 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description대전광역시 동구 관내 의료기기업소 등록 현황으로서,영업소 명칭, 영업소 소재지(주소) 및 영업소 행정동 등의 정보를 포함하고 있습니다.
Author대전광역시 동구
URLhttps://www.data.go.kr/data/15067213/fileData.do

Alerts

영업구분 is highly imbalanced (78.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:30:01.518407
Analysis finished2023-12-12 07:30:02.124956
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct493
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean247
Minimum1
Maximum493
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T16:30:02.194101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile25.6
Q1124
median247
Q3370
95-th percentile468.4
Maximum493
Range492
Interquartile range (IQR)246

Descriptive statistics

Standard deviation142.46111
Coefficient of variation (CV)0.57676561
Kurtosis-1.2
Mean247
Median Absolute Deviation (MAD)123
Skewness0
Sum121771
Variance20295.167
MonotonicityStrictly increasing
2023-12-12T16:30:02.343431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
340 1
 
0.2%
338 1
 
0.2%
337 1
 
0.2%
336 1
 
0.2%
335 1
 
0.2%
334 1
 
0.2%
333 1
 
0.2%
332 1
 
0.2%
331 1
 
0.2%
Other values (483) 483
98.0%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
493 1
0.2%
492 1
0.2%
491 1
0.2%
490 1
0.2%
489 1
0.2%
488 1
0.2%
487 1
0.2%
486 1
0.2%
485 1
0.2%
484 1
0.2%

영업구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
판매업
476 
판매(임대)업
 
17

Length

Max length7
Median length3
Mean length3.137931
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row판매(임대)업
2nd row판매업
3rd row판매업
4th row판매업
5th row판매업

Common Values

ValueCountFrequency (%)
판매업 476
96.6%
판매(임대)업 17
 
3.4%

Length

2023-12-12T16:30:02.475257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:30:02.579876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
판매업 476
96.6%
판매(임대)업 17
 
3.4%
Distinct492
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T16:30:02.831950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length8.3853955
Min length2

Characters and Unicode

Total characters4134
Distinct characters383
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique491 ?
Unique (%)99.6%

Sample

1st row더숄더 어깨교정운동센터
2nd row더센헬스케어
3rd row씨유대전판암삼정점
4th row코어모션
5th row드림컴퍼니
ValueCountFrequency (%)
지에스25 39
 
5.6%
씨유 37
 
5.3%
세븐일레븐 29
 
4.1%
주식회사 14
 
2.0%
gs25 7
 
1.0%
cu 6
 
0.9%
이마트24 4
 
0.6%
대전성남점 4
 
0.6%
대전터미널점 3
 
0.4%
지에스25(gs25 3
 
0.4%
Other values (529) 554
79.1%
2023-12-12T16:30:03.274961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
207
 
5.0%
190
 
4.6%
172
 
4.2%
153
 
3.7%
140
 
3.4%
102
 
2.5%
101
 
2.4%
93
 
2.2%
88
 
2.1%
81
 
2.0%
Other values (373) 2807
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3497
84.6%
Space Separator 207
 
5.0%
Decimal Number 163
 
3.9%
Uppercase Letter 105
 
2.5%
Close Punctuation 68
 
1.6%
Open Punctuation 66
 
1.6%
Lowercase Letter 24
 
0.6%
Other Symbol 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
5.4%
172
 
4.9%
153
 
4.4%
140
 
4.0%
102
 
2.9%
101
 
2.9%
93
 
2.7%
88
 
2.5%
81
 
2.3%
65
 
1.9%
Other values (328) 2312
66.1%
Uppercase Letter
ValueCountFrequency (%)
S 25
23.8%
G 18
17.1%
C 13
12.4%
U 12
11.4%
M 7
 
6.7%
K 3
 
2.9%
O 3
 
2.9%
H 3
 
2.9%
N 3
 
2.9%
I 3
 
2.9%
Other values (10) 15
14.3%
Lowercase Letter
ValueCountFrequency (%)
e 4
16.7%
l 3
12.5%
a 3
12.5%
s 2
8.3%
t 2
8.3%
c 2
8.3%
u 2
8.3%
n 1
 
4.2%
r 1
 
4.2%
h 1
 
4.2%
Other values (3) 3
12.5%
Decimal Number
ValueCountFrequency (%)
2 79
48.5%
5 70
42.9%
4 7
 
4.3%
1 3
 
1.8%
3 2
 
1.2%
0 1
 
0.6%
7 1
 
0.6%
Space Separator
ValueCountFrequency (%)
207
100.0%
Close Punctuation
ValueCountFrequency (%)
) 68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3500
84.7%
Common 505
 
12.2%
Latin 129
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
5.4%
172
 
4.9%
153
 
4.4%
140
 
4.0%
102
 
2.9%
101
 
2.9%
93
 
2.7%
88
 
2.5%
81
 
2.3%
65
 
1.9%
Other values (329) 2315
66.1%
Latin
ValueCountFrequency (%)
S 25
19.4%
G 18
14.0%
C 13
 
10.1%
U 12
 
9.3%
M 7
 
5.4%
e 4
 
3.1%
K 3
 
2.3%
l 3
 
2.3%
a 3
 
2.3%
O 3
 
2.3%
Other values (23) 38
29.5%
Common
ValueCountFrequency (%)
207
41.0%
2 79
 
15.6%
5 70
 
13.9%
) 68
 
13.5%
( 66
 
13.1%
4 7
 
1.4%
1 3
 
0.6%
3 2
 
0.4%
0 1
 
0.2%
7 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3497
84.6%
ASCII 634
 
15.3%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
207
32.6%
2 79
 
12.5%
5 70
 
11.0%
) 68
 
10.7%
( 66
 
10.4%
S 25
 
3.9%
G 18
 
2.8%
C 13
 
2.1%
U 12
 
1.9%
4 7
 
1.1%
Other values (34) 69
 
10.9%
Hangul
ValueCountFrequency (%)
190
 
5.4%
172
 
4.9%
153
 
4.4%
140
 
4.0%
102
 
2.9%
101
 
2.9%
93
 
2.7%
88
 
2.5%
81
 
2.3%
65
 
1.9%
Other values (328) 2312
66.1%
None
ValueCountFrequency (%)
3
100.0%
Distinct491
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T16:30:03.590470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length46
Mean length29.752535
Min length19

Characters and Unicode

Total characters14668
Distinct characters220
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique489 ?
Unique (%)99.2%

Sample

1st row대전광역시 동구 새들2길 30, 204호 (신흥동)
2nd row대전광역시 동구 석천로 43-23, 101호 (낭월동)
3rd row대전광역시 동구 동부로10번길 52, 1층 103,104,105호 (판암동)
4th row대전광역시 동구 대학로 62, 대전대학교 산학협력관 3층 310-2호 (용운동)
5th row대전광역시 동구 홍도로 60, 101동 1501호 (용전동, 새피앙아파트)
ValueCountFrequency (%)
대전광역시 493
 
16.5%
동구 493
 
16.5%
1층 144
 
4.8%
용전동 84
 
2.8%
가양동 77
 
2.6%
2층 58
 
1.9%
대전로 39
 
1.3%
삼성동 33
 
1.1%
가오동 27
 
0.9%
정동 26
 
0.9%
Other values (674) 1506
50.5%
2023-12-12T16:30:04.058784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2489
 
17.0%
1132
 
7.7%
718
 
4.9%
712
 
4.9%
1 706
 
4.8%
503
 
3.4%
501
 
3.4%
) 500
 
3.4%
( 500
 
3.4%
496
 
3.4%
Other values (210) 6411
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7943
54.2%
Decimal Number 2678
 
18.3%
Space Separator 2489
 
17.0%
Close Punctuation 500
 
3.4%
Open Punctuation 500
 
3.4%
Other Punctuation 431
 
2.9%
Dash Punctuation 110
 
0.7%
Uppercase Letter 15
 
0.1%
Math Symbol 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1132
14.3%
718
 
9.0%
712
 
9.0%
503
 
6.3%
501
 
6.3%
496
 
6.2%
494
 
6.2%
478
 
6.0%
257
 
3.2%
206
 
2.6%
Other values (183) 2446
30.8%
Decimal Number
ValueCountFrequency (%)
1 706
26.4%
2 401
15.0%
0 256
 
9.6%
3 251
 
9.4%
4 226
 
8.4%
5 222
 
8.3%
7 169
 
6.3%
8 166
 
6.2%
6 151
 
5.6%
9 130
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
B 4
26.7%
E 2
13.3%
C 2
13.3%
H 1
 
6.7%
J 1
 
6.7%
G 1
 
6.7%
O 1
 
6.7%
K 1
 
6.7%
N 1
 
6.7%
A 1
 
6.7%
Space Separator
ValueCountFrequency (%)
2489
100.0%
Close Punctuation
ValueCountFrequency (%)
) 500
100.0%
Open Punctuation
ValueCountFrequency (%)
( 500
100.0%
Other Punctuation
ValueCountFrequency (%)
, 431
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 110
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7943
54.2%
Common 6709
45.7%
Latin 16
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1132
14.3%
718
 
9.0%
712
 
9.0%
503
 
6.3%
501
 
6.3%
496
 
6.2%
494
 
6.2%
478
 
6.0%
257
 
3.2%
206
 
2.6%
Other values (183) 2446
30.8%
Common
ValueCountFrequency (%)
2489
37.1%
1 706
 
10.5%
) 500
 
7.5%
( 500
 
7.5%
, 431
 
6.4%
2 401
 
6.0%
0 256
 
3.8%
3 251
 
3.7%
4 226
 
3.4%
5 222
 
3.3%
Other values (6) 727
 
10.8%
Latin
ValueCountFrequency (%)
B 4
25.0%
E 2
12.5%
C 2
12.5%
H 1
 
6.2%
J 1
 
6.2%
G 1
 
6.2%
O 1
 
6.2%
K 1
 
6.2%
N 1
 
6.2%
A 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7943
54.2%
ASCII 6725
45.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2489
37.0%
1 706
 
10.5%
) 500
 
7.4%
( 500
 
7.4%
, 431
 
6.4%
2 401
 
6.0%
0 256
 
3.8%
3 251
 
3.7%
4 226
 
3.4%
5 222
 
3.3%
Other values (17) 743
 
11.0%
Hangul
ValueCountFrequency (%)
1132
14.3%
718
 
9.0%
712
 
9.0%
503
 
6.3%
501
 
6.3%
496
 
6.2%
494
 
6.2%
478
 
6.0%
257
 
3.2%
206
 
2.6%
Other values (183) 2446
30.8%
Distinct15
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
용전동
89 
중앙동
65 
가양1동
63 
산내동
38 
삼성동
35 
Other values (10)
203 

Length

Max length4
Median length3
Mean length3.1156187
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신인동
2nd row산내동
3rd row판암1동
4th row용운동
5th row용전동

Common Values

ValueCountFrequency (%)
용전동 89
18.1%
중앙동 65
13.2%
가양1동 63
12.8%
산내동 38
7.7%
삼성동 35
 
7.1%
용운동 32
 
6.5%
효동 31
 
6.3%
자양동 25
 
5.1%
성남동 24
 
4.9%
홍도동 23
 
4.7%
Other values (5) 68
13.8%

Length

2023-12-12T16:30:04.273734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용전동 89
18.1%
중앙동 65
13.2%
가양1동 63
12.8%
산내동 38
7.7%
삼성동 35
 
7.1%
용운동 32
 
6.5%
효동 31
 
6.3%
자양동 25
 
5.1%
성남동 24
 
4.9%
홍도동 23
 
4.7%
Other values (5) 68
13.8%

Interactions

2023-12-12T16:30:01.886184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:30:04.407190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번영업구분영업소행정동
연번1.0000.1270.248
영업구분0.1271.0000.110
영업소행정동0.2480.1101.000
2023-12-12T16:30:04.516500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업소행정동영업구분
영업소행정동1.0000.098
영업구분0.0981.000
2023-12-12T16:30:04.609904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번영업구분영업소행정동
연번1.0000.0960.094
영업구분0.0961.0000.098
영업소행정동0.0940.0981.000

Missing values

2023-12-12T16:30:01.990325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:30:02.091057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번영업구분영업소명영업소소재지(도로명)영업소행정동
01판매(임대)업더숄더 어깨교정운동센터대전광역시 동구 새들2길 30, 204호 (신흥동)신인동
12판매업더센헬스케어대전광역시 동구 석천로 43-23, 101호 (낭월동)산내동
23판매업씨유대전판암삼정점대전광역시 동구 동부로10번길 52, 1층 103,104,105호 (판암동)판암1동
34판매업코어모션대전광역시 동구 대학로 62, 대전대학교 산학협력관 3층 310-2호 (용운동)용운동
45판매업드림컴퍼니대전광역시 동구 홍도로 60, 101동 1501호 (용전동, 새피앙아파트)용전동
56판매업지에스25(GS25)대전효동현대점대전광역시 동구 계족로 25, 효동현대아파트 1층 (효동)효동
67판매업씨유(CU)대전가양타운점대전광역시 동구 동중앙로 107-6, 1층 (가양동)가양1동
78판매(임대)업본기구필라테스대전광역시 동구 용운로 87-1, 2층 (용운동)용운동
89판매업빅뱅잉글리시리더스대전광역시 동구 동대전로131번길 34, 3층 (자양동)자양동
910판매업씨유 대전하늘채점대전광역시 동구 동구청로 35 (대성동, 은어송마을2단지 코오롱하늘채)산내동
연번영업구분영업소명영업소소재지(도로명)영업소행정동
483484판매업건화치기재상사대전광역시 동구 대전로839번길 57, 3층 (중동)중앙동
484485판매업동남의료기대전광역시 동구 대전로 826 (정동)중앙동
485486판매업드림메디상사대전광역시 동구 우암로255번길 20, 102호 (가양동)가양2동
486487판매업광산의료기대전광역시 동구 대전로779번길 11 (원동)중앙동
487488판매업한밭의료기대전광역시 동구 대전로 823-1 (정동)중앙동
488489판매업보문상사대전광역시 동구 대전로797번길 42 (중동)중앙동
489490판매업장수치과재료상사대전광역시 동구 대전로 823-2 (정동)중앙동
490491판매업삼공치과재료상사대전광역시 동구 대전로815번길 51, 2층 (정동)중앙동
491492판매업독일보청기대전광역시 동구 중앙로 210-1 (중동)중앙동
492493판매업홍명형제양행대전광역시 동구 대전로 819 (정동)중앙동