Overview

Dataset statistics

Number of variables5
Number of observations264
Missing cells18
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.7 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Text4

Dataset

Description대구광역시 동구에 등록된 공장 현황 데이터 입니다. 이 데이터는 회사명, 위치, 전화번호, 생산품 등의 정보를 포함하고 있습니다.
Author대구광역시 동구
URLhttps://www.data.go.kr/data/3075346/fileData.do

Alerts

전화번호 has 18 (6.8%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:01:33.274043
Analysis finished2024-04-06 08:01:34.442276
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct264
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.5
Minimum1
Maximum264
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-04-06T17:01:34.586515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.15
Q166.75
median132.5
Q3198.25
95-th percentile250.85
Maximum264
Range263
Interquartile range (IQR)131.5

Descriptive statistics

Standard deviation76.354437
Coefficient of variation (CV)0.5762599
Kurtosis-1.2
Mean132.5
Median Absolute Deviation (MAD)66
Skewness0
Sum34980
Variance5830
MonotonicityStrictly increasing
2024-04-06T17:01:34.948382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
183 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
172 1
 
0.4%
173 1
 
0.4%
174 1
 
0.4%
175 1
 
0.4%
176 1
 
0.4%
Other values (254) 254
96.2%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
264 1
0.4%
263 1
0.4%
262 1
0.4%
261 1
0.4%
260 1
0.4%
259 1
0.4%
258 1
0.4%
257 1
0.4%
256 1
0.4%
255 1
0.4%
Distinct260
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-06T17:01:35.389695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length7.0340909
Min length2

Characters and Unicode

Total characters1857
Distinct characters285
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique256 ?
Unique (%)97.0%

Sample

1st row(사)한국척수장애인협회디지털사업단
2nd row(주)E.O.S
3rd row(주)경민광학
4th row(주)경북 캐터링
5th row(주)경오전자
ValueCountFrequency (%)
주식회사 21
 
6.8%
선일기전(주 2
 
0.6%
주)유니월드 2
 
0.6%
주)서우 2
 
0.6%
대구공장 2
 
0.6%
농업회사법인 2
 
0.6%
주)에이치엠지 2
 
0.6%
주)디케이 2
 
0.6%
주)미승 2
 
0.6%
우미int 1
 
0.3%
Other values (272) 272
87.7%
2024-04-06T17:01:36.150228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
146
 
7.9%
( 127
 
6.8%
) 127
 
6.8%
48
 
2.6%
45
 
2.4%
41
 
2.2%
39
 
2.1%
35
 
1.9%
35
 
1.9%
31
 
1.7%
Other values (275) 1183
63.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1490
80.2%
Open Punctuation 127
 
6.8%
Close Punctuation 127
 
6.8%
Uppercase Letter 53
 
2.9%
Space Separator 48
 
2.6%
Other Punctuation 7
 
0.4%
Decimal Number 3
 
0.2%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
9.8%
45
 
3.0%
41
 
2.8%
39
 
2.6%
35
 
2.3%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
26
 
1.7%
Other values (251) 1034
69.4%
Uppercase Letter
ValueCountFrequency (%)
N 8
15.1%
E 8
15.1%
S 4
7.5%
G 4
7.5%
T 4
7.5%
I 4
7.5%
D 3
 
5.7%
A 3
 
5.7%
H 3
 
5.7%
J 3
 
5.7%
Other values (5) 9
17.0%
Other Punctuation
ValueCountFrequency (%)
. 6
85.7%
& 1
 
14.3%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
w 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 127
100.0%
Close Punctuation
ValueCountFrequency (%)
) 127
100.0%
Space Separator
ValueCountFrequency (%)
48
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1490
80.2%
Common 312
 
16.8%
Latin 55
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
9.8%
45
 
3.0%
41
 
2.8%
39
 
2.6%
35
 
2.3%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
26
 
1.7%
Other values (251) 1034
69.4%
Latin
ValueCountFrequency (%)
N 8
14.5%
E 8
14.5%
S 4
 
7.3%
G 4
 
7.3%
T 4
 
7.3%
I 4
 
7.3%
D 3
 
5.5%
A 3
 
5.5%
H 3
 
5.5%
J 3
 
5.5%
Other values (7) 11
20.0%
Common
ValueCountFrequency (%)
( 127
40.7%
) 127
40.7%
48
 
15.4%
. 6
 
1.9%
2 2
 
0.6%
& 1
 
0.3%
1 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1490
80.2%
ASCII 367
 
19.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
146
 
9.8%
45
 
3.0%
41
 
2.8%
39
 
2.6%
35
 
2.3%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
26
 
1.7%
Other values (251) 1034
69.4%
ASCII
ValueCountFrequency (%)
( 127
34.6%
) 127
34.6%
48
 
13.1%
N 8
 
2.2%
E 8
 
2.2%
. 6
 
1.6%
S 4
 
1.1%
G 4
 
1.1%
T 4
 
1.1%
I 4
 
1.1%
Other values (14) 27
 
7.4%
Distinct259
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-06T17:01:36.755141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length40
Mean length25.606061
Min length17

Characters and Unicode

Total characters6760
Distinct characters146
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)96.2%

Sample

1st row대구광역시 동구 첨단로 30, 113호, 114호 (신서동)
2nd row대구광역시 동구 안심로53길 58 (동호동) (총 2 필지) 외 1필지
3rd row대구광역시 동구 공항로56길 18(지저동)
4th row대구광역시 동구 반야월북로2길 10 (율암동)
5th row대구광역시 동구 안심로73길 5 (신서동)
ValueCountFrequency (%)
대구광역시 264
 
18.9%
동구 264
 
18.9%
동호동 29
 
2.1%
신평동 23
 
1.6%
방촌동 21
 
1.5%
각산동 18
 
1.3%
신서동 18
 
1.3%
용계동 17
 
1.2%
율암동 16
 
1.1%
불로동 14
 
1.0%
Other values (379) 715
51.1%
2024-04-06T17:01:37.687680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1135
16.8%
594
 
8.8%
529
 
7.8%
273
 
4.0%
269
 
4.0%
266
 
3.9%
265
 
3.9%
264
 
3.9%
( 263
 
3.9%
) 263
 
3.9%
Other values (136) 2639
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3854
57.0%
Space Separator 1135
 
16.8%
Decimal Number 1094
 
16.2%
Open Punctuation 263
 
3.9%
Close Punctuation 263
 
3.9%
Dash Punctuation 82
 
1.2%
Other Punctuation 67
 
1.0%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
594
15.4%
529
13.7%
273
 
7.1%
269
 
7.0%
266
 
6.9%
265
 
6.9%
264
 
6.9%
163
 
4.2%
105
 
2.7%
50
 
1.3%
Other values (120) 1076
27.9%
Decimal Number
ValueCountFrequency (%)
1 259
23.7%
2 159
14.5%
3 121
11.1%
5 115
10.5%
0 87
 
8.0%
4 86
 
7.9%
6 86
 
7.9%
9 71
 
6.5%
7 61
 
5.6%
8 49
 
4.5%
Space Separator
ValueCountFrequency (%)
1135
100.0%
Open Punctuation
ValueCountFrequency (%)
( 263
100.0%
Close Punctuation
ValueCountFrequency (%)
) 263
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%
Other Punctuation
ValueCountFrequency (%)
, 67
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3854
57.0%
Common 2904
43.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
594
15.4%
529
13.7%
273
 
7.1%
269
 
7.0%
266
 
6.9%
265
 
6.9%
264
 
6.9%
163
 
4.2%
105
 
2.7%
50
 
1.3%
Other values (120) 1076
27.9%
Common
ValueCountFrequency (%)
1135
39.1%
( 263
 
9.1%
) 263
 
9.1%
1 259
 
8.9%
2 159
 
5.5%
3 121
 
4.2%
5 115
 
4.0%
0 87
 
3.0%
4 86
 
3.0%
6 86
 
3.0%
Other values (5) 330
 
11.4%
Latin
ValueCountFrequency (%)
A 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3854
57.0%
ASCII 2906
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1135
39.1%
( 263
 
9.1%
) 263
 
9.1%
1 259
 
8.9%
2 159
 
5.5%
3 121
 
4.2%
5 115
 
4.0%
0 87
 
3.0%
4 86
 
3.0%
6 86
 
3.0%
Other values (6) 332
 
11.4%
Hangul
ValueCountFrequency (%)
594
15.4%
529
13.7%
273
 
7.1%
269
 
7.0%
266
 
6.9%
265
 
6.9%
264
 
6.9%
163
 
4.2%
105
 
2.7%
50
 
1.3%
Other values (120) 1076
27.9%

전화번호
Text

MISSING 

Distinct238
Distinct (%)96.7%
Missing18
Missing (%)6.8%
Memory size2.2 KiB
2024-04-06T17:01:38.147253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.00813
Min length9

Characters and Unicode

Total characters2954
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)93.9%

Sample

1st row053-965-7277
2nd row053-962-7842
3rd row053-983-6461
4th row053-963-0006
5th row053-963-0751
ValueCountFrequency (%)
053-985-3881 3
 
1.2%
053-354-3311 2
 
0.8%
053-962-0152 2
 
0.8%
053-964-6600 2
 
0.8%
053-964-0475 2
 
0.8%
053-963-2233 2
 
0.8%
053-986-1031 2
 
0.8%
053-964-9576 1
 
0.4%
053-986-8787 1
 
0.4%
053-756-1624 1
 
0.4%
Other values (228) 228
92.7%
2024-04-06T17:01:38.844728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 491
16.6%
5 426
14.4%
0 407
13.8%
3 395
13.4%
9 240
8.1%
6 210
7.1%
8 186
 
6.3%
1 166
 
5.6%
2 158
 
5.3%
4 149
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2463
83.4%
Dash Punctuation 491
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 426
17.3%
0 407
16.5%
3 395
16.0%
9 240
9.7%
6 210
8.5%
8 186
7.6%
1 166
 
6.7%
2 158
 
6.4%
4 149
 
6.0%
7 126
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 491
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2954
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 491
16.6%
5 426
14.4%
0 407
13.8%
3 395
13.4%
9 240
8.1%
6 210
7.1%
8 186
 
6.3%
1 166
 
5.6%
2 158
 
5.3%
4 149
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2954
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 491
16.6%
5 426
14.4%
0 407
13.8%
3 395
13.4%
9 240
8.1%
6 210
7.1%
8 186
 
6.3%
1 166
 
5.6%
2 158
 
5.3%
4 149
 
5.0%
Distinct242
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-06T17:01:39.344924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length17
Mean length8.0075758
Min length1

Characters and Unicode

Total characters2114
Distinct characters369
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)85.6%

Sample

1st row인쇄물
2nd row콘텍트렌즈
3rd row작업용 안경
4th row도시락
5th rowDMB ANT
ValueCountFrequency (%)
12
 
2.7%
창호 10
 
2.3%
7
 
1.6%
인쇄물 6
 
1.4%
광고물 6
 
1.4%
영상감시장치 4
 
0.9%
간판 3
 
0.7%
근무복 3
 
0.7%
3
 
0.7%
cctv 3
 
0.7%
Other values (341) 385
87.1%
2024-04-06T17:01:40.081398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
179
 
8.5%
, 107
 
5.1%
65
 
3.1%
44
 
2.1%
42
 
2.0%
34
 
1.6%
33
 
1.6%
33
 
1.6%
27
 
1.3%
26
 
1.2%
Other values (359) 1524
72.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1716
81.2%
Space Separator 179
 
8.5%
Other Punctuation 107
 
5.1%
Uppercase Letter 59
 
2.8%
Lowercase Letter 18
 
0.9%
Close Punctuation 16
 
0.8%
Open Punctuation 16
 
0.8%
Dash Punctuation 2
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
3.8%
44
 
2.6%
42
 
2.4%
34
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
26
 
1.5%
25
 
1.5%
25
 
1.5%
Other values (327) 1362
79.4%
Uppercase Letter
ValueCountFrequency (%)
C 13
22.0%
T 7
11.9%
D 7
11.9%
E 7
11.9%
V 6
10.2%
L 6
10.2%
A 3
 
5.1%
H 2
 
3.4%
M 2
 
3.4%
P 1
 
1.7%
Other values (5) 5
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
p 3
16.7%
c 3
16.7%
s 2
11.1%
u 2
11.1%
v 2
11.1%
d 1
 
5.6%
l 1
 
5.6%
i 1
 
5.6%
a 1
 
5.6%
y 1
 
5.6%
Space Separator
ValueCountFrequency (%)
179
100.0%
Other Punctuation
ValueCountFrequency (%)
, 107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1715
81.1%
Common 321
 
15.2%
Latin 77
 
3.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
3.8%
44
 
2.6%
42
 
2.4%
34
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
26
 
1.5%
25
 
1.5%
25
 
1.5%
Other values (326) 1361
79.4%
Latin
ValueCountFrequency (%)
C 13
16.9%
T 7
 
9.1%
D 7
 
9.1%
E 7
 
9.1%
V 6
 
7.8%
L 6
 
7.8%
p 3
 
3.9%
c 3
 
3.9%
A 3
 
3.9%
H 2
 
2.6%
Other values (16) 20
26.0%
Common
ValueCountFrequency (%)
179
55.8%
, 107
33.3%
) 16
 
5.0%
( 16
 
5.0%
- 2
 
0.6%
1 1
 
0.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1715
81.1%
ASCII 398
 
18.8%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
179
45.0%
, 107
26.9%
) 16
 
4.0%
( 16
 
4.0%
C 13
 
3.3%
T 7
 
1.8%
D 7
 
1.8%
E 7
 
1.8%
V 6
 
1.5%
L 6
 
1.5%
Other values (22) 34
 
8.5%
Hangul
ValueCountFrequency (%)
65
 
3.8%
44
 
2.6%
42
 
2.4%
34
 
2.0%
33
 
1.9%
33
 
1.9%
27
 
1.6%
26
 
1.5%
25
 
1.5%
25
 
1.5%
Other values (326) 1361
79.4%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2024-04-06T17:01:33.965470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-06T17:01:34.174403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:01:34.350545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번회사명공장대표도로명주소전화번호생산품
01(사)한국척수장애인협회디지털사업단대구광역시 동구 첨단로 30, 113호, 114호 (신서동)053-965-7277인쇄물
12(주)E.O.S대구광역시 동구 안심로53길 58 (동호동) (총 2 필지) 외 1필지053-962-7842콘텍트렌즈
23(주)경민광학대구광역시 동구 공항로56길 18(지저동)053-983-6461작업용 안경
34(주)경북 캐터링대구광역시 동구 반야월북로2길 10 (율암동)053-963-0006도시락
45(주)경오전자대구광역시 동구 안심로73길 5 (신서동)053-963-0751DMB ANT
56(주)고려과학대구광역시 동구 동촌로45길 42 (방촌동)053-985-4131염색실험기기
67(주)구남공조ENG대구광역시 동구 팔공로29길 38 (불로동)053-981-4835닥트
78(주)글로벌엔지니어링대구광역시 동구 화랑로 533 (용계동)053-985-9046방산부품및반도체장비
89(주)금강금속대구광역시 동구 신덕로 120 (신평동)053-983-6269보강심
910(주)네오칸대구광역시 동구 반야월북로 62 (율암동)053-961-5757판넬(목재가공품)
순번회사명공장대표도로명주소전화번호생산품
254255한국정유기(주)대구광역시 동구 옻골로 55 (부동)053-982-7710주수주유설비
255256한국제동산업대구광역시 동구 반야월북로12길 21, 1층 동편 (율암동)053-964-8527철도차량부품
256257한미합자 제일화학(주)대구광역시 동구 대림로2길 54-11 (대림동)053-962-3108농약
257258한샘DECO대구광역시 동구 신평동 52-13번지<NA>주방가구 및 일반가구
258259한세정보통신(주)대구광역시 동구 신평로 140 (용계동)053-985-7601영상감시장치
259260한양쇼파대구광역시 동구 안심로49길 130(동호동)053-962-6387쇼파
260261한영종합인쇄대구광역시 동구 첨단로 30, 404호 (신서동)053-962-3200인쇄물
261262한일시멘트(주)대구공장대구광역시 동구 반야월로 266 (동호동) 외 2필지053-961-0394레미콘
262263한창침장대구광역시 동구 신평로16길 57-20 (신평동)<NA>침구류 외
263264화성봉재대구광역시 동구 송라로11길 12 (신천동)<NA>봉제품, 타올