Overview

Dataset statistics

Number of variables6
Number of observations90
Missing cells17
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory50.5 B

Variable types

Numeric1
Text4
DateTime1

Dataset

Description부산광역시 연제구 공장등록현황(회사명, 주소, 등록일, 전화번호 등)에 대한 데이터로 아래와 같이 항목을 제공합니다.
URLhttps://www.data.go.kr/data/3082241/fileData.do

Alerts

전화번호 has 6 (6.7%) missing valuesMissing
팩스번호 has 11 (12.2%) missing valuesMissing
순번 has unique valuesUnique
공장대표주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:28:07.766226
Analysis finished2023-12-12 10:28:09.082003
Duration1.32 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct90
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.5
Minimum1
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size942.0 B
2023-12-12T19:28:09.160938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.45
Q123.25
median45.5
Q367.75
95-th percentile85.55
Maximum90
Range89
Interquartile range (IQR)44.5

Descriptive statistics

Standard deviation26.124701
Coefficient of variation (CV)0.57416925
Kurtosis-1.2
Mean45.5
Median Absolute Deviation (MAD)22.5
Skewness0
Sum4095
Variance682.5
MonotonicityStrictly increasing
2023-12-12T19:28:09.313829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
69 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
Other values (80) 80
88.9%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%
Distinct89
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-12T19:28:09.578663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length7.0222222
Min length2

Characters and Unicode

Total characters632
Distinct characters188
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)97.8%

Sample

1st row(주) 송림드리움
2nd row(주)강동미디어
3rd row(주)고려폴리머
4th row(주)국제비엠에스
5th row(주)대륙건설광고공사
ValueCountFrequency (%)
주식회사 12
 
11.0%
주)엘앤비기술 2
 
1.8%
타펨코리아 1
 
0.9%
비손 1
 
0.9%
동우비지니스솔루션 1
 
0.9%
일성테크원주식회사 1
 
0.9%
이한기술단 1
 
0.9%
이채라이팅 1
 
0.9%
유성산업(주 1
 
0.9%
월드파워 1
 
0.9%
Other values (87) 87
79.8%
2023-12-12T19:28:10.018081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
7.4%
( 32
 
5.1%
) 32
 
5.1%
25
 
4.0%
19
 
3.0%
18
 
2.8%
15
 
2.4%
15
 
2.4%
12
 
1.9%
11
 
1.7%
Other values (178) 406
64.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 539
85.3%
Open Punctuation 32
 
5.1%
Close Punctuation 32
 
5.1%
Space Separator 19
 
3.0%
Uppercase Letter 8
 
1.3%
Other Punctuation 1
 
0.2%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
8.7%
25
 
4.6%
18
 
3.3%
15
 
2.8%
15
 
2.8%
12
 
2.2%
11
 
2.0%
10
 
1.9%
10
 
1.9%
9
 
1.7%
Other values (167) 367
68.1%
Uppercase Letter
ValueCountFrequency (%)
S 2
25.0%
E 2
25.0%
H 1
12.5%
C 1
12.5%
T 1
12.5%
I 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 539
85.3%
Common 85
 
13.4%
Latin 8
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
8.7%
25
 
4.6%
18
 
3.3%
15
 
2.8%
15
 
2.8%
12
 
2.2%
11
 
2.0%
10
 
1.9%
10
 
1.9%
9
 
1.7%
Other values (167) 367
68.1%
Latin
ValueCountFrequency (%)
S 2
25.0%
E 2
25.0%
H 1
12.5%
C 1
12.5%
T 1
12.5%
I 1
12.5%
Common
ValueCountFrequency (%)
( 32
37.6%
) 32
37.6%
19
22.4%
& 1
 
1.2%
2 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 539
85.3%
ASCII 93
 
14.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
8.7%
25
 
4.6%
18
 
3.3%
15
 
2.8%
15
 
2.8%
12
 
2.2%
11
 
2.0%
10
 
1.9%
10
 
1.9%
9
 
1.7%
Other values (167) 367
68.1%
ASCII
ValueCountFrequency (%)
( 32
34.4%
) 32
34.4%
19
20.4%
S 2
 
2.2%
E 2
 
2.2%
H 1
 
1.1%
C 1
 
1.1%
T 1
 
1.1%
I 1
 
1.1%
& 1
 
1.1%
Distinct90
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-12T19:28:10.387659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length39
Mean length30.044444
Min length21

Characters and Unicode

Total characters2704
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)100.0%

Sample

1st row부산광역시 연제구 과정로287번길 52 (연산동) 외 1필지
2nd row부산광역시 연제구 거제대로 176, 3층 (거제동, 한신상가)
3rd row부산광역시 연제구 거제대로 120(거제동)
4th row부산광역시 연제구 토곡남로 9 (연산동)
5th row부산광역시 연제구 반송로 104, 연산동 104-36번지 (연산동)
ValueCountFrequency (%)
부산광역시 90
 
17.3%
연제구 90
 
17.3%
연산동 51
 
9.8%
거제동 27
 
5.2%
거제대로 10
 
1.9%
2층 7
 
1.3%
월드컵대로 6
 
1.2%
3층 5
 
1.0%
반송로 4
 
0.8%
3 4
 
0.8%
Other values (177) 226
43.5%
2023-12-12T19:28:10.989177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
430
 
15.9%
150
 
5.5%
147
 
5.4%
146
 
5.4%
98
 
3.6%
1 95
 
3.5%
95
 
3.5%
) 92
 
3.4%
92
 
3.4%
( 92
 
3.4%
Other values (103) 1267
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1613
59.7%
Space Separator 430
 
15.9%
Decimal Number 408
 
15.1%
Close Punctuation 92
 
3.4%
Open Punctuation 92
 
3.4%
Other Punctuation 54
 
2.0%
Dash Punctuation 10
 
0.4%
Uppercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
150
 
9.3%
147
 
9.1%
146
 
9.1%
98
 
6.1%
95
 
5.9%
92
 
5.7%
90
 
5.6%
90
 
5.6%
90
 
5.6%
90
 
5.6%
Other values (86) 525
32.5%
Decimal Number
ValueCountFrequency (%)
1 95
23.3%
2 70
17.2%
3 56
13.7%
4 33
 
8.1%
5 31
 
7.6%
0 29
 
7.1%
8 26
 
6.4%
7 24
 
5.9%
9 23
 
5.6%
6 21
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 4
80.0%
D 1
 
20.0%
Space Separator
ValueCountFrequency (%)
430
100.0%
Close Punctuation
ValueCountFrequency (%)
) 92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 92
100.0%
Other Punctuation
ValueCountFrequency (%)
, 54
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1613
59.7%
Common 1086
40.2%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
9.3%
147
 
9.1%
146
 
9.1%
98
 
6.1%
95
 
5.9%
92
 
5.7%
90
 
5.6%
90
 
5.6%
90
 
5.6%
90
 
5.6%
Other values (86) 525
32.5%
Common
ValueCountFrequency (%)
430
39.6%
1 95
 
8.7%
) 92
 
8.5%
( 92
 
8.5%
2 70
 
6.4%
3 56
 
5.2%
, 54
 
5.0%
4 33
 
3.0%
5 31
 
2.9%
0 29
 
2.7%
Other values (5) 104
 
9.6%
Latin
ValueCountFrequency (%)
B 4
80.0%
D 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1613
59.7%
ASCII 1091
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
430
39.4%
1 95
 
8.7%
) 92
 
8.4%
( 92
 
8.4%
2 70
 
6.4%
3 56
 
5.1%
, 54
 
4.9%
4 33
 
3.0%
5 31
 
2.8%
0 29
 
2.7%
Other values (7) 109
 
10.0%
Hangul
ValueCountFrequency (%)
150
 
9.3%
147
 
9.1%
146
 
9.1%
98
 
6.1%
95
 
5.9%
92
 
5.7%
90
 
5.6%
90
 
5.6%
90
 
5.6%
90
 
5.6%
Other values (86) 525
32.5%
Distinct88
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size852.0 B
Minimum1998-02-17 00:00:00
Maximum2023-05-23 00:00:00
2023-12-12T19:28:11.163807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:28:11.339246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

전화번호
Text

MISSING 

Distinct83
Distinct (%)98.8%
Missing6
Missing (%)6.7%
Memory size852.0 B
2023-12-12T19:28:11.630993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035714
Min length12

Characters and Unicode

Total characters1011
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)97.6%

Sample

1st row051-862-8170
2nd row051-864-6500
3rd row051-644-8877
4th row051-852-4517
5th row051-867-9999
ValueCountFrequency (%)
051-851-3400 2
 
2.4%
051-852-2233 1
 
1.2%
051-852-5833 1
 
1.2%
051-504-8422 1
 
1.2%
051-925-1514 1
 
1.2%
051-752-7771 1
 
1.2%
051-751-5520 1
 
1.2%
051-866-3016 1
 
1.2%
051-522-7278 1
 
1.2%
051-852-2950 1
 
1.2%
Other values (73) 73
86.9%
2023-12-12T19:28:12.111753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 168
16.6%
5 162
16.0%
0 146
14.4%
1 131
13.0%
8 85
8.4%
6 75
7.4%
2 66
 
6.5%
7 50
 
4.9%
3 47
 
4.6%
4 46
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 843
83.4%
Dash Punctuation 168
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 162
19.2%
0 146
17.3%
1 131
15.5%
8 85
10.1%
6 75
8.9%
2 66
7.8%
7 50
 
5.9%
3 47
 
5.6%
4 46
 
5.5%
9 35
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1011
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 168
16.6%
5 162
16.0%
0 146
14.4%
1 131
13.0%
8 85
8.4%
6 75
7.4%
2 66
 
6.5%
7 50
 
4.9%
3 47
 
4.6%
4 46
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1011
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 168
16.6%
5 162
16.0%
0 146
14.4%
1 131
13.0%
8 85
8.4%
6 75
7.4%
2 66
 
6.5%
7 50
 
4.9%
3 47
 
4.6%
4 46
 
4.5%

팩스번호
Text

MISSING 

Distinct77
Distinct (%)97.5%
Missing11
Missing (%)12.2%
Memory size852.0 B
2023-12-12T19:28:12.400825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.075949
Min length12

Characters and Unicode

Total characters954
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)94.9%

Sample

1st row051-862-8174
2nd row051-853-8444
3rd row051-644-8897
4th row051-852-4519
5th row051-866-2375
ValueCountFrequency (%)
051-851-4646 2
 
2.5%
051-867-0289 2
 
2.5%
051-866-2874 1
 
1.3%
051-862-8174 1
 
1.3%
051-759-8429 1
 
1.3%
051-925-1515 1
 
1.3%
051-752-7726 1
 
1.3%
051-751-5521 1
 
1.3%
051-864-7657 1
 
1.3%
051-523-6767 1
 
1.3%
Other values (67) 67
84.8%
2023-12-12T19:28:12.894231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 163
17.1%
- 158
16.6%
0 125
13.1%
1 120
12.6%
6 81
8.5%
8 77
8.1%
2 65
 
6.8%
4 50
 
5.2%
9 40
 
4.2%
7 38
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 796
83.4%
Dash Punctuation 158
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 163
20.5%
0 125
15.7%
1 120
15.1%
6 81
10.2%
8 77
9.7%
2 65
 
8.2%
4 50
 
6.3%
9 40
 
5.0%
7 38
 
4.8%
3 37
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 158
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 954
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 163
17.1%
- 158
16.6%
0 125
13.1%
1 120
12.6%
6 81
8.5%
8 77
8.1%
2 65
 
6.8%
4 50
 
5.2%
9 40
 
4.2%
7 38
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 954
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 163
17.1%
- 158
16.6%
0 125
13.1%
1 120
12.6%
6 81
8.5%
8 77
8.1%
2 65
 
6.8%
4 50
 
5.2%
9 40
 
4.2%
7 38
 
4.0%

Interactions

2023-12-12T19:28:08.731216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:28:13.013515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번회사명공장대표주소(도로명)공장등록일전화번호팩스번호
순번1.0001.0001.0000.9731.0000.969
회사명1.0001.0001.0000.9971.0001.000
공장대표주소(도로명)1.0001.0001.0001.0001.0001.000
공장등록일0.9730.9971.0001.0000.9960.995
전화번호1.0001.0001.0000.9961.0001.000
팩스번호0.9691.0001.0000.9951.0001.000

Missing values

2023-12-12T19:28:08.842963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:28:08.944443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:28:09.036210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번회사명공장대표주소(도로명)공장등록일전화번호팩스번호
01(주) 송림드리움부산광역시 연제구 과정로287번길 52 (연산동) 외 1필지2018-01-11051-862-8170051-862-8174
12(주)강동미디어부산광역시 연제구 거제대로 176, 3층 (거제동, 한신상가)2017-09-20051-864-6500051-853-8444
23(주)고려폴리머부산광역시 연제구 거제대로 120(거제동)2023-02-22051-644-8877051-644-8897
34(주)국제비엠에스부산광역시 연제구 토곡남로 9 (연산동)2013-09-24051-852-4517051-852-4519
45(주)대륙건설광고공사부산광역시 연제구 반송로 104, 연산동 104-36번지 (연산동)2022-02-22051-867-9999<NA>
56(주)동성지기부산광역시 연제구 중앙대로1133번길 21 (연산동, 주식회사 동성지기)2018-01-04051-852-7321051-866-2375
67(주)동인산업부산광역시 연제구 과정로344번길 53 (연산동)2009-09-10051-863-5983051-867-0289
78(주)부경테크부산광역시 연제구 고분로247번길 6-1 (연산동)2021-04-16051-865-1417051-865-9399
89(주)세광 2공장부산광역시 연제구 배산북로 38-1 (연산동)2021-04-29051-556-1730051-556-1732
910(주)신승시스템부산광역시 연제구 중앙대로1219번길 15 (거제동, 신승빌딩)2005-11-30051-507-6031051-507-6034
순번회사명공장대표주소(도로명)공장등록일전화번호팩스번호
8081타펨코리아부산광역시 연제구 중앙대로1038번길 54 (연산동, 효성아파트) 지하30호2020-07-16051-553-6446051-553-6445
8182태광물산(주)부산광역시 연제구 월드컵대로164번길 17 (연산동)2006-04-05051-866-8111051-851-3022
8283하이눈정보통신부산광역시 연제구 거제대로252번길 32, 2층 (거제동)2013-04-26051-866-0377<NA>
8384하이콤 주식회사부산광역시 연제구 연안로13번길 57, 3층 (연산동)2019-12-13051-638-3500051-638-5300
8485해동그린파워펌프부산광역시 연제구 월드컵대로188번길 33 (거제동)2021-06-02051-863-3600051-861-6666
8586현대미디어부산광역시 연제구 월드컵대로 34-1 (연산동)2011-12-22051-868-2022<NA>
8687형제흑판사부산광역시 연제구 거제대로 294 (거제동)2009-03-30051-864-7802051-863-2688
8788혜정섬유부산광역시 연제구 아시아드대로22번길 3 (거제동) 지하1층2020-12-09051-504-2806051-503-2806
8889화성아이티주식회사부산광역시 연제구 거제대로 128-9, 302호(거제동)2022-05-20051-852-9690051-852-9692
8990휴먼코어젠주식회사부산광역시 연제구 쌍미천로84번길 85 (연산동)2021-06-23<NA>050-4064-2625