Overview

Dataset statistics

Number of variables5
Number of observations175
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory41.8 B

Variable types

Numeric1
Text4

Dataset

Description경기도 연천군 공장등록현황을 개방하여 지역산업을 파악하여 제조할 수 있도록 해주는 서비스입니다.(번호,회사명,소재지,전화번호,종업원수,업종명)
Author경기도 연천군
URLhttps://www.data.go.kr/data/15000609/fileData.do

Alerts

번호 has unique valuesUnique
회사명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:03:35.269894
Analysis finished2023-12-12 12:03:35.837933
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct175
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean88
Minimum1
Maximum175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-12T21:03:35.910159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.7
Q144.5
median88
Q3131.5
95-th percentile166.3
Maximum175
Range174
Interquartile range (IQR)87

Descriptive statistics

Standard deviation50.662281
Coefficient of variation (CV)0.57570773
Kurtosis-1.2
Mean88
Median Absolute Deviation (MAD)44
Skewness0
Sum15400
Variance2566.6667
MonotonicityStrictly increasing
2023-12-12T21:03:36.049331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
2 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
117 1
 
0.6%
118 1
 
0.6%
119 1
 
0.6%
120 1
 
0.6%
Other values (165) 165
94.3%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
175 1
0.6%
174 1
0.6%
173 1
0.6%
172 1
0.6%
171 1
0.6%
170 1
0.6%
169 1
0.6%
168 1
0.6%
167 1
0.6%
166 1
0.6%

회사명
Text

UNIQUE 

Distinct175
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T21:03:36.282164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length7.1542857
Min length2

Characters and Unicode

Total characters1252
Distinct characters242
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique175 ?
Unique (%)100.0%

Sample

1st row(주)거래금속
2nd row(주)건호텍스타일
3rd row(주)광동
4th row(주)그린환경
5th row(주)금주나
ValueCountFrequency (%)
주식회사 31
 
13.9%
농업회사법인 5
 
2.2%
삼호 1
 
0.4%
부창텍스 1
 
0.4%
연천금속 1
 
0.4%
유창농산주식회사 1
 
0.4%
연천주조 1
 
0.4%
금광산업 1
 
0.4%
이화바이오 1
 
0.4%
장남 1
 
0.4%
Other values (179) 179
80.3%
2023-12-12T21:03:36.670746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
102
 
8.1%
( 58
 
4.6%
) 58
 
4.6%
51
 
4.1%
48
 
3.8%
48
 
3.8%
46
 
3.7%
29
 
2.3%
28
 
2.2%
23
 
1.8%
Other values (232) 761
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1058
84.5%
Open Punctuation 58
 
4.6%
Close Punctuation 58
 
4.6%
Space Separator 48
 
3.8%
Lowercase Letter 12
 
1.0%
Uppercase Letter 8
 
0.6%
Other Symbol 7
 
0.6%
Decimal Number 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
102
 
9.6%
51
 
4.8%
48
 
4.5%
46
 
4.3%
29
 
2.7%
28
 
2.6%
23
 
2.2%
21
 
2.0%
18
 
1.7%
17
 
1.6%
Other values (211) 675
63.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
25.0%
n 2
16.7%
i 2
16.7%
l 1
 
8.3%
c 1
 
8.3%
s 1
 
8.3%
a 1
 
8.3%
m 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
G 2
25.0%
H 1
12.5%
C 1
12.5%
S 1
12.5%
I 1
12.5%
N 1
12.5%
K 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Space Separator
ValueCountFrequency (%)
48
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1064
85.0%
Common 167
 
13.3%
Latin 20
 
1.6%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
 
9.6%
51
 
4.8%
48
 
4.5%
46
 
4.3%
29
 
2.7%
28
 
2.6%
23
 
2.2%
21
 
2.0%
18
 
1.7%
17
 
1.6%
Other values (211) 681
64.0%
Latin
ValueCountFrequency (%)
e 3
15.0%
G 2
 
10.0%
n 2
 
10.0%
i 2
 
10.0%
H 1
 
5.0%
C 1
 
5.0%
S 1
 
5.0%
I 1
 
5.0%
N 1
 
5.0%
K 1
 
5.0%
Other values (5) 5
25.0%
Common
ValueCountFrequency (%)
( 58
34.7%
) 58
34.7%
48
28.7%
2 2
 
1.2%
& 1
 
0.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1057
84.4%
ASCII 187
 
14.9%
None 7
 
0.6%
CJK 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
102
 
9.6%
51
 
4.8%
48
 
4.5%
46
 
4.4%
29
 
2.7%
28
 
2.6%
23
 
2.2%
21
 
2.0%
18
 
1.7%
17
 
1.6%
Other values (210) 674
63.8%
ASCII
ValueCountFrequency (%)
( 58
31.0%
) 58
31.0%
48
25.7%
e 3
 
1.6%
G 2
 
1.1%
2 2
 
1.1%
n 2
 
1.1%
i 2
 
1.1%
& 1
 
0.5%
H 1
 
0.5%
Other values (10) 10
 
5.3%
None
ValueCountFrequency (%)
7
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct163
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T21:03:37.135135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length21.342857
Min length17

Characters and Unicode

Total characters3735
Distinct characters75
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)86.9%

Sample

1st row경기도 연천군 백학면 백학산단길 126-43
2nd row경기도 연천군 청산면 초대로 217-55
3rd row경기도 연천군 전곡읍 선사로81번길 55
4th row경기도 연천군 미산면 마유로143번길 139-30
5th row경기도 연천군 청산면 초대로 244-16
ValueCountFrequency (%)
경기도 175
20.0%
연천군 175
20.0%
백학면 49
 
5.6%
전곡읍 45
 
5.1%
백학산단길 43
 
4.9%
청산면 41
 
4.7%
초대로 20
 
2.3%
양연로 16
 
1.8%
군남면 15
 
1.7%
연천읍 11
 
1.3%
Other values (214) 286
32.6%
2023-12-12T21:03:37.693573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
701
18.8%
227
 
6.1%
194
 
5.2%
186
 
5.0%
177
 
4.7%
175
 
4.7%
175
 
4.7%
1 133
 
3.6%
121
 
3.2%
119
 
3.2%
Other values (65) 1527
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2277
61.0%
Decimal Number 702
 
18.8%
Space Separator 701
 
18.8%
Dash Punctuation 55
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
227
 
10.0%
194
 
8.5%
186
 
8.2%
177
 
7.8%
175
 
7.7%
175
 
7.7%
121
 
5.3%
119
 
5.2%
97
 
4.3%
96
 
4.2%
Other values (53) 710
31.2%
Decimal Number
ValueCountFrequency (%)
1 133
18.9%
2 109
15.5%
4 68
9.7%
3 67
9.5%
6 63
9.0%
5 60
8.5%
0 55
7.8%
8 50
 
7.1%
9 50
 
7.1%
7 47
 
6.7%
Space Separator
ValueCountFrequency (%)
701
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2277
61.0%
Common 1458
39.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
227
 
10.0%
194
 
8.5%
186
 
8.2%
177
 
7.8%
175
 
7.7%
175
 
7.7%
121
 
5.3%
119
 
5.2%
97
 
4.3%
96
 
4.2%
Other values (53) 710
31.2%
Common
ValueCountFrequency (%)
701
48.1%
1 133
 
9.1%
2 109
 
7.5%
4 68
 
4.7%
3 67
 
4.6%
6 63
 
4.3%
5 60
 
4.1%
0 55
 
3.8%
- 55
 
3.8%
8 50
 
3.4%
Other values (2) 97
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2277
61.0%
ASCII 1458
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
701
48.1%
1 133
 
9.1%
2 109
 
7.5%
4 68
 
4.7%
3 67
 
4.6%
6 63
 
4.3%
5 60
 
4.1%
0 55
 
3.8%
- 55
 
3.8%
8 50
 
3.4%
Other values (2) 97
 
6.7%
Hangul
ValueCountFrequency (%)
227
 
10.0%
194
 
8.5%
186
 
8.2%
177
 
7.8%
175
 
7.7%
175
 
7.7%
121
 
5.3%
119
 
5.2%
97
 
4.3%
96
 
4.2%
Other values (53) 710
31.2%
Distinct161
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T21:03:38.014522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.754286
Min length7

Characters and Unicode

Total characters2057
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)88.6%

Sample

1st row031-944-9841
2nd row031-832-5665
3rd row031-832-3134
4th row031-833-7033
5th row031-832-9640
ValueCountFrequency (%)
개인휴대폰번호 10
 
5.7%
031-833-0420 2
 
1.1%
031-835-1872 2
 
1.1%
031-832-0017 2
 
1.1%
031-835-7793 2
 
1.1%
031-832-6366 2
 
1.1%
031-867-9100 1
 
0.6%
031-832-0241 1
 
0.6%
031-833-7112 1
 
0.6%
031-834-1005 1
 
0.6%
Other values (151) 151
86.3%
2023-12-12T21:03:38.467674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 399
19.4%
- 330
16.0%
0 275
13.4%
1 237
11.5%
8 220
10.7%
2 125
 
6.1%
5 123
 
6.0%
6 85
 
4.1%
7 74
 
3.6%
4 72
 
3.5%
Other values (8) 117
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1657
80.6%
Dash Punctuation 330
 
16.0%
Other Letter 70
 
3.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 399
24.1%
0 275
16.6%
1 237
14.3%
8 220
13.3%
2 125
 
7.5%
5 123
 
7.4%
6 85
 
5.1%
7 74
 
4.5%
4 72
 
4.3%
9 47
 
2.8%
Other Letter
ValueCountFrequency (%)
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 330
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1987
96.6%
Hangul 70
 
3.4%

Most frequent character per script

Common
ValueCountFrequency (%)
3 399
20.1%
- 330
16.6%
0 275
13.8%
1 237
11.9%
8 220
11.1%
2 125
 
6.3%
5 123
 
6.2%
6 85
 
4.3%
7 74
 
3.7%
4 72
 
3.6%
Hangul
ValueCountFrequency (%)
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1987
96.6%
Hangul 70
 
3.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 399
20.1%
- 330
16.6%
0 275
13.8%
1 237
11.9%
8 220
11.1%
2 125
 
6.3%
5 123
 
6.2%
6 85
 
4.3%
7 74
 
3.7%
4 72
 
3.6%
Hangul
ValueCountFrequency (%)
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
10
14.3%
Distinct139
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T21:03:38.678709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length7.3942857
Min length1

Characters and Unicode

Total characters1294
Distinct characters313
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)71.4%

Sample

1st row도장+기타피막처리
2nd row섬유염색
3rd row계측기기+유량계+자동제어장치
4th row가금사료
5th row섬유염색
ValueCountFrequency (%)
섬유염색 17
 
9.7%
5
 
2.9%
레미콘 4
 
2.3%
콘크리트옹벽블록 3
 
1.7%
먹는샘물 3
 
1.7%
홍삼액기스 2
 
1.1%
점토벽돌 2
 
1.1%
김치류 2
 
1.1%
합성수지지붕 2
 
1.1%
유지+수지막 2
 
1.1%
Other values (129) 133
76.0%
2023-12-12T21:03:39.061066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 107
 
8.3%
38
 
2.9%
33
 
2.6%
31
 
2.4%
26
 
2.0%
25
 
1.9%
21
 
1.6%
19
 
1.5%
19
 
1.5%
18
 
1.4%
Other values (303) 957
74.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1165
90.0%
Math Symbol 107
 
8.3%
Uppercase Letter 15
 
1.2%
Lowercase Letter 7
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
3.3%
33
 
2.8%
31
 
2.7%
26
 
2.2%
25
 
2.1%
21
 
1.8%
19
 
1.6%
19
 
1.6%
18
 
1.5%
18
 
1.5%
Other values (286) 917
78.7%
Uppercase Letter
ValueCountFrequency (%)
P 4
26.7%
R 2
13.3%
T 2
13.3%
C 2
13.3%
M 1
 
6.7%
E 1
 
6.7%
H 1
 
6.7%
V 1
 
6.7%
F 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
x 1
14.3%
e 1
14.3%
d 1
14.3%
n 1
14.3%
a 1
14.3%
p 1
14.3%
s 1
14.3%
Math Symbol
ValueCountFrequency (%)
+ 107
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1165
90.0%
Common 107
 
8.3%
Latin 22
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
3.3%
33
 
2.8%
31
 
2.7%
26
 
2.2%
25
 
2.1%
21
 
1.8%
19
 
1.6%
19
 
1.6%
18
 
1.5%
18
 
1.5%
Other values (286) 917
78.7%
Latin
ValueCountFrequency (%)
P 4
18.2%
R 2
 
9.1%
T 2
 
9.1%
C 2
 
9.1%
x 1
 
4.5%
e 1
 
4.5%
d 1
 
4.5%
M 1
 
4.5%
n 1
 
4.5%
a 1
 
4.5%
Other values (6) 6
27.3%
Common
ValueCountFrequency (%)
+ 107
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1165
90.0%
ASCII 129
 
10.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 107
82.9%
P 4
 
3.1%
R 2
 
1.6%
T 2
 
1.6%
C 2
 
1.6%
x 1
 
0.8%
e 1
 
0.8%
d 1
 
0.8%
M 1
 
0.8%
n 1
 
0.8%
Other values (7) 7
 
5.4%
Hangul
ValueCountFrequency (%)
38
 
3.3%
33
 
2.8%
31
 
2.7%
26
 
2.2%
25
 
2.1%
21
 
1.8%
19
 
1.6%
19
 
1.6%
18
 
1.5%
18
 
1.5%
Other values (286) 917
78.7%

Interactions

2023-12-12T21:03:35.567490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T21:03:35.683501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:03:35.795780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호회사명소재지전화번호생산품
01(주)거래금속경기도 연천군 백학면 백학산단길 126-43031-944-9841도장+기타피막처리
12(주)건호텍스타일경기도 연천군 청산면 초대로 217-55031-832-5665섬유염색
23(주)광동경기도 연천군 전곡읍 선사로81번길 55031-832-3134계측기기+유량계+자동제어장치
34(주)그린환경경기도 연천군 미산면 마유로143번길 139-30031-833-7033가금사료
45(주)금주나경기도 연천군 청산면 초대로 244-16031-832-9640섬유염색
56(주)금호니트경기도 연천군 청산면 초대로 208-42031-833-5511섬유염색
67(주)대성실업경기도 연천군 청산면 초대로 217-10031-832-3001섬유염색
78(주)대원산업경기도 연천군 청산면 초대로 201-12031-833-3200섬유염색
89(주)동원에프앤비연천공장경기도 연천군 청산면 순욱길 256031-832-8813먹는샘물
910(주)두다경기도 연천군 신서면 도내로 28-21031-834-9938기상액상정화정제용필터+연속기화식재생기
번호회사명소재지전화번호생산품
165166한백파워콤경기도 연천군 군남면 진은로 480031-832-3774배전반+분전반+변압기반
166167한빈실업경기도 연천군 청산면 초대로 2020507-1316-1897섬유염색
167168한씨가원경기도 연천군 장남면 장백로330번길 192-1개인휴대폰번호참기름+생들기름
168169호아경기도 연천군 백학면 백학산단길 356031-861-4075접착제
169170홍은금속㈜경기도 연천군 백학면 백학산단길 385031-988-6627알루미늄비레트
170171화성산업파이프경기도 연천군 전곡읍 양연로292번길 39031-832-9413PE파이프
171172화성섬유경기도 연천군 청산면 전영로 441031-832-0067편조원단+맞춤의류제작
172173(주)휴먼소재경기도 연천군 연천읍 은통산단1길 86개인휴대폰번호플라스틱필름
173174흥일경기도 연천군 청산면 초대로 208-36031-833-5772섬유염색
174175현명경기도 연천군 연천읍 지혜로 68개인휴대폰번호보행매트