Overview

Dataset statistics

Number of variables5
Number of observations1962
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory80.6 KiB
Average record size in memory42.1 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description인천광역시 기업지원 맞춤형 원스톱 지원 시스템(bizok) 내 가입되어있는 기업회원 현황(기업명, 대표자명, 소재지, 가입년도) 목록 자료입니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15049268/fileData.do

Alerts

번호 is highly overall correlated with 가입년도High correlation
가입년도 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:03:50.702021
Analysis finished2023-12-12 21:03:51.881866
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1962
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean981.5
Minimum1
Maximum1962
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.4 KiB
2023-12-13T06:03:51.980225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile99.05
Q1491.25
median981.5
Q31471.75
95-th percentile1863.95
Maximum1962
Range1961
Interquartile range (IQR)980.5

Descriptive statistics

Standard deviation566.52493
Coefficient of variation (CV)0.57720319
Kurtosis-1.2
Mean981.5
Median Absolute Deviation (MAD)490.5
Skewness0
Sum1925703
Variance320950.5
MonotonicityStrictly increasing
2023-12-13T06:03:52.148564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1319 1
 
0.1%
1317 1
 
0.1%
1316 1
 
0.1%
1315 1
 
0.1%
1314 1
 
0.1%
1313 1
 
0.1%
1312 1
 
0.1%
1311 1
 
0.1%
1310 1
 
0.1%
Other values (1952) 1952
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1962 1
0.1%
1961 1
0.1%
1960 1
0.1%
1959 1
0.1%
1958 1
0.1%
1957 1
0.1%
1956 1
0.1%
1955 1
0.1%
1954 1
0.1%
1953 1
0.1%
Distinct1891
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
2023-12-13T06:03:52.546861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length21
Mean length7.6727829
Min length1

Characters and Unicode

Total characters15054
Distinct characters638
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1830 ?
Unique (%)93.3%

Sample

1st row한테크
2nd row(주)베스트웨이코리아
3rd row
4th row티아테크
5th row(주)심테크
ValueCountFrequency (%)
주식회사 523
 
20.1%
19
 
0.7%
농업회사법인 7
 
0.3%
인천테크노파크 7
 
0.3%
유한회사 5
 
0.2%
주)유진로봇 4
 
0.2%
코퍼레이션 4
 
0.2%
미가에스티 3
 
0.1%
주)빅텍스 3
 
0.1%
진성원(주 3
 
0.1%
Other values (1953) 2020
77.8%
2023-12-13T06:03:53.037160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1244
 
8.3%
701
 
4.7%
651
 
4.3%
643
 
4.3%
636
 
4.2%
) 622
 
4.1%
( 618
 
4.1%
527
 
3.5%
437
 
2.9%
243
 
1.6%
Other values (628) 8732
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12849
85.4%
Space Separator 636
 
4.2%
Close Punctuation 622
 
4.1%
Open Punctuation 618
 
4.1%
Uppercase Letter 208
 
1.4%
Lowercase Letter 70
 
0.5%
Other Punctuation 24
 
0.2%
Decimal Number 24
 
0.2%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1244
 
9.7%
701
 
5.5%
651
 
5.1%
643
 
5.0%
527
 
4.1%
437
 
3.4%
243
 
1.9%
182
 
1.4%
182
 
1.4%
171
 
1.3%
Other values (570) 7868
61.2%
Uppercase Letter
ValueCountFrequency (%)
N 19
 
9.1%
E 18
 
8.7%
C 17
 
8.2%
T 16
 
7.7%
A 14
 
6.7%
O 13
 
6.2%
S 11
 
5.3%
G 10
 
4.8%
K 9
 
4.3%
M 9
 
4.3%
Other values (13) 72
34.6%
Lowercase Letter
ValueCountFrequency (%)
i 10
14.3%
n 7
10.0%
o 7
10.0%
l 7
10.0%
c 5
 
7.1%
a 5
 
7.1%
t 4
 
5.7%
r 4
 
5.7%
d 4
 
5.7%
e 3
 
4.3%
Other values (9) 14
20.0%
Decimal Number
ValueCountFrequency (%)
1 7
29.2%
2 4
16.7%
0 4
16.7%
5 3
12.5%
9 3
12.5%
4 2
 
8.3%
3 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 12
50.0%
& 7
29.2%
, 3
 
12.5%
/ 1
 
4.2%
' 1
 
4.2%
Space Separator
ValueCountFrequency (%)
636
100.0%
Close Punctuation
ValueCountFrequency (%)
) 622
100.0%
Open Punctuation
ValueCountFrequency (%)
( 618
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12847
85.3%
Common 1927
 
12.8%
Latin 278
 
1.8%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1244
 
9.7%
701
 
5.5%
651
 
5.1%
643
 
5.0%
527
 
4.1%
437
 
3.4%
243
 
1.9%
182
 
1.4%
182
 
1.4%
171
 
1.3%
Other values (568) 7866
61.2%
Latin
ValueCountFrequency (%)
N 19
 
6.8%
E 18
 
6.5%
C 17
 
6.1%
T 16
 
5.8%
A 14
 
5.0%
O 13
 
4.7%
S 11
 
4.0%
G 10
 
3.6%
i 10
 
3.6%
K 9
 
3.2%
Other values (32) 141
50.7%
Common
ValueCountFrequency (%)
636
33.0%
) 622
32.3%
( 618
32.1%
. 12
 
0.6%
& 7
 
0.4%
1 7
 
0.4%
2 4
 
0.2%
0 4
 
0.2%
5 3
 
0.2%
9 3
 
0.2%
Other values (6) 11
 
0.6%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12847
85.3%
ASCII 2205
 
14.6%
CJK 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1244
 
9.7%
701
 
5.5%
651
 
5.1%
643
 
5.0%
527
 
4.1%
437
 
3.4%
243
 
1.9%
182
 
1.4%
182
 
1.4%
171
 
1.3%
Other values (568) 7866
61.2%
ASCII
ValueCountFrequency (%)
636
28.8%
) 622
28.2%
( 618
28.0%
N 19
 
0.9%
E 18
 
0.8%
C 17
 
0.8%
T 16
 
0.7%
A 14
 
0.6%
O 13
 
0.6%
. 12
 
0.5%
Other values (48) 220
 
10.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct1865
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
2023-12-13T06:03:53.384709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length3
Mean length3.0519878
Min length2

Characters and Unicode

Total characters5988
Distinct characters274
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1783 ?
Unique (%)90.9%

Sample

1st row김병준
2nd row이춘원
3rd row김석범
4th row이현규
5th row심상중
ValueCountFrequency (%)
박현정 4
 
0.2%
김지훈 4
 
0.2%
김태형 4
 
0.2%
정진호 3
 
0.2%
김미정 3
 
0.2%
정지현 3
 
0.2%
이유진 3
 
0.2%
박정미 3
 
0.2%
최윤정 3
 
0.2%
김태용 3
 
0.2%
Other values (1867) 1942
98.3%
2023-12-13T06:03:53.791051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
395
 
6.6%
266
 
4.4%
221
 
3.7%
177
 
3.0%
168
 
2.8%
144
 
2.4%
127
 
2.1%
118
 
2.0%
105
 
1.8%
104
 
1.7%
Other values (264) 4163
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5834
97.4%
Uppercase Letter 114
 
1.9%
Lowercase Letter 26
 
0.4%
Space Separator 13
 
0.2%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
395
 
6.8%
266
 
4.6%
221
 
3.8%
177
 
3.0%
168
 
2.9%
144
 
2.5%
127
 
2.2%
118
 
2.0%
105
 
1.8%
104
 
1.8%
Other values (225) 4009
68.7%
Uppercase Letter
ValueCountFrequency (%)
I 17
14.9%
N 16
14.0%
A 9
 
7.9%
E 8
 
7.0%
H 7
 
6.1%
O 7
 
6.1%
U 6
 
5.3%
Y 6
 
5.3%
R 6
 
5.3%
P 5
 
4.4%
Other values (13) 27
23.7%
Lowercase Letter
ValueCountFrequency (%)
n 5
19.2%
e 4
15.4%
a 3
11.5%
u 2
 
7.7%
g 2
 
7.7%
k 2
 
7.7%
i 1
 
3.8%
m 1
 
3.8%
h 1
 
3.8%
c 1
 
3.8%
Other values (4) 4
15.4%
Space Separator
ValueCountFrequency (%)
13
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5834
97.4%
Latin 140
 
2.3%
Common 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
395
 
6.8%
266
 
4.6%
221
 
3.8%
177
 
3.0%
168
 
2.9%
144
 
2.5%
127
 
2.2%
118
 
2.0%
105
 
1.8%
104
 
1.8%
Other values (225) 4009
68.7%
Latin
ValueCountFrequency (%)
I 17
 
12.1%
N 16
 
11.4%
A 9
 
6.4%
E 8
 
5.7%
H 7
 
5.0%
O 7
 
5.0%
U 6
 
4.3%
Y 6
 
4.3%
R 6
 
4.3%
P 5
 
3.6%
Other values (27) 53
37.9%
Common
ValueCountFrequency (%)
13
92.9%
1 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5834
97.4%
ASCII 154
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
395
 
6.8%
266
 
4.6%
221
 
3.8%
177
 
3.0%
168
 
2.9%
144
 
2.5%
127
 
2.2%
118
 
2.0%
105
 
1.8%
104
 
1.8%
Other values (225) 4009
68.7%
ASCII
ValueCountFrequency (%)
I 17
 
11.0%
N 16
 
10.4%
13
 
8.4%
A 9
 
5.8%
E 8
 
5.2%
H 7
 
4.5%
O 7
 
4.5%
U 6
 
3.9%
Y 6
 
3.9%
R 6
 
3.9%
Other values (29) 59
38.3%
Distinct65
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
2023-12-13T06:03:53.937390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length5.7497452
Min length5

Characters and Unicode

Total characters11281
Distinct characters71
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)1.5%

Sample

1st row서울 강남구
2nd row인천 서구
3rd row인천 부평구
4th row인천 미추홀구
5th row인천 서구
ValueCountFrequency (%)
인천 1834
46.7%
서구 522
 
13.3%
남동구 437
 
11.1%
연수구 262
 
6.7%
부평구 221
 
5.6%
미추홀구 142
 
3.6%
계양구 107
 
2.7%
중구 72
 
1.8%
경기 68
 
1.7%
동구 45
 
1.1%
Other values (64) 214
 
5.5%
2023-12-13T06:03:54.197959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1962
17.4%
1862
16.5%
1848
16.4%
1837
16.3%
568
 
5.0%
486
 
4.3%
453
 
4.0%
267
 
2.4%
262
 
2.3%
236
 
2.1%
Other values (61) 1500
13.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9319
82.6%
Space Separator 1962
 
17.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1862
20.0%
1848
19.8%
1837
19.7%
568
 
6.1%
486
 
5.2%
453
 
4.9%
267
 
2.9%
262
 
2.8%
236
 
2.5%
223
 
2.4%
Other values (60) 1277
13.7%
Space Separator
ValueCountFrequency (%)
1962
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9319
82.6%
Common 1962
 
17.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1862
20.0%
1848
19.8%
1837
19.7%
568
 
6.1%
486
 
5.2%
453
 
4.9%
267
 
2.9%
262
 
2.8%
236
 
2.5%
223
 
2.4%
Other values (60) 1277
13.7%
Common
ValueCountFrequency (%)
1962
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9319
82.6%
ASCII 1962
 
17.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1962
100.0%
Hangul
ValueCountFrequency (%)
1862
20.0%
1848
19.8%
1837
19.7%
568
 
6.1%
486
 
5.2%
453
 
4.9%
267
 
2.9%
262
 
2.8%
236
 
2.5%
223
 
2.4%
Other values (60) 1277
13.7%

가입년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.5 KiB
2022
1182 
<NA>
780 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 1182
60.2%
<NA> 780
39.8%

Length

2023-12-13T06:03:54.302628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:03:54.383723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 1182
60.2%
na 780
39.8%

Interactions

2023-12-13T06:03:51.586062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:03:54.442993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호소재지
번호1.0000.143
소재지0.1431.000
2023-12-13T06:03:54.525800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호가입년도
번호1.0001.000
가입년도1.0001.000

Missing values

2023-12-13T06:03:51.720692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:03:51.830249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호법인(기업명)대표자 명소재지가입년도
01한테크김병준서울 강남구2022
12(주)베스트웨이코리아이춘원인천 서구2022
23김석범인천 부평구2022
34티아테크이현규인천 미추홀구2022
45(주)심테크심상중인천 서구2022
56(주)나무와나무정유혜인천 서구2022
67인천테크노파크이명윤인천 연수구2022
78다온테크이상복인천 남동구2022
89ST시스템최재혁인천 연수구2022
910대성기공함세호인천 서구2022
번호법인(기업명)대표자 명소재지가입년도
19521953기상산업개발염경수경기 부천시<NA>
19531954(주)엠에스벤터오만석인천 연수구<NA>
19541955(주)태명기연최은영인천 남동구<NA>
19551956(주)덕수산업구남수인천 남동구<NA>
19561957킴스비닐포장김천희서울 중구<NA>
19571958주식회사 코보김필범인천 서구<NA>
19581959(주)더온플랫폼윤석배인천 남동구<NA>
19591960마이 패스포트조찬휘인천 미추홀구<NA>
19601961주식회사 삼양휴텍문석용인천 서구<NA>
19611962금광산업홍정우경기 부천시<NA>