Overview

Dataset statistics

Number of variables3
Number of observations202
Missing cells0
Missing cells (%)0.0%
Duplicate rows7
Duplicate rows (%)3.5%
Total size in memory5.1 KiB
Average record size in memory25.7 B

Variable types

Categorical1
Text1
Numeric1

Dataset

Description경기도_광교테크노벨리(기관 입주업체) 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=AEE19U270VR7X367MD1W15106558&infSeq=1

Alerts

Dataset has 7 (3.5%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 21:56:39.536899
Analysis finished2023-12-10 21:56:39.868100
Duration0.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리기관명
Categorical

Distinct5
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
경기도경제과학진흥원(경기R&DB센터)
62 
경기도경제과학진흥원(경기중소기업센터)
52 
차세대융합기술연구원
34 
경기도경제과학진흥원(경기바이오센터)
29 
한국나노기술원
25 

Length

Max length20
Median length20
Mean length16.564356
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국나노기술원
2nd row한국나노기술원
3rd row한국나노기술원
4th row한국나노기술원
5th row한국나노기술원

Common Values

ValueCountFrequency (%)
경기도경제과학진흥원(경기R&DB센터) 62
30.7%
경기도경제과학진흥원(경기중소기업센터) 52
25.7%
차세대융합기술연구원 34
16.8%
경기도경제과학진흥원(경기바이오센터) 29
14.4%
한국나노기술원 25
12.4%

Length

2023-12-11T06:56:39.942840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:56:40.061658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도경제과학진흥원(경기r&db센터 62
30.7%
경기도경제과학진흥원(경기중소기업센터 52
25.7%
차세대융합기술연구원 34
16.8%
경기도경제과학진흥원(경기바이오센터 29
14.4%
한국나노기술원 25
12.4%
Distinct167
Distinct (%)82.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T06:56:40.248663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.8960396
Min length2

Characters and Unicode

Total characters1595
Distinct characters261
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)74.3%

Sample

1st row영진약품㈜
2nd row암페놀센싱코리아(유)
3rd row㈜센플러스
4th row㈜오킨스전자
5th row㈜에이프로
ValueCountFrequency (%)
경기신용보증재단 13
 
5.8%
경기지역본부 9
 
4.0%
한국산업안전보건공단 6
 
2.7%
주식회사 5
 
2.2%
미래회계법인 4
 
1.8%
중소벤처기업진흥공단 3
 
1.3%
가람푸드서비스 3
 
1.3%
광교 2
 
0.9%
경기도 2
 
0.9%
한국수출입은행 2
 
0.9%
Other values (164) 175
78.1%
2023-12-11T06:56:40.547148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
5.0%
54
 
3.4%
51
 
3.2%
49
 
3.1%
39
 
2.4%
29
 
1.8%
25
 
1.6%
25
 
1.6%
24
 
1.5%
) 24
 
1.5%
Other values (251) 1196
75.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1439
90.2%
Other Symbol 79
 
5.0%
Close Punctuation 24
 
1.5%
Open Punctuation 24
 
1.5%
Space Separator 22
 
1.4%
Uppercase Letter 4
 
0.3%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
3.8%
51
 
3.5%
49
 
3.4%
39
 
2.7%
29
 
2.0%
25
 
1.7%
25
 
1.7%
24
 
1.7%
24
 
1.7%
23
 
1.6%
Other values (241) 1096
76.2%
Uppercase Letter
ValueCountFrequency (%)
T 1
25.0%
S 1
25.0%
I 1
25.0%
K 1
25.0%
Other Symbol
ValueCountFrequency (%)
79
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1518
95.2%
Common 73
 
4.6%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
5.2%
54
 
3.6%
51
 
3.4%
49
 
3.2%
39
 
2.6%
29
 
1.9%
25
 
1.6%
25
 
1.6%
24
 
1.6%
24
 
1.6%
Other values (242) 1119
73.7%
Common
ValueCountFrequency (%)
) 24
32.9%
( 24
32.9%
22
30.1%
. 2
 
2.7%
- 1
 
1.4%
Latin
ValueCountFrequency (%)
T 1
25.0%
S 1
25.0%
I 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1438
90.2%
None 79
 
5.0%
ASCII 77
 
4.8%
Compat Jamo 1
 
0.1%

Most frequent character per block

None
ValueCountFrequency (%)
79
100.0%
Hangul
ValueCountFrequency (%)
54
 
3.8%
51
 
3.5%
49
 
3.4%
39
 
2.7%
29
 
2.0%
25
 
1.7%
25
 
1.7%
24
 
1.7%
24
 
1.7%
23
 
1.6%
Other values (240) 1095
76.1%
ASCII
ValueCountFrequency (%)
) 24
31.2%
( 24
31.2%
22
28.6%
. 2
 
2.6%
T 1
 
1.3%
S 1
 
1.3%
I 1
 
1.3%
K 1
 
1.3%
- 1
 
1.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

임대면적(㎡)
Real number (ℝ)

Distinct179
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean559.22748
Minimum14.4
Maximum10756.21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-11T06:56:40.853558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14.4
5-th percentile44.615
Q1102.51
median201.07
Q3683.1925
95-th percentile1891.2505
Maximum10756.21
Range10741.81
Interquartile range (IQR)580.6825

Descriptive statistics

Standard deviation985.11201
Coefficient of variation (CV)1.7615587
Kurtosis57.889333
Mean559.22748
Median Absolute Deviation (MAD)143.97
Skewness6.2724432
Sum112963.95
Variance970445.66
MonotonicityNot monotonic
2023-12-11T06:56:40.959470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
61.2 5
 
2.5%
1303.01 3
 
1.5%
154.67 3
 
1.5%
68.53 2
 
1.0%
70.1 2
 
1.0%
683.87 2
 
1.0%
530.5 2
 
1.0%
103.11 2
 
1.0%
374.55 2
 
1.0%
67.0 2
 
1.0%
Other values (169) 177
87.6%
ValueCountFrequency (%)
14.4 1
0.5%
17.22 1
0.5%
18.1 1
0.5%
19.7 1
0.5%
22.4 1
0.5%
23.9 1
0.5%
24.9 1
0.5%
26.0 1
0.5%
40.4 1
0.5%
40.9 1
0.5%
ValueCountFrequency (%)
10756.21 1
0.5%
4241.02 1
0.5%
3373.0 1
0.5%
3125.0 1
0.5%
3040.49 1
0.5%
2203.22 1
0.5%
2179.4 1
0.5%
2114.35 1
0.5%
2036.9 1
0.5%
1903.4 1
0.5%

Interactions

2023-12-11T06:56:39.699532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:56:41.024059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관명임대면적(㎡)
관리기관명1.0000.361
임대면적(㎡)0.3611.000
2023-12-11T06:56:41.087690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
임대면적(㎡)관리기관명
임대면적(㎡)1.0000.141
관리기관명0.1411.000

Missing values

2023-12-11T06:56:39.783048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:56:39.837929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리기관명업체명임대면적(㎡)
0한국나노기술원영진약품㈜1771.43
1한국나노기술원암페놀센싱코리아(유)264.61
2한국나노기술원㈜센플러스448.63
3한국나노기술원㈜오킨스전자107.12
4한국나노기술원㈜에이프로107.65
5한국나노기술원인테그리스코리아(유)2203.22
6한국나노기술원캐봇마이크로일렉트로닉스458.83
7한국나노기술원(사)기술혁신협회287.56
8한국나노기술원㈜아이디스107.02
9한국나노기술원럭스피아㈜180.53
관리기관명업체명임대면적(㎡)
192경기도경제과학진흥원(경기중소기업센터)한국수출입은행683.87
193경기도경제과학진흥원(경기중소기업센터)한국산업안전보건공단 경기지역본부2036.9
194경기도경제과학진흥원(경기중소기업센터)경기신용보증재단3040.49
195경기도경제과학진흥원(경기중소기업센터)경기도경제과학진흥원10756.21
196경기도경제과학진흥원(경기중소기업센터)경기벤처협회222.08
197경기도경제과학진흥원(경기중소기업센터)경기도외투기업협의회187.74
198경기도경제과학진흥원(경기중소기업센터)미래회계법인1348.64
199경기도경제과학진흥원(경기중소기업센터)미래경영연구원102.84
200경기도경제과학진흥원(경기중소기업센터)다온국제특허116.73
201경기도경제과학진흥원(경기중소기업센터)관세법인 세명(수원관세사무소)156.83

Duplicate rows

Most frequently occurring

관리기관명업체명임대면적(㎡)# duplicates
0경기도경제과학진흥원(경기R&DB센터)경기신용보증재단61.22
1경기도경제과학진흥원(경기중소기업센터)경기벤처협회222.082
2경기도경제과학진흥원(경기중소기업센터)다온국제특허116.732
3경기도경제과학진흥원(경기중소기업센터)중소벤처기업진흥공단 경기지역본부1303.012
4경기도경제과학진흥원(경기중소기업센터)한국무역보험공사530.52
5경기도경제과학진흥원(경기중소기업센터)한국무역협회415.142
6경기도경제과학진흥원(경기중소기업센터)한국수출입은행683.872