Overview

Dataset statistics

Number of variables4
Number of observations101
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory35.3 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description중소벤처기업부 및 창업진흥원이 운영하는 판교창업존 입주기업 현황 정보 자료. 순번, 구분, 기업명, 사업분야, 호실
Author창업진흥원
URLhttps://www.data.go.kr/data/15088134/fileData.do

Alerts

순번 is highly overall correlated with 입주호수High correlation
입주호수 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique
입주기업명 has unique valuesUnique
입주호수 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:11:33.526599
Analysis finished2023-12-12 07:11:34.212080
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct101
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51
Minimum1
Maximum101
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T16:11:34.312108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q126
median51
Q376
95-th percentile96
Maximum101
Range100
Interquartile range (IQR)50

Descriptive statistics

Standard deviation29.300171
Coefficient of variation (CV)0.57451315
Kurtosis-1.2
Mean51
Median Absolute Deviation (MAD)25
Skewness0
Sum5151
Variance858.5
MonotonicityStrictly increasing
2023-12-12T16:11:34.515422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (91) 91
90.1%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%

입주기업명
Text

UNIQUE 

Distinct101
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size940.0 B
2023-12-12T16:11:34.874691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length8.9405941
Min length3

Characters and Unicode

Total characters903
Distinct characters163
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)100.0%

Sample

1st row에이블제이 주식회사
2nd row주식회사 올빅뎃
3rd row주식회사 페어랩스
4th row블루필 주식회사
5th row디씨엔바이오 주식회사
ValueCountFrequency (%)
주식회사 80
44.2%
아이피윈 1
 
0.6%
아이카 1
 
0.6%
바틀 1
 
0.6%
뉴트리어드바이저 1
 
0.6%
투비 1
 
0.6%
아펠레스 1
 
0.6%
주)플립션코리아 1
 
0.6%
와이제이에스(yjs 1
 
0.6%
아이젠 1
 
0.6%
Other values (92) 92
50.8%
2023-12-12T16:11:35.386734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93
 
10.3%
85
 
9.4%
81
 
9.0%
81
 
9.0%
80
 
8.9%
40
 
4.4%
30
 
3.3%
) 15
 
1.7%
( 15
 
1.7%
14
 
1.6%
Other values (153) 369
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 781
86.5%
Space Separator 85
 
9.4%
Close Punctuation 15
 
1.7%
Open Punctuation 15
 
1.7%
Uppercase Letter 7
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
11.9%
81
 
10.4%
81
 
10.4%
80
 
10.2%
40
 
5.1%
30
 
3.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
12
 
1.5%
Other values (143) 323
41.4%
Uppercase Letter
ValueCountFrequency (%)
J 1
14.3%
S 1
14.3%
Y 1
14.3%
R 1
14.3%
A 1
14.3%
V 1
14.3%
E 1
14.3%
Space Separator
ValueCountFrequency (%)
85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 781
86.5%
Common 115
 
12.7%
Latin 7
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
11.9%
81
 
10.4%
81
 
10.4%
80
 
10.2%
40
 
5.1%
30
 
3.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
12
 
1.5%
Other values (143) 323
41.4%
Latin
ValueCountFrequency (%)
J 1
14.3%
S 1
14.3%
Y 1
14.3%
R 1
14.3%
A 1
14.3%
V 1
14.3%
E 1
14.3%
Common
ValueCountFrequency (%)
85
73.9%
) 15
 
13.0%
( 15
 
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 781
86.5%
ASCII 122
 
13.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
93
 
11.9%
81
 
10.4%
81
 
10.4%
80
 
10.2%
40
 
5.1%
30
 
3.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
12
 
1.5%
Other values (143) 323
41.4%
ASCII
ValueCountFrequency (%)
85
69.7%
) 15
 
12.3%
( 15
 
12.3%
J 1
 
0.8%
S 1
 
0.8%
Y 1
 
0.8%
R 1
 
0.8%
A 1
 
0.8%
V 1
 
0.8%
E 1
 
0.8%

분야
Categorical

Distinct18
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Memory size940.0 B
정보통신업
35 
제조업
24 
서비스
10 
서비스업
도매 및 소매업
Other values (13)
19 

Length

Max length21
Median length19
Mean length5.2277228
Min length2

Unique

Unique9 ?
Unique (%)8.9%

Sample

1st row정보통신업
2nd row정보통신업
3rd row전문, 과학 및 기술서비스업
4th row제조업
5th row서비스

Common Values

ValueCountFrequency (%)
정보통신업 35
34.7%
제조업 24
23.8%
서비스 10
 
9.9%
서비스업 9
 
8.9%
도매 및 소매업 4
 
4.0%
전문, 과학 및 기술서비스업 4
 
4.0%
전문 과학기술 및 기술서비스업 2
 
2.0%
도소매 2
 
2.0%
<NA> 2
 
2.0%
소매업 1
 
1.0%
Other values (8) 8
 
7.9%

Length

2023-12-12T16:11:35.575924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
정보통신업 36
26.7%
제조업 24
17.8%
13
 
9.6%
서비스 10
 
7.4%
서비스업 9
 
6.7%
소매업 6
 
4.4%
전문 6
 
4.4%
기술서비스업 6
 
4.4%
도매 4
 
3.0%
과학 4
 
3.0%
Other values (12) 17
12.6%

입주호수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct101
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46646.832
Minimum701
Maximum825862
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T16:11:35.744809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum701
5-th percentile706
Q1738
median778
Q3834
95-th percentile718859
Maximum825862
Range825161
Interquartile range (IQR)96

Descriptive statistics

Standard deviation183368.77
Coefficient of variation (CV)3.9310016
Kurtosis12.729254
Mean46646.832
Median Absolute Deviation (MAD)48
Skewness3.7992438
Sum4711330
Variance3.3624107 × 1010
MonotonicityNot monotonic
2023-12-12T16:11:35.920020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
701 1
 
1.0%
807 1
 
1.0%
825862 1
 
1.0%
824 1
 
1.0%
823 1
 
1.0%
822 1
 
1.0%
819 1
 
1.0%
818 1
 
1.0%
817 1
 
1.0%
813 1
 
1.0%
Other values (91) 91
90.1%
ValueCountFrequency (%)
701 1
1.0%
702 1
1.0%
703 1
1.0%
704 1
1.0%
705 1
1.0%
706 1
1.0%
707 1
1.0%
708 1
1.0%
709 1
1.0%
710 1
1.0%
ValueCountFrequency (%)
825862 1
1.0%
804820 1
1.0%
774802 1
1.0%
766805 1
1.0%
739744 1
1.0%
718859 1
1.0%
7151 1
1.0%
864 1
1.0%
861 1
1.0%
860 1
1.0%

Interactions

2023-12-12T16:11:33.880774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:11:33.715770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:11:33.960228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:11:33.811037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:11:36.021136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분야입주호수
순번1.0000.0000.000
분야0.0001.0000.273
입주호수0.0000.2731.000
2023-12-12T16:11:36.140043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번입주호수분야
순번1.0000.8450.000
입주호수0.8451.0000.141
분야0.0000.1411.000

Missing values

2023-12-12T16:11:34.077672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:11:34.175845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번입주기업명분야입주호수
01에이블제이 주식회사정보통신업701
12주식회사 올빅뎃정보통신업702
23주식회사 페어랩스전문, 과학 및 기술서비스업703
34블루필 주식회사제조업704
45디씨엔바이오 주식회사서비스705
56주식회사 니나노컴퍼니서비스706
67주식회사 어드밴스솔루션제조업707
78에이스브라더스스포츠용품708
89주식회사 홈나이도매 및 소매업709
910씨에이이머그 주식회사정보통신업710
순번입주기업명분야입주호수
9192주식회사 어밸브서비스업847
9293주식회사 긱스로프트제조업848
9394주식회사 랜딩정보통신업850
9495주식회사 피텐제조업852
9596주식회사 이우솔루션정보서비스855
9697메이트코리아 주식회사전문, 과학 및 기술서비스업856
9798(주)모드랩정보통신업858
9899주식회사 링커버스정보통신업860
99100주식회사 에스엠티제조업861
100101주식회사 케이펫도매 및 소매업864