Overview

Dataset statistics

Number of variables4
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory36.6 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description대전광역시 2021년 유망중소기업 선정현황입니다. 2022년 공공데이터 기업매칭지원사업으로 수행되었습니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15111101/fileData.do

Alerts

선정년도 has constant value ""Constant
연번 has unique valuesUnique
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:06:44.404026
Analysis finished2023-12-12 01:06:44.941990
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.5
Minimum1
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2023-12-12T10:06:45.050331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.45
Q113.25
median25.5
Q337.75
95-th percentile47.55
Maximum50
Range49
Interquartile range (IQR)24.5

Descriptive statistics

Standard deviation14.57738
Coefficient of variation (CV)0.57166195
Kurtosis-1.2
Mean25.5
Median Absolute Deviation (MAD)12.5
Skewness0
Sum1275
Variance212.5
MonotonicityNot monotonic
2023-12-12T10:06:45.232296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
39 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%

선정년도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2021
50 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 50
100.0%

Length

2023-12-12T10:06:45.389061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:06:45.539085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 50
100.0%

업체명
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T10:06:45.777891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length10
Mean length6.46
Min length3

Characters and Unicode

Total characters323
Distinct characters110
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row주식회사 경원알미늄
2nd row㈜위드텍
3rd row㈜파이버프로
4th row㈜지티사이언
5th row㈜리메드
ValueCountFrequency (%)
주식회사 10
 
16.7%
㈜삼보 1
 
1.7%
㈜지에프테크놀로지 1
 
1.7%
㈜에이엠시스템 1
 
1.7%
이레테크 1
 
1.7%
신한정밀공업㈜ 1
 
1.7%
㈜오션정보기술 1
 
1.7%
㈜준형 1
 
1.7%
㈜파셉 1
 
1.7%
모루기술 1
 
1.7%
Other values (41) 41
68.3%
2023-12-12T10:06:46.215195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
12.1%
17
 
5.3%
12
 
3.7%
11
 
3.4%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
7
 
2.2%
Other values (100) 188
58.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 274
84.8%
Other Symbol 39
 
12.1%
Space Separator 10
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
6.2%
12
 
4.4%
11
 
4.0%
10
 
3.6%
10
 
3.6%
10
 
3.6%
9
 
3.3%
7
 
2.6%
7
 
2.6%
6
 
2.2%
Other values (98) 175
63.9%
Other Symbol
ValueCountFrequency (%)
39
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 313
96.9%
Common 10
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
12.5%
17
 
5.4%
12
 
3.8%
11
 
3.5%
10
 
3.2%
10
 
3.2%
10
 
3.2%
9
 
2.9%
7
 
2.2%
7
 
2.2%
Other values (99) 181
57.8%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 274
84.8%
None 39
 
12.1%
ASCII 10
 
3.1%

Most frequent character per block

None
ValueCountFrequency (%)
39
100.0%
Hangul
ValueCountFrequency (%)
17
 
6.2%
12
 
4.4%
11
 
4.0%
10
 
3.6%
10
 
3.6%
10
 
3.6%
9
 
3.3%
7
 
2.6%
7
 
2.6%
6
 
2.2%
Other values (98) 175
63.9%
ASCII
ValueCountFrequency (%)
10
100.0%
Distinct49
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T10:06:46.867479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length20
Mean length12.32
Min length2

Characters and Unicode

Total characters616
Distinct characters171
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)96.0%

Sample

1st row알루미늄 창호
2nd row환경정밀 계측기기
3rd row광센서조립체 및 광계측 기기
4th rowlot시약장, 위해가스 정화장치
5th row의료기기
ValueCountFrequency (%)
10
 
6.5%
7
 
4.6%
개발 5
 
3.3%
소프트웨어 4
 
2.6%
엔지니어링 3
 
2.0%
창호 2
 
1.3%
알루미늄 2
 
1.3%
기계 2
 
1.3%
자동차 2
 
1.3%
서비스 2
 
1.3%
Other values (108) 114
74.5%
2023-12-12T10:06:47.397098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
105
 
17.0%
23
 
3.7%
17
 
2.8%
16
 
2.6%
, 16
 
2.6%
15
 
2.4%
13
 
2.1%
11
 
1.8%
10
 
1.6%
10
 
1.6%
Other values (161) 380
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 467
75.8%
Space Separator 105
 
17.0%
Other Punctuation 18
 
2.9%
Uppercase Letter 11
 
1.8%
Lowercase Letter 8
 
1.3%
Open Punctuation 3
 
0.5%
Close Punctuation 3
 
0.5%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
4.9%
17
 
3.6%
16
 
3.4%
15
 
3.2%
13
 
2.8%
11
 
2.4%
10
 
2.1%
10
 
2.1%
9
 
1.9%
9
 
1.9%
Other values (139) 334
71.5%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
T 2
18.2%
G 1
 
9.1%
I 1
 
9.1%
V 1
 
9.1%
D 1
 
9.1%
E 1
 
9.1%
L 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
t 2
25.0%
a 1
12.5%
s 1
12.5%
h 1
12.5%
f 1
12.5%
o 1
12.5%
l 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 16
88.9%
/ 1
 
5.6%
& 1
 
5.6%
Space Separator
ValueCountFrequency (%)
105
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 467
75.8%
Common 130
 
21.1%
Latin 19
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
4.9%
17
 
3.6%
16
 
3.4%
15
 
3.2%
13
 
2.8%
11
 
2.4%
10
 
2.1%
10
 
2.1%
9
 
1.9%
9
 
1.9%
Other values (139) 334
71.5%
Latin
ValueCountFrequency (%)
C 3
15.8%
t 2
 
10.5%
T 2
 
10.5%
a 1
 
5.3%
G 1
 
5.3%
I 1
 
5.3%
s 1
 
5.3%
h 1
 
5.3%
f 1
 
5.3%
o 1
 
5.3%
Other values (5) 5
26.3%
Common
ValueCountFrequency (%)
105
80.8%
, 16
 
12.3%
( 3
 
2.3%
) 3
 
2.3%
5 1
 
0.8%
/ 1
 
0.8%
& 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 467
75.8%
ASCII 149
 
24.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105
70.5%
, 16
 
10.7%
( 3
 
2.0%
C 3
 
2.0%
) 3
 
2.0%
t 2
 
1.3%
T 2
 
1.3%
a 1
 
0.7%
G 1
 
0.7%
I 1
 
0.7%
Other values (12) 12
 
8.1%
Hangul
ValueCountFrequency (%)
23
 
4.9%
17
 
3.6%
16
 
3.4%
15
 
3.2%
13
 
2.8%
11
 
2.4%
10
 
2.1%
10
 
2.1%
9
 
1.9%
9
 
1.9%
Other values (139) 334
71.5%

Interactions

2023-12-12T10:06:44.643603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:06:47.532869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명주생산품
연번1.0001.0001.000
업체명1.0001.0001.000
주생산품1.0001.0001.000

Missing values

2023-12-12T10:06:44.793892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:06:44.897658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번선정년도업체명주생산품
012021주식회사 경원알미늄알루미늄 창호
122021㈜위드텍환경정밀 계측기기
232021㈜파이버프로광센서조립체 및 광계측 기기
342021㈜지티사이언lot시약장, 위해가스 정화장치
452021㈜리메드의료기기
562021㈜한국건설안전공사건축, 엔지니어링 및 관련 기술업
672021㈜인텍플러스반도체 검사장비
782021주식회사 유진타올타올
892021바프렉스㈜기능성 필름
9102021㈜인포카안전운전 보조시스템 (모바일앱, 스마트스캐너)
연번선정년도업체명주생산품
40412021㈜티에스무정전 전원장치, 자동전압 조정기
41422021비즈㈜연구개발
42432021위텍코퍼레이션㈜황사마스크
43442021㈜유니스소프트전력 ICT 솔루션, 정보 통신 솔루션
44462021아트원 주식회사자동차 제작, 자동차용품
45452021㈜레스텍보건용 마스크 등
46472021㈜아이리스닷넷전자도서관시스템 개발
47482021주식회사 루맥스에어로스페이스기계, 전기 전자 항공기 (유무선 통신장비 부품)
48492021㈜지에프테크놀로지사격장비
49502021㈜아이티코리아정보처리, 소프트개발 등