Overview

Dataset statistics

Number of variables: 4
Number of observations: 23
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 914.0 B
Average record size in memory: 39.7 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

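Description: List of companies selected as promising small and medium-sized enterprises of Daejeon Metropolitan City in 2022. The fields are 선정년도 (selection year), 업체명 (company name), and 주생산품 (main product).
Author: Daejeon Metropolitan City (대전광역시)
URL: https://www.data.go.kr/data/15077571/fileData.do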

Alerts

선정년도 has constant value "2022" (Constant)
연번 has unique values (Unique)
업체명 has unique values (Unique)
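
The Constant and Unique alerts follow from the number of distinct values in each column. The sketch below illustrates that check on five rows taken from the Sample section (연번 1 to 4 and 10). The function name `column_alerts` and its two rules are assumptions made for illustration, not ydata-profiling's own implementation.

```python
def column_alerts(rows):
    """Return {column: alert} for columns that are constant or fully unique."""
    alerts = {}
    for col in rows[0].keys():
        values = [row[col] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[col] = "Constant"      # every row holds the same value
        elif distinct == len(values):
            alerts[col] = "Unique"        # no value repeats
    return alerts


# Rows copied from the Sample section of this report.
rows = [
    {"연번": 1, "선정년도": "2022", "업체명": "프리시젼바이오㈜", "주생산품": "체외진단의료기기 및 진단용시약"},
    {"연번": 2, "선정년도": "2022", "업체명": "㈜코어테크놀로지", "주생산품": "SW개발(U-러닝솔루션)"},
    {"연번": 3, "선정년도": "2022", "업체명": "㈜포텍", "주생산품": "안광학의료기기"},
    {"연번": 4, "선정년도": "2022", "업체명": "주식회사 아레스", "주생산품": "SW개발"},
    {"연번": 10, "선정년도": "2022", "업체명": "주식회사 인포비정보기술", "주생산품": "SW개발"},
]

alerts = column_alerts(rows)
print(alerts)
```

주생산품 receives no alert here because "SW개발" appears twice, which matches its absence from the Alerts list.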

Reproduction

Analysis started: 2023-12-12 06:49:37.204189
Analysis finished: 2023-12-12 06:49:37.950511
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12
Minimum: 1
Maximum: 23
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.1
Q1: 6.5
Median: 12
Q3: 17.5
95-th percentile: 21.9
Maximum: 23
Range: 22
Interquartile range (IQR): 11
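
The quantiles above are consistent with linear interpolation between ranks over the values 1 to 23. The following is a minimal sketch of that calculation; the helper name `percentile` is hypothetical, and the interpolation rule is an assumption that happens to reproduce the reported figures.

```python
def percentile(sorted_values, q):
    """Percentile q (0 to 100) using linear interpolation between adjacent ranks."""
    position = (len(sorted_values) - 1) * q / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


values = list(range(1, 24))  # 연번 runs from 1 to 23

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(round(p5, 1), q1, median, q3, round(p95, 1), iqr)
```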

Descriptive statistics

Standard deviation: 6.78233
Coefficient of variation (CV): 0.56519417
Kurtosis: -1.2
Mean: 12
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 276
Variance: 46
Monotonicity: Strictly increasing
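
These figures can be checked by hand, since 연번 is simply the sequence 1 to 23. The sketch below recomputes them with the Python standard library. It assumes the sample (n - 1) variance and a bias-corrected excess kurtosis of the kind pandas uses; the report itself does not state which estimators were applied.

```python
import math
import statistics

values = list(range(1, 24))  # 연번 runs from 1 to 23
n = len(values)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                           # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])

# Central moments (population form) used for skewness and kurtosis.
m2 = sum((v - mean) ** 2 for v in values) / n
m3 = sum((v - mean) ** 3 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n

skewness = m3 / m2 ** 1.5                 # zero for a symmetric sequence
g2 = m4 / m2 ** 2 - 3                     # uncorrected excess kurtosis
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)  # bias-corrected

print(total, mean, variance, round(std, 5), round(cv, 8), mad, skewness, round(kurtosis, 4))
```

The values are strictly increasing because each entry exceeds the previous one by exactly 1.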
Histogram with fixed size bins (bins=23)

Common values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
3 | 1 | 4.3%
4 | 1 | 4.3%
5 | 1 | 4.3%
6 | 1 | 4.3%
7 | 1 | 4.3%
8 | 1 | 4.3%
9 | 1 | 4.3%
10 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
15 | 1 | 4.3%
14 | 1 | 4.3%

선정년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B
2022
23 

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 23 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022 | 23 | 100.0%

업체명
Text

UNIQUE 

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 19
Median length: 12
Mean length: 8.3478261
Min length: 3

Characters and Unicode

Total characters: 192
Distinct characters: 85
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
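
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module for three company names taken from this report. The helper name `category_counts` is hypothetical. The two-letter codes correspond to the category names used in the tables: `Lo` is Other Letter, `Lu` is Uppercase Letter, `So` is Other Symbol, `Po` is Other Punctuation, and `Zs` is Space Separator. Script and block lookups are not available in the standard library and are left out.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Company names copied from the Sample section of this report.
names = ["㈜포텍", "대성F&D", "주식회사 아레스"]

totals = Counter()
for name in names:
    totals += category_counts(name)

print(dict(totals))
```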

Unique

Unique: 23
Unique (%): 100.0%

Sample

1st row: 프리시젼바이오㈜
2nd row: ㈜코어테크놀로지
3rd row: ㈜포텍
4th row: 주식회사 아레스
5th row: ㈜일신오토클레이브

Value | Count | Frequency (%)
주식회사 | 9 | 26.5%
프리시젼바이오㈜ | 1 | 2.9%
스폰코리아 | 1 | 2.9%
이아이에스 | 1 | 2.9%
라미랩 | 1 | 2.9%
㈜온더시스 | 1 | 2.9%
아이옵스 | 1 | 2.9%
콜라보에어㈜ | 1 | 2.9%
㈜카보엑스퍼트 | 1 | 2.9%
스탠더드시험연구소 | 1 | 2.9%
Other values (16) | 16 | 47.1%

Most occurring characters

ValueCountFrequency (%)
12
 
6.2%
11
 
5.7%
11
 
5.7%
11
 
5.7%
10
 
5.2%
9
 
4.7%
9
 
4.7%
8
 
4.2%
4
 
2.1%
4
 
2.1%
Other values (75) 103
53.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 167 | 87.0%
Other Symbol | 11 | 5.7%
Space Separator | 11 | 5.7%
Uppercase Letter | 2 | 1.0%
Other Punctuation | 1 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.2%
11
 
6.6%
10
 
6.0%
9
 
5.4%
9
 
5.4%
8
 
4.8%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
Other values (70) 94
56.3%
Uppercase Letter
ValueCountFrequency (%)
F 1
50.0%
D 1
50.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 178 | 92.7%
Common | 12 | 6.2%
Latin | 2 | 1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
6.7%
11
 
6.2%
11
 
6.2%
10
 
5.6%
9
 
5.1%
9
 
5.1%
8
 
4.5%
4
 
2.2%
4
 
2.2%
3
 
1.7%
Other values (71) 97
54.5%
Common
ValueCountFrequency (%)
11
91.7%
& 1
 
8.3%
Latin
ValueCountFrequency (%)
F 1
50.0%
D 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 167 | 87.0%
ASCII | 14 | 7.3%
None | 11 | 5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
7.2%
11
 
6.6%
10
 
6.0%
9
 
5.4%
9
 
5.4%
8
 
4.8%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
Other values (70) 94
56.3%
None
ValueCountFrequency (%)
11
100.0%
ASCII
ValueCountFrequency (%)
11
78.6%
F 1
 
7.1%
& 1
 
7.1%
D 1
 
7.1%

주생산품
Text

Distinct: 19
Distinct (%): 82.6%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 19
Median length: 14
Mean length: 9.6521739
Min length: 4

Characters and Unicode

Total characters: 222
Distinct characters: 112
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 78.3%

Sample

1st row: 체외진단의료기기 및 진단용시약
2nd row: SW개발(U-러닝솔루션)
3rd row: 안광학의료기기
4th row: SW개발
5th row: 초고압분산기,정수압프레스

Value | Count | Frequency (%)
sw개발 | 5 | 12.5%
건축용도료 | 1 | 2.5%
쌀국수 | 1 | 2.5%
화재방호 | 1 | 2.5%
분석,평가,기술개발 | 1 | 2.5%
말토덱스트린 | 1 | 2.5%
화장품 | 1 | 2.5%
영상 | 1 | 2.5%
콘텐츠 | 1 | 2.5%
sw개발(물관리솔루션 | 1 | 2.5%
Other values (26) | 26 | 65.0%
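
The entries in this word table look like whitespace-separated tokens that were lowercased and had punctuation trimmed from both ends: "SW개발" appears as "sw개발", and "SW개발(물관리솔루션)" loses only its closing parenthesis. The sketch below reproduces that behaviour on a few 주생산품 values from this report. The function name `word_counts` and the exact tokenisation rules are assumptions inferred from the table, not ydata-profiling's documented procedure.

```python
import string
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens after lowercasing and trimming edge punctuation."""
    counts = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(string.punctuation).lower()
            if token:
                counts[token] += 1
    return counts


# 주생산품 values copied from the Sample section of this report.
products = [
    "SW개발",
    "SW개발",
    "SW개발(물관리솔루션)",
    "말토덱스트린, 화장품",
    "화재방호 분석,평가,기술개발",
    "영상 콘텐츠",
]

counts = word_counts(products)
for token, count in counts.most_common():
    print(token, count)
```

Commas inside a token, as in "분석,평가,기술개발", are kept because only the edges of each token are trimmed.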

Most occurring characters

ValueCountFrequency (%)
17
 
7.7%
12
 
5.4%
, 11
 
5.0%
10
 
4.5%
9
 
4.1%
S 8
 
3.6%
W 8
 
3.6%
4
 
1.8%
4
 
1.8%
4
 
1.8%
Other values (102) 135
60.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 170 | 76.6%
Space Separator | 17 | 7.7%
Uppercase Letter | 17 | 7.7%
Other Punctuation | 11 | 5.0%
Close Punctuation | 3 | 1.4%
Open Punctuation | 3 | 1.4%
Dash Punctuation | 1 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.1%
10
 
5.9%
9
 
5.3%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (94) 114
67.1%
Uppercase Letter
ValueCountFrequency (%)
S 8
47.1%
W 8
47.1%
U 1
 
5.9%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 170 | 76.6%
Common | 35 | 15.8%
Latin | 17 | 7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
7.1%
10
 
5.9%
9
 
5.3%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (94) 114
67.1%
Common
ValueCountFrequency (%)
17
48.6%
, 11
31.4%
) 3
 
8.6%
( 3
 
8.6%
- 1
 
2.9%
Latin
ValueCountFrequency (%)
S 8
47.1%
W 8
47.1%
U 1
 
5.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 170 | 76.6%
ASCII | 52 | 23.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
32.7%
, 11
21.2%
S 8
15.4%
W 8
15.4%
) 3
 
5.8%
( 3
 
5.8%
- 1
 
1.9%
U 1
 
1.9%
Hangul
ValueCountFrequency (%)
12
 
7.1%
10
 
5.9%
9
 
5.3%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (94) 114
67.1%

Interactions


Correlations

Variable | 연번 | 업체명 | 주생산품
연번 | 1.000 | 1.000 | 0.738
업체명 | 1.000 | 1.000 | 1.000
주생산품 | 0.738 | 1.000 | 1.000
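
The report does not say which association measure fills this matrix. One measure that would explain the 1.000 entries for 업체명 is Cramér's V: when one variable has a different value in every row, it fully determines the other variable and the uncorrected statistic reaches its maximum. The sketch below shows that effect. Treating the matrix as Cramér's V is an assumption, the function `cramers_v` is written here for illustration, and it is not expected to reproduce the 0.738 figure.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    pair_counts = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, x_total in x_counts.items():
        for y, y_total in y_counts.items():
            expected = x_total * y_total / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Values copied from the Sample section (연번 1 to 4 and 10).
names = ["프리시젼바이오㈜", "㈜코어테크놀로지", "㈜포텍", "주식회사 아레스", "주식회사 인포비정보기술"]
products = ["체외진단의료기기 및 진단용시약", "SW개발(U-러닝솔루션)", "안광학의료기기", "SW개발", "SW개발"]

v_unique = cramers_v(names, products)

# Hypothetical labels with no association, for contrast.
v_independent = cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"])

print(round(v_unique, 3), round(v_independent, 3))
```

An association of 1.000 with a fully unique column therefore says little about the data; it is a side effect of every row forming its own category.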

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
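
Both displays summarise per-column counts of missing cells, which are all zero for this dataset. The sketch below shows that count on rows from the Sample section. The helper `missing_by_column` and the choice to treat `None` and empty strings as missing are assumptions for illustration, and the incomplete row is hypothetical.

```python
def missing_by_column(rows, columns):
    """Return {column: number of cells that are None or an empty string}."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }


columns = ["연번", "선정년도", "업체명", "주생산품"]

# Rows copied from the Sample section of this report.
rows = [
    {"연번": 1, "선정년도": "2022", "업체명": "프리시젼바이오㈜", "주생산품": "체외진단의료기기 및 진단용시약"},
    {"연번": 2, "선정년도": "2022", "업체명": "㈜코어테크놀로지", "주생산품": "SW개발(U-러닝솔루션)"},
    {"연번": 3, "선정년도": "2022", "업체명": "㈜포텍", "주생산품": "안광학의료기기"},
]

complete = missing_by_column(rows, columns)

# Hypothetical row with a missing 주생산품 cell, to show a non-zero count.
with_gap = missing_by_column(rows + [{"연번": 99, "선정년도": "2022", "업체명": "example", "주생산품": None}], columns)

print(complete)
print(with_gap)
```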

Sample

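First rows

Index | 연번 | 선정년도 | 업체명 | 주생산품
0 | 1 | 2022 | 프리시젼바이오㈜ | 체외진단의료기기 및 진단용시약
1 | 2 | 2022 | ㈜코어테크놀로지 | SW개발(U-러닝솔루션)
2 | 3 | 2022 | ㈜포텍 | 안광학의료기기
3 | 4 | 2022 | 주식회사 아레스 | SW개발
4 | 5 | 2022 | ㈜일신오토클레이브 | 초고압분산기,정수압프레스
5 | 6 | 2022 | 주시회사 알씨테크 | 무선통신장비
6 | 7 | 2022 | 한국축산데이터 주식회사 농업회사법인 | SW개발(농장관리 솔루션)
7 | 8 | 2022 | ㈜청오엔지니어링 | 농업용 자동개폐모터
8 | 9 | 2022 | ㈜포스엔텍 | 고압 반응기, 가압오븐
9 | 10 | 2022 | 주식회사 인포비정보기술 | SW개발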
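
Last rows

Index | 연번 | 선정년도 | 업체명 | 주생산품
13 | 14 | 2022 | 가온유니폼 | 단체복, 유니폼
14 | 15 | 2022 | 대성F&D | 냉면, 쌀국수
15 | 16 | 2022 | 주식회사 스탠더드시험연구소 | 화재방호 분석,평가,기술개발
16 | 17 | 2022 | ㈜카보엑스퍼트 | 말토덱스트린, 화장품
17 | 18 | 2022 | 콜라보에어㈜ | 영상 콘텐츠
18 | 19 | 2022 | 주식회사 아이옵스 | SW개발
19 | 20 | 2022 | ㈜온더시스 | SW개발(물관리솔루션)
20 | 21 | 2022 | 라미랩 주식회사 | SW개발
21 | 22 | 2022 | 주식회사 이아이에스 | 면진테이블, 지진계측기
22 | 23 | 2022 | ㈜뉴젠사이언스 | 의료기기 소독제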