Overview

Dataset statistics

Number of variables: 3
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 828.0 B
Average record size in memory: 28.6 B
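
The derived figures in this table follow from the raw counts by simple arithmetic. The snippet below is a minimal sketch of that arithmetic, not the profiler's own code; the function name `derived_stats` is hypothetical, and the inputs are copied from the table.

```python
# Inputs copied from the "Dataset statistics" table.
N_VARIABLES = 3
N_OBSERVATIONS = 29
MISSING_CELLS = 0
TOTAL_BYTES = 828.0


def derived_stats(n_vars, n_obs, missing, total_bytes):
    """Recompute the percentage and per-record figures (hypothetical helper)."""
    total_cells = n_vars * n_obs
    return {
        "total_cells": total_cells,
        # Share of cells that are missing, in percent.
        "missing_pct": round(100 * missing / total_cells, 1),
        # Total memory divided by the number of rows.
        "avg_record_bytes": round(total_bytes / n_obs, 1),
    }


stats = derived_stats(N_VARIABLES, N_OBSERVATIONS, MISSING_CELLS, TOTAL_BYTES)
print(stats)
```

With 828.0 B spread over 29 rows, the average comes to about 28.6 B per record, which agrees with the reported value.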

Variable types

Categorical: 2
Text: 1

Dataset

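Description: Designation status of regulation-free special zones by region and industry as of 2021. It shows which fields were designated in non-capital regions across the first through sixth designation rounds.
Author: 중소벤처기업진흥공단 (Korea SMEs and Startups Agency)
URL: https://www.data.go.kr/data/15094642/fileData.do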

Alerts

규제자유 산업 항목 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 20:50:08.823371
Analysis finished: 2023-12-12 20:50:09.117965
Duration: 0.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

차수 (designation round)
Categorical

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
[Figure: category bar chart with labels 1차, 2차, 3차, 5차, 4차]

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 3.4%

Sample

1st row: 1차
2nd row: 1차
3rd row: 1차
4th row: 1차
5th row: 1차

Common Values

Value   Count   Frequency (%)
1차     7       24.1%
2차     7       24.1%
3차     7       24.1%
5차     4       13.8%
4차     3       10.3%
6차     1       3.4%
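
The Distinct, Unique, and Common Values figures for 차수 can be recomputed from the counts in the table. The following is a minimal sketch using Python's `collections.Counter`, not the profiler's implementation; the list `rounds` is rebuilt from the counts above (the original row order is not reproduced), and both function names are hypothetical.

```python
from collections import Counter

# Column rebuilt from the counts in the Common Values table (assumption:
# only the multiset of values matters here, not the original row order).
rounds = ["1차"] * 7 + ["2차"] * 7 + ["3차"] * 7 + ["5차"] * 4 + ["4차"] * 3 + ["6차"]


def common_values(values):
    """Return (value, count, percent) tuples, most frequent first."""
    counts = Counter(values)
    total = len(values)
    return [(v, c, round(100 * c / total, 1)) for v, c in counts.most_common()]


def distinct_and_unique(values):
    """Distinct = number of different values; unique = values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


for value, count, pct in common_values(rounds):
    print(value, count, f"{pct}%")
print(distinct_and_unique(rounds))
```

Note the difference between the two notions: the column has six distinct values, but only 6차 occurs exactly once, so it is the single unique value.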

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot for 차수]

지역 (region)
Categorical

Distinct: 14
Distinct (%): 48.3%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
[Figure: category bar chart with labels 강원, 경북, 부산, 울산, 대구, Other values (9): 15]

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 3
Unique (%): 10.3%

Sample

1st row: 강원
2nd row: 대구
3rd row: 경북
4th row: 부산
5th row: 세종

Common Values

Value              Count   Frequency (%)
강원               3       10.3%
경북               3       10.3%
부산               3       10.3%
울산               3       10.3%
대구               2       6.9%
전남               2       6.9%
충북               2       6.9%
광주               2       6.9%
전북               2       6.9%
경남               2       6.9%
Other values (4)   5       17.2%

Length

[Figure: Histogram of lengths of the category]

규제자유 산업 항목 (regulation-free industry item)
Text

Distinct: 29
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 16
Median length: 11
Mean length: 7.6551724
Min length: 4
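
The mean length is the total character count divided by the number of rows. The sketch below checks that relationship using the totals this report gives for the column (222 characters, 29 observations) and then computes the same four length statistics for the five values listed in this variable's Sample section. It assumes lengths are counted in Unicode code points with precomposed Hangul syllables; `length_stats` is a hypothetical helper, not part of ydata-profiling.

```python
from statistics import mean, median

# Totals reported for the whole column.
TOTAL_CHARACTERS = 222
N_OBSERVATIONS = 29

# First five values of the column, as listed in its Sample section.
sample_rows = ["디지털 헬스케어", "스마트 웰니스", "배터리리사이클링", "블록체인", "자율주행"]


def length_stats(values):
    """Max, median, mean and min string length, counted in code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


column_mean = round(TOTAL_CHARACTERS / N_OBSERVATIONS, 7)
print(column_mean)
print(length_stats(sample_rows))
```

The sample statistics cover only five rows, so they are not expected to match the column-wide figures; the column mean computed from the totals does match the reported 7.6551724.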

Characters and Unicode

Total characters: 222
Distinct characters: 102
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
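
As an illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories in two values taken from this report's sample rows. It is a rough approximation of the idea, not the profiler's implementation; `category_counts` and the `CATEGORY_NAMES` mapping are hypothetical names, and script and block lookups are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
    "Ll": "Lowercase Letter",
}


def category_counts(values):
    """Count the Unicode general category of every character in the values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# Two values from the sample rows; together they cover all six categories.
values = ["e-모빌리티", "5G 활용 차세대 스마트공장"]
counts = category_counts(values)
for code, count in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Hangul syllables fall under Other Letter (`Lo`), which is why that category dominates the column.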

Unique

Unique: 29
Unique (%): 100.0%

Sample

1st row: 디지털 헬스케어
2nd row: 스마트 웰니스
3rd row: 배터리리사이클링
4th row: 블록체인
5th row: 자율주행

Value               Count   Frequency (%)
스마트              2       4.2%
디지털              1       2.1%
자원화              1       2.1%
전환                1       2.1%
산업용              1       2.1%
헴프                1       2.1%
해양모빌리티        1       2.1%
탄소융복합산업      1       2.1%
그린에너지          1       2.1%
에너지저장장치      1       2.1%
Other values (37)   37      77.1%

Most occurring characters

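Value               Count   Frequency (%)
(space)             19      8.6%
(missing)           8       3.6%
(missing)           8       3.6%
(missing)           8       3.6%
(missing)           7       3.2%
(missing)           6       2.7%
(missing)           5       2.3%
(missing)           5       2.3%
(missing)           5       2.3%
(missing)           5       2.3%
Other values (92)   146     65.8%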

Most occurring categories

Value              Count   Frequency (%)
Other Letter       199     89.6%
Space Separator    19      8.6%
Decimal Number     1       0.5%
Uppercase Letter   1       0.5%
Dash Punctuation   1       0.5%
Lowercase Letter   1       0.5%

Most frequent character per category

Other Letter
Value               Count   Frequency (%)
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           7       3.5%
(missing)           6       3.0%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           4       2.0%
Other values (87)   138     69.3%

Space Separator
Value     Count   Frequency (%)
(space)   19      100.0%

Decimal Number
Value   Count   Frequency (%)
5       1       100.0%

Uppercase Letter
Value   Count   Frequency (%)
G       1       100.0%

Dash Punctuation
Value   Count   Frequency (%)
-       1       100.0%

Lowercase Letter
Value   Count   Frequency (%)
e       1       100.0%

Most occurring scripts

Value    Count   Frequency (%)
Hangul   199     89.6%
Common   21      9.5%
Latin    2       0.9%

Most frequent character per script

Hangul
Value               Count   Frequency (%)
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           7       3.5%
(missing)           6       3.0%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           4       2.0%
Other values (87)   138     69.3%

Common
Value     Count   Frequency (%)
(space)   19      90.5%
5         1       4.8%
-         1       4.8%

Latin
Value   Count   Frequency (%)
G       1       50.0%
e       1       50.0%

Most occurring blocks

Value    Count   Frequency (%)
Hangul   199     89.6%
ASCII    23      10.4%

Most frequent character per block

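ASCII
Value     Count   Frequency (%)
(space)   19      82.6%
5         1       4.3%
G         1       4.3%
-         1       4.3%
e         1       4.3%

Hangul
Value               Count   Frequency (%)
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           8       4.0%
(missing)           7       3.5%
(missing)           6       3.0%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           5       2.5%
(missing)           4       2.0%
Other values (87)   138     69.3%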

Correlations

Variable             차수    지역    규제자유 산업 항목
차수                 1.000   0.000   1.000
지역                 0.000   1.000   1.000
규제자유 산업 항목   1.000   1.000   1.000

Variable   지역    차수
지역       1.000   0.000
차수       0.000   1.000

Variable   차수    지역
차수       1.000   0.000
지역       0.000   1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index   차수   지역   규제자유 산업 항목
0       1차    강원   디지털 헬스케어
1       1차    대구   스마트 웰니스
2       1차    경북   배터리리사이클링
3       1차    부산   블록체인
4       1차    세종   자율주행
5       1차    전남   e-모빌리티
6       1차    충북   스마트 안전제어
7       2차    광주   무인 저속 특장차
8       2차    대전   바이오메디칼
9       2차    울산   수소그린 모빌리티

Index   차수   지역   규제자유 산업 항목
19      3차    부산   해양모빌리티
20      3차    전북   탄소융복합산업
21      4차    광주   그린에너지 에너지저장장치 발전
22      4차    울산   이산화탄소 자원화
23      4차    경남   5G 활용 차세대 스마트공장
24      5차    강원   정밀의료산업
25      5차    충북   그린수소 산업
26      5차    충남   탄소저감건설소재
27      5차    경북   스마트그린물류
28      6차    부산   암모니아 친환경에너지