Overview

Dataset statistics

Number of variables: 7
Number of observations: 296
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.9 KiB
Average record size in memory: 58.4 B

Variable types

Categorical: 5
Text: 1
Numeric: 1

Dataset

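Description: Results of the business establishment survey conducted annually by Jeju Special Self-Governing Province, broken down by the gender of the business representative. Reference: Jeju Statistics Portal website.
Author: 제주특별자치도 (Jeju Special Self-Governing Province)
URL: https://www.data.go.kr/data/15109169/fileData.do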

Alerts

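관련부서 (related department) has constant value "정책기획관" [Constant]
기준연도 (reference year) is highly overall correlated with 데이터기준일자 (data reference date) [High correlation]
데이터기준일자 is highly overall correlated with 기준연도 [High correlation]
대표자 수 (number of representatives) has 21 (7.1%) zeros [Zeros]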
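
The constant and zeros alerts can be checked by hand. The sketch below is a minimal illustration in plain Python, not ydata-profiling's own implementation. It uses only the ten head rows listed in the Sample section, so the zero share it computes for those rows differs from the 7.1% reported for all 296 rows. The helper names `is_constant` and `zero_share` are hypothetical.

```python
# Ten head rows from the Sample section, reduced to two columns:
# (대표자 수, 관련부서)
head_rows = [
    (395, "정책기획관"), (113, "정책기획관"), (23, "정책기획관"),
    (11, "정책기획관"), (179, "정책기획관"), (38, "정책기획관"),
    (0, "정책기획관"), (0, "정책기획관"), (0, "정책기획관"),
    (0, "정책기획관"),
]

def is_constant(values):
    """True when every value in the column is identical."""
    return len(set(values)) == 1

def zero_share(values):
    """Fraction of entries that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)

counts = [row[0] for row in head_rows]
departments = [row[1] for row in head_rows]

print(is_constant(departments))   # constant check on 관련부서
print(zero_share(counts))         # zero share in the ten head rows only
print(round(21 / 296 * 100, 1))   # share reported for the full dataset
```

For the full dataset the same arithmetic gives 21 / 296, which rounds to the 7.1% shown in the alert.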

Reproduction

Analysis started: 2024-04-19 05:38:07.811302
Analysis finished: 2024-04-19 05:38:08.359183
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
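
A report of this kind can typically be regenerated with ydata-profiling's `ProfileReport` class. The sketch below is a hedged example, not the configuration actually used for this report: the helper name `build_report`, the default `cp949` encoding, and the file names are assumptions. The third-party imports are deferred into the function body so that the definition loads even where pandas or ydata-profiling is not installed.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write the HTML report to html_path.

    The encoding default is an assumption; pass "utf-8" if the file
    was saved that way.
    """
    # Deferred imports: both packages must be installed to call this function.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Profiling Report")
    report.to_file(html_path)
    return html_path

# Example call with a hypothetical local copy of the CSV:
# build_report("jeju_representatives.csv")
```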

Variables

기준연도 (reference year)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value | Count | Frequency (%)
2020 | 148 | 50.0%
2021 | 148 | 50.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 2020 and 2021 each account for 148 rows (50.0%)]

산업 대분류 (industry section)
Categorical

Distinct: 19
Distinct (%): 6.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 32
Median length: 26
Mean length: 17.621622
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: A.농업·임업 및 어업(01~03)
2nd row: A.농업·임업 및 어업(01~03)
3rd row: A.농업·임업 및 어업(01~03)
4th row: A.농업·임업 및 어업(01~03)
5th row: A.농업·임업 및 어업(01~03)

Common Values

Value | Count | Frequency (%) | English gloss
C.제조업(10~34) | 100 | 33.8% | Manufacturing
J.정보통신업(58~63) | 24 | 8.1% | Information and communication
B.광업(05~08) | 16 | 5.4% | Mining
E.수도·하수 및 폐기물 처리·원료 재생업(36 ~ 39) | 16 | 5.4% | Water supply, sewage, waste management and materials recovery
M.전문·과학 및 기술 서비스업(70~73) | 16 | 5.4% | Professional, scientific and technical services
H.운수 및 창고업(49~52) | 16 | 5.4% | Transportation and storage
A.농업·임업 및 어업(01~03) | 12 | 4.1% | Agriculture, forestry and fishing
S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 12 | 4.1% | Membership organizations, repair and other personal services
G.도매 및 소매업(45~47) | 12 | 4.1% | Wholesale and retail trade
K.금융 및 보험업(64~66) | 12 | 4.1% | Finance and insurance
Other values (9) | 60 | 20.3% |

Length

[Figure: histogram of lengths of the category]

Token frequencies (lowercased fragments as printed in the report)

Value | Count | Frequency (%)
(token not recoverable) | 152 | 19.8%
c.제조업(10~34 | 100 | 13.0%
j.정보통신업(58~63 | 24 | 3.1%
e.수도·하수 | 16 | 2.1%
39 | 16 | 2.1%
창고업(49~52 | 16 | 2.1%
h.운수 | 16 | 2.1%
b.광업(05~08 | 16 | 2.1%
m.전문·과학 | 16 | 2.1%
서비스업(70~73 | 16 | 2.1%
Other values (39) | 380 | 49.5%

산업 중분류 (industry division)
Text

Distinct: 74
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 28
Median length: 21
Mean length: 14.567568
Min length: 5

Characters and Unicode

Total characters: 4312
Distinct characters: 176
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
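
Python's standard `unicodedata` module exposes these properties directly. As a rough sketch (not the profiler's own code), the following counts Unicode general categories over the five sample values shown for this variable. The helper name `category_counts` is hypothetical. In the output, `Lo` corresponds to Other Letter, `Nd` to Decimal Number, `Po` to Other Punctuation, and `Zs` to Space Separator.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable.
samples = ["01.농업", "01.농업", "02.임업", "02.임업", "03.어업"]

def category_counts(strings):
    """Count Unicode general categories over every character in the strings."""
    return Counter(unicodedata.category(ch) for s in strings for ch in s)

counts = category_counts(samples)
for category, count in counts.most_common():
    print(category, count)
```

Each sample value contains two digits, one full stop, and two Hangul syllables, so the five values together contribute 10 `Nd`, 5 `Po`, and 10 `Lo` characters.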

Unique

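Unique: 0
Unique (%): 0.0%

Sample

1st row: 01.농업 (agriculture)
2nd row: 01.농업
3rd row: 02.임업 (forestry)
4th row: 02.임업
5th row: 03.어업 (fishing)

Token frequencies

Value | Count | Frequency (%) | English gloss
(token not recoverable) | 152 | 16.1% |
제조업 | 92 | 9.7% | manufacturing
서비스업 | 52 | 5.5% | services
제외 | 24 | 2.5% | excluding
운송업 | 12 | 1.3% | transport
광업 | 12 | 1.3% | mining
기계 | 12 | 1.3% | machinery
의약품 | 8 | 0.8% | pharmaceuticals
가구 | 8 | 0.8% | furniture
수리업 | 8 | 0.8% | repair
Other values (140) | 564 | 59.7% |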

Most occurring characters

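Value | Count | Frequency (%)
(space) | 648 | 15.0%
(Hangul character, not recoverable) | 304 | 7.1%
. | 296 | 6.9%
(Hangul character, not recoverable) | 168 | 3.9%
(Hangul character, not recoverable) | 152 | 3.5%
(Hangul character, not recoverable) | 100 | 2.3%
(Hangul character, not recoverable) | 88 | 2.0%
(Hangul character, not recoverable) | 84 | 1.9%
(Hangul character, not recoverable) | 80 | 1.9%
· | 80 | 1.9%
Other values (166) | 2312 | 53.6%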

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2664 | 61.8%
Space Separator | 648 | 15.0%
Decimal Number | 596 | 13.8%
Other Punctuation | 404 | 9.4%

Most frequent character per category

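Other Letter (the characters themselves are not recoverable; rows are listed by rank)

Rank | Count | Frequency (%)
1 | 304 | 11.4%
2 | 168 | 6.3%
3 | 152 | 5.7%
4 | 100 | 3.8%
5 | 88 | 3.3%
6 | 84 | 3.2%
7 | 80 | 3.0%
8 | 68 | 2.6%
9 | 60 | 2.3%
10 | 36 | 1.4%
Other values (152) | 1524 | 57.2%

Decimal Number

Value | Count | Frequency (%)
1 | 80 | 13.4%
2 | 72 | 12.1%
6 | 72 | 12.1%
5 | 68 | 11.4%
3 | 64 | 10.7%
0 | 56 | 9.4%
7 | 52 | 8.7%
4 | 52 | 8.7%
8 | 40 | 6.7%
9 | 40 | 6.7%

Other Punctuation

Value | Count | Frequency (%)
. | 296 | 73.3%
· | 80 | 19.8%
; | 28 | 6.9%

Space Separator

Value | Count | Frequency (%)
(space) | 648 | 100.0%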

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2664 | 61.8%
Common | 1648 | 38.2%

Most frequent character per script

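Hangul (the characters themselves are not recoverable; rows are listed by rank)

Rank | Count | Frequency (%)
1 | 304 | 11.4%
2 | 168 | 6.3%
3 | 152 | 5.7%
4 | 100 | 3.8%
5 | 88 | 3.3%
6 | 84 | 3.2%
7 | 80 | 3.0%
8 | 68 | 2.6%
9 | 60 | 2.3%
10 | 36 | 1.4%
Other values (152) | 1524 | 57.2%

Common

Value | Count | Frequency (%)
(space) | 648 | 39.3%
. | 296 | 18.0%
· | 80 | 4.9%
1 | 80 | 4.9%
2 | 72 | 4.4%
6 | 72 | 4.4%
5 | 68 | 4.1%
3 | 64 | 3.9%
0 | 56 | 3.4%
7 | 52 | 3.2%
Other values (4) | 160 | 9.7%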

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2660 | 61.7%
ASCII | 1568 | 36.4%
None | 80 | 1.9%
Compat Jamo | 4 | 0.1%

Most frequent character per block

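ASCII

Value | Count | Frequency (%)
(space) | 648 | 41.3%
. | 296 | 18.9%
1 | 80 | 5.1%
2 | 72 | 4.6%
6 | 72 | 4.6%
5 | 68 | 4.3%
3 | 64 | 4.1%
0 | 56 | 3.6%
7 | 52 | 3.3%
4 | 52 | 3.3%
Other values (3) | 108 | 6.9%

Hangul (the characters themselves are not recoverable; rows are listed by rank)

Rank | Count | Frequency (%)
1 | 304 | 11.4%
2 | 168 | 6.3%
3 | 152 | 5.7%
4 | 100 | 3.8%
5 | 88 | 3.3%
6 | 84 | 3.2%
7 | 80 | 3.0%
8 | 68 | 2.6%
9 | 60 | 2.3%
10 | 36 | 1.4%
Other values (151) | 1520 | 57.1%

None

Value | Count | Frequency (%)
· | 80 | 100.0%

Compat Jamo

Value | Count | Frequency (%)
(character not recoverable) | 4 | 100.0%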

성별 (gender)
Categorical

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남자
2nd row: 여자
3rd row: 남자
4th row: 여자
5th row: 남자

Common Values

Value | Count | Frequency (%) | English gloss
남자 | 148 | 50.0% | male
여자 | 148 | 50.0% | female

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 남자 and 여자 each account for 148 rows (50.0%)]

대표자 수 (number of representatives)
Real number (ℝ)

ZEROS

Distinct: 203
Distinct (%): 68.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 642.17568
Minimum: 0
Maximum: 10580
Zeros: 21
Zeros (%): 7.1%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 19
Median: 84.5
Q3: 394.25
95-th percentile: 2916.25
Maximum: 10580
Range: 10580
Interquartile range (IQR): 375.25

Descriptive statistics

Standard deviation: 1631.4968
Coefficient of variation (CV): 2.5405771
Kurtosis: 16.670578
Mean: 642.17568
Median Absolute Deviation (MAD): 81.5
Skewness: 3.9885737
Sum: 190084
Variance: 2661781.9
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
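
Several of these figures are derived from one another, which allows a quick consistency check. The snippet below is a small worked example that uses only the rounded values printed in this report, so agreement is expected only to within rounding error.

```python
import math

n = 296                # Number of observations
total = 190084         # Sum
mean = 642.17568       # Mean
std = 1631.4968        # Standard deviation
variance = 2661781.9   # Variance
cv = 2.5405771         # Coefficient of variation
q1, q3 = 19, 394.25    # Q1 and Q3
minimum, maximum = 0, 10580

print(total / n)           # mean = sum / n
print(q3 - q1)             # interquartile range = Q3 - Q1
print(maximum - minimum)   # range = maximum - minimum
print(std / mean)          # coefficient of variation = std / mean
print(std ** 2)            # variance = std squared

# Each derived value matches the reported one to within rounding.
print(math.isclose(total / n, mean, rel_tol=1e-6))
print(math.isclose(std / mean, cv, rel_tol=1e-6))
print(math.isclose(std ** 2, variance, rel_tol=1e-6))
```

The mean (642) sits far above the median (84.5), and the coefficient of variation exceeds 2, both consistent with the strong right skew reported (skewness 3.99).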

Common values

Value | Count | Frequency (%)
0 | 21 | 7.1%
3 | 7 | 2.4%
1 | 6 | 2.0%
14 | 5 | 1.7%
2 | 4 | 1.4%
19 | 4 | 1.4%
5 | 4 | 1.4%
25 | 4 | 1.4%
81 | 4 | 1.4%
15 | 3 | 1.0%
Other values (193) | 234 | 79.1%

Minimum 10 values

Value | Count | Frequency (%)
0 | 21 | 7.1%
1 | 6 | 2.0%
2 | 4 | 1.4%
3 | 7 | 2.4%
4 | 2 | 0.7%
5 | 4 | 1.4%
6 | 1 | 0.3%
7 | 3 | 1.0%
8 | 2 | 0.7%
9 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
10580 | 1 | 0.3%
10373 | 1 | 0.3%
8830 | 1 | 0.3%
8748 | 1 | 0.3%
7382 | 1 | 0.3%
7180 | 1 | 0.3%
7094 | 1 | 0.3%
7001 | 1 | 0.3%
6951 | 1 | 0.3%
6858 | 1 | 0.3%

관련부서 (related department)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 정책기획관
2nd row: 정책기획관
3rd row: 정책기획관
4th row: 정책기획관
5th row: 정책기획관

Common Values

Value | Count | Frequency (%) | English gloss
정책기획관 | 296 | 100.0% | Policy Planning Office

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 정책기획관 accounts for all 296 rows (100.0%)]

데이터기준일자 (data reference date)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-08-09
2nd row: 2022-08-09
3rd row: 2022-08-09
4th row: 2022-08-09
5th row: 2022-08-09

Common Values

Value | Count | Frequency (%)
2022-08-09 | 148 | 50.0%
2023-01-18 | 148 | 50.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 2022-08-09 and 2023-01-18 each account for 148 rows (50.0%)]

Interactions

[Figure: interactions plot]

Correlations

Correlation matrix 1 (shown as heatmap and table)

 | 기준연도 | 산업 대분류 | 산업 중분류 | 성별 | 대표자 수 | 데이터기준일자
기준연도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
산업 대분류 | 0.000 | 1.000 | 1.000 | 0.000 | 0.746 | 0.000
산업 중분류 | 0.000 | 1.000 | 1.000 | 0.000 | 0.869 | 0.000
성별 | 0.000 | 0.000 | 0.000 | 1.000 | 0.169 | 0.000
대표자 수 | 0.000 | 0.746 | 0.869 | 0.169 | 1.000 | 0.000
데이터기준일자 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000

Correlation matrix 2 (shown as heatmap and table)

 | 성별 | 기준연도 | 데이터기준일자 | 산업 대분류
성별 | 1.000 | 0.000 | 0.000 | 0.000
기준연도 | 0.000 | 1.000 | 0.993 | 0.000
데이터기준일자 | 0.000 | 0.993 | 1.000 | 0.000
산업 대분류 | 0.000 | 0.000 | 0.000 | 1.000

Correlation matrix 3 (shown as heatmap and table)

 | 대표자 수 | 기준연도 | 산업 대분류 | 성별 | 데이터기준일자
대표자 수 | 1.000 | 0.000 | 0.400 | 0.166 | 0.000
기준연도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.993
산업 대분류 | 0.400 | 0.000 | 1.000 | 0.000 | 0.000
성별 | 0.166 | 0.000 | 0.000 | 1.000 | 0.000
데이터기준일자 | 0.000 | 0.993 | 0.000 | 0.000 | 1.000
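
The report does not say which association measure each matrix uses. As an illustration only, the 0.993 between 기준연도 and 데이터기준일자 is the value a bias-corrected Cramér's V with Yates continuity correction yields for a 2×2 table in which each year maps to exactly one date (148 rows each). That one-to-one mapping is an assumption inferred from the value counts and sample rows, and the function name `cramers_v_corrected` is hypothetical.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows.

    The Yates continuity correction is applied only when the table is 2x2.
    Assumes every row total and column total is non-zero.
    """
    r, k = len(table), len(table[0])
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]
    yates = 0.5 if (r == 2 and k == 2) else 0.0

    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            diff = max(abs(table[i][j] - expected) - yates, 0.0)
            chi2 += diff ** 2 / expected

    phi2 = chi2 / n
    # Bias correction of phi squared and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return 1.0 if denom == 0 else math.sqrt(phi2_corr / denom)

# Assumed cross-tabulation: rows are years, columns are reference dates.
year_by_date = [[148, 0], [0, 148]]
v = cramers_v_corrected(year_by_date)
print(round(v, 3))  # 0.993
```

Without the continuity correction the same table gives exactly 1.0, which is one plausible reason a perfectly dependent pair can appear as 0.993 in one matrix and 1.000 in another.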

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

# | 기준연도 | 산업 대분류 | 산업 중분류 | 성별 | 대표자 수 | 관련부서 | 데이터기준일자
0 | 2020 | A.농업·임업 및 어업(01~03) | 01.농업 | 남자 | 395 | 정책기획관 | 2022-08-09
1 | 2020 | A.농업·임업 및 어업(01~03) | 01.농업 | 여자 | 113 | 정책기획관 | 2022-08-09
2 | 2020 | A.농업·임업 및 어업(01~03) | 02.임업 | 남자 | 23 | 정책기획관 | 2022-08-09
3 | 2020 | A.농업·임업 및 어업(01~03) | 02.임업 | 여자 | 11 | 정책기획관 | 2022-08-09
4 | 2020 | A.농업·임업 및 어업(01~03) | 03.어업 | 남자 | 179 | 정책기획관 | 2022-08-09
5 | 2020 | A.농업·임업 및 어업(01~03) | 03.어업 | 여자 | 38 | 정책기획관 | 2022-08-09
6 | 2020 | B.광업(05~08) | 05.석탄·원유 및 천연가스 광업 | 남자 | 0 | 정책기획관 | 2022-08-09
7 | 2020 | B.광업(05~08) | 05.석탄·원유 및 천연가스 광업 | 여자 | 0 | 정책기획관 | 2022-08-09
8 | 2020 | B.광업(05~08) | 06.금속 광업 | 남자 | 0 | 정책기획관 | 2022-08-09
9 | 2020 | B.광업(05~08) | 06.금속 광업 | 여자 | 0 | 정책기획관 | 2022-08-09

Last rows

# | 기준연도 | 산업 대분류 | 산업 중분류 | 성별 | 대표자 수 | 관련부서 | 데이터기준일자
286 | 2021 | R.예술·스포츠 및 여가관련 서비스업(90~91) | 90.창작·예술 및 여가관련 서비스업 | 남자 | 329 | 정책기획관 | 2023-01-18
287 | 2021 | R.예술·스포츠 및 여가관련 서비스업(90~91) | 90.창작·예술 및 여가관련 서비스업 | 여자 | 231 | 정책기획관 | 2023-01-18
288 | 2021 | R.예술·스포츠 및 여가관련 서비스업(90~91) | 91.스포츠 및 오락관련 서비스업 | 남자 | 1368 | 정책기획관 | 2023-01-18
289 | 2021 | R.예술·스포츠 및 여가관련 서비스업(90~91) | 91.스포츠 및 오락관련 서비스업 | 여자 | 717 | 정책기획관 | 2023-01-18
290 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 94.협회 및 단체 | 남자 | 1436 | 정책기획관 | 2023-01-18
291 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 94.협회 및 단체 | 여자 | 300 | 정책기획관 | 2023-01-18
292 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 95.개인 및 소비용품 수리업 | 남자 | 1023 | 정책기획관 | 2023-01-18
293 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 95.개인 및 소비용품 수리업 | 여자 | 290 | 정책기획관 | 2023-01-18
294 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 96.기타 개인 서비스업 | 남자 | 1121 | 정책기획관 | 2023-01-18
295 | 2021 | S.협회 및 단체·수리 및 기타 개인 서비스업(94~96) | 96.기타 개인 서비스업 | 여자 | 2971 | 정책기획관 | 2023-01-18