Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1843 |
Missing cells | 920 |
Missing cells (%) | 8.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 93.7 KiB |
Average record size in memory | 52.1 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 4 |
Text | 1 |
Dataset
Description | 중장기개방계획에따른 경상남도 경남도립거창대학 데이터자료입니다.(대분류, 중분류, 소분류, 세분류, 세세분류, 직업명등의 데이터를 포함하고있습니다.) |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15066703 |
Reproduction
Analysis started | 2023-12-11 00:34:57.542605 |
---|---|
Analysis finished | 2023-12-11 00:34:59.600155 |
Duration | 2.06 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
대분류
Categorical
Distinct | 10 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 14.5 KiB |
2 | |
---|---|
8 | |
7 | |
1 | |
4 | |
Other values (5) |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 648 | |
8 | 341 | |
7 | 304 | |
1 | 122 | 6.6% |
4 | 121 | 6.6% |
3 | 97 | 5.3% |
9 | 91 | 4.9% |
5 | 59 | 3.2% |
6 | 50 | 2.7% |
A | 10 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 648 | |
8 | 341 | |
7 | 304 | |
1 | 122 | 6.6% |
4 | 121 | 6.6% |
3 | 97 | 5.3% |
9 | 91 | 4.9% |
5 | 59 | 3.2% |
6 | 50 | 2.7% |
a | 10 | 0.5% |
중분류
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.5% |
Missing | 10 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.3758865 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 16.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 7 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.4870861 |
---|---|
Coefficient of variation (CV) | 0.56836165 |
Kurtosis | -1.0516673 |
Mean | 4.3758865 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.36291042 |
Sum | 8021 |
Variance | 6.1855973 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 310 | |
1 | 251 | |
2 | 247 | |
4 | 242 | |
5 | 216 | |
7 | 201 | |
8 | 160 | |
9 | 125 | |
6 | 81 | 4.4% |
(Missing) | 10 | 0.5% |
Value | Count | Frequency (%) |
1 | 251 | |
2 | 247 | |
3 | 310 | |
4 | 242 | |
5 | 216 | |
6 | 81 | 4.4% |
7 | 201 | |
8 | 160 | |
9 | 125 |
Value | Count | Frequency (%) |
9 | 125 | |
8 | 160 | |
7 | 201 | |
6 | 81 | 4.4% |
5 | 216 | |
4 | 242 | |
3 | 310 | |
2 | 247 | |
1 | 251 |
소분류
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 0.6% |
Missing | 62 |
Missing (%) | 3.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.7012914 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 124 |
Zeros (%) | 6.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 16.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 9 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.2459811 |
---|---|
Coefficient of variation (CV) | 0.83144716 |
Kurtosis | 1.5470798 |
Mean | 2.7012914 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.4217413 |
Sum | 4811 |
Variance | 5.0444309 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 507 | |
2 | 452 | |
3 | 246 | |
4 | 163 | 8.8% |
0 | 124 | 6.7% |
9 | 106 | 5.8% |
5 | 94 | 5.1% |
6 | 48 | 2.6% |
7 | 30 | 1.6% |
8 | 11 | 0.6% |
(Missing) | 62 | 3.4% |
Value | Count | Frequency (%) |
0 | 124 | 6.7% |
1 | 507 | |
2 | 452 | |
3 | 246 | |
4 | 163 | 8.8% |
5 | 94 | 5.1% |
6 | 48 | 2.6% |
7 | 30 | 1.6% |
8 | 11 | 0.6% |
9 | 106 | 5.8% |
Value | Count | Frequency (%) |
9 | 106 | 5.8% |
8 | 11 | 0.6% |
7 | 30 | 1.6% |
6 | 48 | 2.6% |
5 | 94 | 5.1% |
4 | 163 | 8.8% |
3 | 246 | |
2 | 452 | |
1 | 507 | |
0 | 124 | 6.7% |
세분류
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 0.6% |
Missing | 211 |
Missing (%) | 11.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.8167892 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 201 |
Zeros (%) | 10.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 16.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 9 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.5958298 |
---|---|
Coefficient of variation (CV) | 0.92155628 |
Kurtosis | 0.76663453 |
Mean | 2.8167892 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.2877764 |
Sum | 4597 |
Variance | 6.7383323 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 419 | |
2 | 355 | |
3 | 215 | |
0 | 201 | |
9 | 170 | |
4 | 142 | 7.7% |
5 | 76 | 4.1% |
6 | 36 | 2.0% |
7 | 15 | 0.8% |
8 | 3 | 0.2% |
(Missing) | 211 |
Value | Count | Frequency (%) |
0 | 201 | |
1 | 419 | |
2 | 355 | |
3 | 215 | |
4 | 142 | 7.7% |
5 | 76 | 4.1% |
6 | 36 | 2.0% |
7 | 15 | 0.8% |
8 | 3 | 0.2% |
9 | 170 |
Value | Count | Frequency (%) |
9 | 170 | |
8 | 3 | 0.2% |
7 | 15 | 0.8% |
6 | 36 | 2.0% |
5 | 76 | 4.1% |
4 | 142 | 7.7% |
3 | 215 | |
2 | 355 | |
1 | 419 | |
0 | 201 |
세세분류
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 0.8% |
Missing | 637 |
Missing (%) | 34.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.0704809 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 129 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 16.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 9 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.8120973 |
---|---|
Coefficient of variation (CV) | 0.91584914 |
Kurtosis | 0.1261149 |
Mean | 3.0704809 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.1569103 |
Sum | 3703 |
Variance | 7.9078914 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 297 | |
2 | 277 | |
9 | 169 | 9.2% |
3 | 158 | 8.6% |
0 | 129 | 7.0% |
4 | 89 | 4.8% |
5 | 44 | 2.4% |
6 | 26 | 1.4% |
7 | 11 | 0.6% |
8 | 6 | 0.3% |
(Missing) | 637 |
Value | Count | Frequency (%) |
0 | 129 | |
1 | 297 | |
2 | 277 | |
3 | 158 | |
4 | 89 | 4.8% |
5 | 44 | 2.4% |
6 | 26 | 1.4% |
7 | 11 | 0.6% |
8 | 6 | 0.3% |
9 | 169 |
Value | Count | Frequency (%) |
9 | 169 | |
8 | 6 | 0.3% |
7 | 11 | 0.6% |
6 | 26 | 1.4% |
5 | 44 | 2.4% |
4 | 89 | 4.8% |
3 | 158 | |
2 | 277 | |
1 | 297 | |
0 | 129 |
직업명
Text
Distinct | 1677 |
---|---|
Distinct (%) | 91.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 14.5 KiB |
Value | Count | Frequency (%) |
및 | 624 | 11.4% |
조작원 | 189 | 3.5% |
그 | 171 | 3.1% |
외 | 171 | 3.1% |
관련 | 102 | 1.9% |
관리자 | 101 | 1.9% |
연구원 | 98 | 1.8% |
기술자 | 87 | 1.6% |
종사원 | 84 | 1.5% |
사무원 | 69 | 1.3% |
Other values (1436) | 3759 |
Most occurring characters
Value | Count | Frequency (%) |
3612 | 19.8% | |
원 | 1117 | 6.1% |
기 | 643 | 3.5% |
및 | 624 | 3.4% |
사 | 580 | 3.2% |
조 | 469 | 2.6% |
관 | 441 | 2.4% |
자 | 407 | 2.2% |
작 | 282 | 1.5% |
리 | 257 | 1.4% |
Other values (402) | 9855 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 14668 | |
Space Separator | 3612 | 19.8% |
Uppercase Letter | 4 | < 0.1% |
Decimal Number | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 1117 | 7.6% |
기 | 643 | 4.4% |
및 | 624 | 4.3% |
사 | 580 | 4.0% |
조 | 469 | 3.2% |
관 | 441 | 3.0% |
자 | 407 | 2.8% |
작 | 282 | 1.9% |
리 | 257 | 1.8% |
전 | 251 | 1.7% |
Other values (397) | 9597 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2 | |
C | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
9 | 1 |
Space Separator
Value | Count | Frequency (%) |
3612 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 14668 | |
Common | 3615 | 19.8% |
Latin | 4 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 1117 | 7.6% |
기 | 643 | 4.4% |
및 | 624 | 4.3% |
사 | 580 | 4.0% |
조 | 469 | 3.2% |
관 | 441 | 3.0% |
자 | 407 | 2.8% |
작 | 282 | 1.9% |
리 | 257 | 1.8% |
전 | 251 | 1.7% |
Other values (397) | 9597 |
Common
Value | Count | Frequency (%) |
3612 | ||
1 | 2 | 0.1% |
9 | 1 | < 0.1% |
Latin
Value | Count | Frequency (%) |
P | 2 | |
C | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 14667 | |
ASCII | 3619 | 19.8% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3612 | ||
P | 2 | 0.1% |
C | 2 | 0.1% |
1 | 2 | 0.1% |
9 | 1 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
원 | 1117 | 7.6% |
기 | 643 | 4.4% |
및 | 624 | 4.3% |
사 | 580 | 4.0% |
조 | 469 | 3.2% |
관 | 441 | 3.0% |
자 | 407 | 2.8% |
작 | 282 | 1.9% |
리 | 257 | 1.8% |
전 | 251 | 1.7% |
Other values (396) | 9596 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
대분류 | 중분류 | 소분류 | 세분류 | 세세분류 | |
---|---|---|---|---|---|
대분류 | 1.000 | 0.538 | 0.580 | 0.458 | 0.113 |
중분류 | 0.538 | 1.000 | 0.495 | 0.234 | 0.000 |
소분류 | 0.580 | 0.495 | 1.000 | 0.268 | 0.114 |
세분류 | 0.458 | 0.234 | 0.268 | 1.000 | 0.114 |
세세분류 | 0.113 | 0.000 | 0.114 | 0.114 | 1.000 |
중분류 | 소분류 | 세분류 | 세세분류 | 대분류 | |
---|---|---|---|---|---|
중분류 | 1.000 | 0.259 | 0.032 | 0.004 | 0.280 |
소분류 | 0.259 | 1.000 | 0.057 | 0.016 | 0.212 |
세분류 | 0.032 | 0.057 | 1.000 | -0.061 | 0.156 |
세세분류 | 0.004 | 0.016 | -0.061 | 1.000 | 0.035 |
대분류 | 0.280 | 0.212 | 0.156 | 0.035 | 1.000 |
대분류 | 중분류 | 소분류 | 세분류 | 세세분류 | 직업명 | |
---|---|---|---|---|---|---|
0 | 2 | 8 | 1 | 2 | <NA> | 번역가 |
1 | 2 | 8 | 1 | 2 | 0 | 번역가 |
2 | 2 | 8 | 1 | 3 | <NA> | 통역가 |
3 | 2 | 8 | 1 | 3 | 0 | 통역가 |
4 | 2 | 8 | 1 | 4 | <NA> | 기자 및 논설위원 |
5 | 2 | 8 | 1 | 4 | 1 | 기자 |
6 | 2 | 8 | 1 | 4 | 2 | 논설위원 |
7 | 2 | 8 | 1 | 4 | 3 | 칼럼니스트 |
8 | 2 | 8 | 1 | 5 | <NA> | 출판물 전문가 |
9 | 2 | 8 | 1 | 5 | 1 | 출판물 기획자 |
대분류 | 중분류 | 소분류 | 세분류 | 세세분류 | 직업명 | |
---|---|---|---|---|---|---|
1833 | A | <NA> | <NA> | <NA> | <NA> | 군인 |
1834 | A | 1 | <NA> | <NA> | <NA> | 군인 |
1835 | A | 1 | 1 | <NA> | <NA> | 장교 |
1836 | A | 1 | 1 | 1 | <NA> | 영관급 이상 |
1837 | A | 1 | 1 | 1 | 0 | 영관급 이상 장교 |
1838 | A | 1 | 1 | 2 | <NA> | 위관급 |
1839 | A | 1 | 1 | 2 | 0 | 위관급 장교 |
1840 | A | 1 | 2 | <NA> | <NA> | 장기 부사관 및 준위 |
1841 | A | 1 | 2 | 0 | <NA> | 장기 부사관 및 준위 |
1842 | A | 1 | 2 | 0 | 0 | 장기 부사관 및 준위 |