Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 30 |
Missing cells | 1 |
Missing cells (%) | 0.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.2 KiB |
Average record size in memory | 39.4 B |
Variable types
Numeric | 3 |
---|---|
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 경기도경제과학진흥원 |
URL | https://bigdata-region.kr/#/dataset/e70f4772-4c10-4d38-a308-1159f0eccd63 |
Reproduction
Analysis started | 2023-12-10 13:55:26.607520 |
---|---|
Analysis finished | 2023-12-10 13:55:29.157208 |
Duration | 2.55 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
UNIQUE
  ZEROS
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.5 |
Minimum | 0 |
---|---|
Maximum | 29 |
Zeros | 1 |
Zeros (%) | 3.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.45 |
Q1 | 7.25 |
median | 14.5 |
Q3 | 21.75 |
95-th percentile | 27.55 |
Maximum | 29 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.60713162 |
Kurtosis | -1.2 |
Mean | 14.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 435 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
0 | 1 | 3.3% |
16 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
0 | 1 | |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 |
Value | Count | Frequency (%) |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 | |
20 | 1 |
우편번호
Real number (ℝ)
MISSING
 
Distinct | 27 |
---|---|
Distinct (%) | 93.1% |
Missing | 1 |
Missing (%) | 3.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10068.724 |
Minimum | 3051 |
---|---|
Maximum | 18583 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 3051 |
---|---|
5-th percentile | 3156 |
Q1 | 4513 |
median | 10257 |
Q3 | 14057 |
95-th percentile | 18448.8 |
Maximum | 18583 |
Range | 15532 |
Interquartile range (IQR) | 9544 |
Descriptive statistics
Standard deviation | 5278.1779 |
---|---|
Coefficient of variation (CV) | 0.52421517 |
Kurtosis | -1.3104768 |
Mean | 10068.724 |
Median Absolute Deviation (MAD) | 4366 |
Skewness | 0.20575206 |
Sum | 291993 |
Variance | 27859162 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10881 | 3 | 10.0% |
16643 | 1 | 3.3% |
14057 | 1 | 3.3% |
8588 | 1 | 3.3% |
4376 | 1 | 3.3% |
11623 | 1 | 3.3% |
14623 | 1 | 3.3% |
18578 | 1 | 3.3% |
6103 | 1 | 3.3% |
13605 | 1 | 3.3% |
Other values (17) | 17 |
Value | Count | Frequency (%) |
3051 | 1 | |
3134 | 1 | |
3189 | 1 | |
3909 | 1 | |
3997 | 1 | |
4074 | 1 | |
4376 | 1 | |
4513 | 1 | |
6103 | 1 | |
6633 | 1 |
Value | Count | Frequency (%) |
18583 | 1 | |
18578 | 1 | |
18255 | 1 | |
16898 | 1 | |
16827 | 1 | |
16643 | 1 | |
14623 | 1 | |
14057 | 1 | |
14056 | 1 | |
13605 | 1 |
범주명
Text
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 18 |
---|---|
Median length | 14 |
Mean length | 12.8 |
Min length | 8 |
Characters and Unicode
Total characters | 384 |
---|---|
Distinct characters | 106 |
Distinct categories | 4 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 30.0% |
Sample
1st row | 기기;기계;장비 |
---|---|
2nd row | 의료;보건;복지;제약;동물 |
3rd row | 연구;조사;분석;컨설팅;R&D |
4th row | 웹;모바일프로그래밍 |
5th row | 온라인 포털; 호스팅 |
Value | Count | Frequency (%) |
웹;모바일프로그래밍 | 6 | |
언론;방송;연예;공연 | 4 | |
판매;유통;무역;도소매;운송;물류 | 4 | |
마케팅;광고;홍보;전시;출판;인쇄 | 3 | |
기기;기계;장비 | 2 | 6.2% |
의료;보건;복지;제약;동물 | 2 | 6.2% |
교육;유학;어학 | 1 | 3.1% |
솔루션;si;시스템;it컨설팅 | 1 | 3.1% |
임업;가구;목재;제지 | 1 | 3.1% |
생활가전;용품;소비재;사무 | 1 | 3.1% |
Other values (7) | 7 |
Most occurring characters
Value | Count | Frequency (%) |
; | 87 | 22.7% |
연 | 9 | 2.3% |
송 | 8 | 2.1% |
매 | 8 | 2.1% |
판 | 7 | 1.8% |
웹 | 6 | 1.6% |
팅 | 6 | 1.6% |
기 | 6 | 1.6% |
유 | 6 | 1.6% |
물 | 6 | 1.6% |
Other values (96) | 235 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 286 | |
Other Punctuation | 88 | 22.9% |
Uppercase Letter | 8 | 2.1% |
Space Separator | 2 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
연 | 9 | 3.1% |
송 | 8 | 2.8% |
매 | 8 | 2.8% |
판 | 7 | 2.4% |
웹 | 6 | 2.1% |
팅 | 6 | 2.1% |
기 | 6 | 2.1% |
유 | 6 | 2.1% |
물 | 6 | 2.1% |
밍 | 6 | 2.1% |
Other values (88) | 218 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 3 | |
T | 2 | |
S | 1 | 12.5% |
R | 1 | 12.5% |
D | 1 | 12.5% |
Other Punctuation
Value | Count | Frequency (%) |
; | 87 | |
& | 1 | 1.1% |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 286 | |
Common | 90 | 23.4% |
Latin | 8 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
연 | 9 | 3.1% |
송 | 8 | 2.8% |
매 | 8 | 2.8% |
판 | 7 | 2.4% |
웹 | 6 | 2.1% |
팅 | 6 | 2.1% |
기 | 6 | 2.1% |
유 | 6 | 2.1% |
물 | 6 | 2.1% |
밍 | 6 | 2.1% |
Other values (88) | 218 |
Latin
Value | Count | Frequency (%) |
I | 3 | |
T | 2 | |
S | 1 | 12.5% |
R | 1 | 12.5% |
D | 1 | 12.5% |
Common
Value | Count | Frequency (%) |
; | 87 | |
2 | 2.2% | |
& | 1 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 286 | |
ASCII | 98 | 25.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
; | 87 | |
I | 3 | 3.1% |
2 | 2.0% | |
T | 2 | 2.0% |
S | 1 | 1.0% |
R | 1 | 1.0% |
& | 1 | 1.0% |
D | 1 | 1.0% |
Hangul
Value | Count | Frequency (%) |
연 | 9 | 3.1% |
송 | 8 | 2.8% |
매 | 8 | 2.8% |
판 | 7 | 2.4% |
웹 | 6 | 2.1% |
팅 | 6 | 2.1% |
기 | 6 | 2.1% |
유 | 6 | 2.1% |
물 | 6 | 2.1% |
밍 | 6 | 2.1% |
Other values (88) | 218 |
복지지수
Real number (ℝ)
Distinct | 13 |
---|---|
Distinct (%) | 43.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.3333333 |
Minimum | 1 |
---|---|
Maximum | 19 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5.5 |
Q3 | 8.75 |
95-th percentile | 13 |
Maximum | 19 |
Range | 18 |
Interquartile range (IQR) | 5.75 |
Descriptive statistics
Standard deviation | 4.2938074 |
---|---|
Coefficient of variation (CV) | 0.67796958 |
Kurtosis | 1.0347514 |
Mean | 6.3333333 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.93070299 |
Sum | 190 |
Variance | 18.436782 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 4 | |
2 | 4 | |
3 | 3 | |
7 | 3 | |
1 | 3 | |
10 | 3 | |
8 | 3 | |
13 | 2 | |
6 | 1 | 3.3% |
9 | 1 | 3.3% |
Other values (3) | 3 |
Value | Count | Frequency (%) |
1 | 3 | |
2 | 4 | |
3 | 3 | |
4 | 1 | 3.3% |
5 | 4 | |
6 | 1 | 3.3% |
7 | 3 | |
8 | 3 | |
9 | 1 | 3.3% |
10 | 3 |
Value | Count | Frequency (%) |
19 | 1 | 3.3% |
13 | 2 | |
11 | 1 | 3.3% |
10 | 3 | |
9 | 1 | 3.3% |
8 | 3 | |
7 | 3 | |
6 | 1 | 3.3% |
5 | 4 | |
4 | 1 | 3.3% |
분석인덱스 | 우편번호 | 범주명 | 복지지수 | |
---|---|---|---|---|
분석인덱스 | 1.000 | 0.663 | 0.476 | 0.643 |
우편번호 | 0.663 | 1.000 | 0.336 | 0.601 |
범주명 | 0.476 | 0.336 | 1.000 | 0.628 |
복지지수 | 0.643 | 0.601 | 0.628 | 1.000 |
분석인덱스 | 우편번호 | 복지지수 | |
---|---|---|---|
분석인덱스 | 1.000 | 0.093 | -0.038 |
우편번호 | 0.093 | 1.000 | 0.134 |
복지지수 | -0.038 | 0.134 | 1.000 |
분석인덱스 | 우편번호 | 범주명 | 복지지수 | |
---|---|---|---|---|
0 | 0 | 16643 | 기기;기계;장비 | 3 |
1 | 1 | 16827 | 의료;보건;복지;제약;동물 | 13 |
2 | 2 | 3189 | 연구;조사;분석;컨설팅;R&D | 6 |
3 | 3 | 3909 | 웹;모바일프로그래밍 | 7 |
4 | 4 | 3051 | 온라인 포털; 호스팅 | 7 |
5 | 5 | 16898 | 판매;유통;무역;도소매;운송;물류 | 3 |
6 | 6 | 4513 | 석유;화학;에너지;환경 | 9 |
7 | 7 | 3134 | 교육;유학;어학 | 5 |
8 | 8 | 10881 | 마케팅;광고;홍보;전시;출판;인쇄 | 1 |
9 | 9 | 18583 | 기기;기계;장비 | 5 |
분석인덱스 | 우편번호 | 범주명 | 복지지수 | |
---|---|---|---|---|
20 | 20 | 6633 | 웹;모바일프로그래밍 | 8 |
21 | 21 | 7976 | 언론;방송;연예;공연 | 1 |
22 | 22 | 13605 | 의료;보건;복지;제약;동물 | 10 |
23 | 23 | 10881 | 마케팅;광고;홍보;전시;출판;인쇄 | 1 |
24 | 24 | 6103 | 마케팅;광고;홍보;전시;출판;인쇄 | 3 |
25 | 25 | 18578 | 임업;가구;목재;제지 | 4 |
26 | 26 | 14623 | 솔루션;SI;시스템;IT컨설팅 | 11 |
27 | 27 | 11623 | IT하드웨어;장비 | 2 |
28 | 28 | 4376 | 판매;유통;무역;도소매;운송;물류 | 7 |
29 | 29 | 8588 | 언론;방송;연예;공연 | 10 |