Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 1262 |
Missing cells | 122 |
Missing cells (%) | 2.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 40.8 KiB |
Average record size in memory | 33.1 B |
Variable types
Text | 3 |
---|---|
Numeric | 1 |
Dataset
Description | 경기도 경기통계시스템 출처 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=7Z0IQFV4O0V48NTF5QZJ33541341&infSeq=1 |
Reproduction
Analysis started | 2023-12-10 22:31:45.716552 |
---|---|
Analysis finished | 2023-12-10 22:31:46.330016 |
Duration | 0.61 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
통계조사ID
Text
UNIQUE
 
Distinct | 1262 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.0 KiB |
Value | Count | Frequency (%) |
1991016 | 1 | 0.1% |
1971003 | 1 | 0.1% |
1974002 | 1 | 0.1% |
1974001 | 1 | 0.1% |
1973002 | 1 | 0.1% |
1973001 | 1 | 0.1% |
1972002 | 1 | 0.1% |
1975004 | 1 | 0.1% |
1971004 | 1 | 0.1% |
1971002 | 1 | 0.1% |
Other values (1252) | 1252 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2810 | |
1 | 1389 | |
9 | 1288 | |
2 | 998 | 11.0% |
6 | 569 | 6.3% |
7 | 433 | 4.8% |
3 | 403 | 4.5% |
5 | 361 | 4.0% |
8 | 354 | 3.9% |
4 | 344 | 3.8% |
Other values (2) | 83 | 0.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8949 | |
Uppercase Letter | 83 | 0.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2810 | |
1 | 1389 | |
9 | 1288 | |
2 | 998 | 11.2% |
6 | 569 | 6.4% |
7 | 433 | 4.8% |
3 | 403 | 4.5% |
5 | 361 | 4.0% |
8 | 354 | 4.0% |
4 | 344 | 3.8% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 80 | |
A | 3 | 3.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8949 | |
Latin | 83 | 0.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2810 | |
1 | 1389 | |
9 | 1288 | |
2 | 998 | 11.2% |
6 | 569 | 6.4% |
7 | 433 | 4.8% |
3 | 403 | 4.5% |
5 | 361 | 4.0% |
8 | 354 | 4.0% |
4 | 344 | 3.8% |
Latin
Value | Count | Frequency (%) |
B | 80 | |
A | 3 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9032 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2810 | |
1 | 1389 | |
9 | 1288 | |
2 | 998 | 11.0% |
6 | 569 | 6.3% |
7 | 433 | 4.8% |
3 | 403 | 4.5% |
5 | 361 | 4.0% |
8 | 354 | 3.9% |
4 | 344 | 3.8% |
Other values (2) | 83 | 0.9% |
최초실시년도
Real number (ℝ)
MISSING
 
Distinct | 67 |
---|---|
Distinct (%) | 5.7% |
Missing | 91 |
Missing (%) | 7.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1991.9334 |
Minimum | 1910 |
---|---|
Maximum | 2018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.2 KiB |
Quantile statistics
Minimum | 1910 |
---|---|
5-th percentile | 1963 |
Q1 | 1981 |
median | 1997 |
Q3 | 2004 |
95-th percentile | 2007 |
Maximum | 2018 |
Range | 108 |
Interquartile range (IQR) | 23 |
Descriptive statistics
Standard deviation | 14.8142 |
---|---|
Coefficient of variation (CV) | 0.007437096 |
Kurtosis | 2.6032776 |
Mean | 1991.9334 |
Median Absolute Deviation (MAD) | 9 |
Skewness | -1.370025 |
Sum | 2332554 |
Variance | 219.46052 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2006 | 143 | 11.3% |
2007 | 75 | 5.9% |
1976 | 56 | 4.4% |
2001 | 53 | 4.2% |
1999 | 51 | 4.0% |
2005 | 51 | 4.0% |
1975 | 49 | 3.9% |
1998 | 49 | 3.9% |
1994 | 42 | 3.3% |
1996 | 41 | 3.2% |
Other values (57) | 561 | |
(Missing) | 91 | 7.2% |
Value | Count | Frequency (%) |
1910 | 3 | |
1925 | 1 | 0.1% |
1936 | 1 | 0.1% |
1937 | 1 | 0.1% |
1938 | 1 | 0.1% |
1940 | 1 | 0.1% |
1946 | 2 | |
1948 | 3 | |
1949 | 1 | 0.1% |
1952 | 2 |
Value | Count | Frequency (%) |
2018 | 1 | 0.1% |
2016 | 2 | 0.2% |
2010 | 1 | 0.1% |
2007 | 75 | |
2006 | 143 | |
2005 | 51 | 4.0% |
2004 | 37 | 2.9% |
2003 | 36 | 2.9% |
2002 | 32 | 2.5% |
2001 | 53 | 4.2% |
통계조사명
Text
Distinct | 1200 |
---|---|
Distinct (%) | 95.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.0 KiB |
Length
Max length | 40 |
---|---|
Median length | 28 |
Mean length | 10.744057 |
Min length | 2 |
Characters and Unicode
Total characters | 13559 |
---|---|
Distinct characters | 422 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 1171 ? |
---|---|
Unique (%) | 92.8% |
Sample
1st row | 주요수입상품의경쟁력실태조사 |
---|---|
2nd row | 임대공단및장기분할상환공단에대한수요조사 |
3rd row | 수입의파급효과와기업의대응방안조사 |
4th row | 고밀주택단지내시설배치에대한의식조사 |
5th row | 수출산업실태조사 |
Value | Count | Frequency (%) |
및 | 40 | 2.4% |
실태조사 | 27 | 1.6% |
조사 | 20 | 1.2% |
주민등록인구통계 | 18 | 1.1% |
교육통계 | 15 | 0.9% |
대한 | 10 | 0.6% |
기업의 | 10 | 0.6% |
중소기업 | 8 | 0.5% |
설비투자계획조사 | 7 | 0.4% |
관한 | 6 | 0.4% |
Other values (1407) | 1518 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 858 | 6.3% |
조 | 773 | 5.7% |
417 | 3.1% | |
업 | 407 | 3.0% |
계 | 356 | 2.6% |
기 | 345 | 2.5% |
실 | 328 | 2.4% |
통 | 323 | 2.4% |
태 | 292 | 2.2% |
황 | 205 | 1.5% |
Other values (412) | 9255 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 12847 | |
Space Separator | 420 | 3.1% |
Decimal Number | 154 | 1.1% |
Uppercase Letter | 61 | 0.4% |
Close Punctuation | 25 | 0.2% |
Open Punctuation | 25 | 0.2% |
Other Punctuation | 25 | 0.2% |
Lowercase Letter | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 858 | 6.7% |
조 | 773 | 6.0% |
업 | 407 | 3.2% |
계 | 356 | 2.8% |
기 | 345 | 2.7% |
실 | 328 | 2.6% |
통 | 323 | 2.5% |
태 | 292 | 2.3% |
황 | 205 | 1.6% |
수 | 182 | 1.4% |
Other values (369) | 8778 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 7 | |
D | 7 | |
T | 7 | |
B | 5 | |
C | 5 | |
R | 4 | 6.6% |
P | 4 | 6.6% |
F | 4 | 6.6% |
A | 3 | 4.9% |
G | 3 | 4.9% |
Other values (7) | 12 |
Decimal Number
Value | Count | Frequency (%) |
0 | 55 | |
2 | 26 | |
9 | 20 | 13.0% |
1 | 17 | 11.0% |
4 | 9 | 5.8% |
5 | 8 | 5.2% |
6 | 6 | 3.9% |
8 | 6 | 3.9% |
3 | 5 | 3.2% |
7 | 2 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 8 | |
' | 8 | |
. | 4 | |
· | 3 | 12.0% |
/ | 1 | 4.0% |
& | 1 | 4.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 20 | |
」 | 4 | 16.0% |
』 | 1 | 4.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 20 | |
「 | 4 | 16.0% |
『 | 1 | 4.0% |
Space Separator
Value | Count | Frequency (%) |
417 | ||
3 | 0.7% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 12847 | |
Common | 650 | 4.8% |
Latin | 62 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 858 | 6.7% |
조 | 773 | 6.0% |
업 | 407 | 3.2% |
계 | 356 | 2.8% |
기 | 345 | 2.7% |
실 | 328 | 2.6% |
통 | 323 | 2.5% |
태 | 292 | 2.3% |
황 | 205 | 1.6% |
수 | 182 | 1.4% |
Other values (369) | 8778 |
Common
Value | Count | Frequency (%) |
417 | ||
0 | 55 | 8.5% |
2 | 26 | 4.0% |
) | 20 | 3.1% |
9 | 20 | 3.1% |
( | 20 | 3.1% |
1 | 17 | 2.6% |
4 | 9 | 1.4% |
, | 8 | 1.2% |
5 | 8 | 1.2% |
Other values (15) | 50 | 7.7% |
Latin
Value | Count | Frequency (%) |
I | 7 | |
D | 7 | |
T | 7 | |
B | 5 | 8.1% |
C | 5 | 8.1% |
R | 4 | 6.5% |
P | 4 | 6.5% |
F | 4 | 6.5% |
A | 3 | 4.8% |
G | 3 | 4.8% |
Other values (8) | 13 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 12847 | |
ASCII | 696 | 5.1% |
None | 16 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 858 | 6.7% |
조 | 773 | 6.0% |
업 | 407 | 3.2% |
계 | 356 | 2.8% |
기 | 345 | 2.7% |
실 | 328 | 2.6% |
통 | 323 | 2.5% |
태 | 292 | 2.3% |
황 | 205 | 1.6% |
수 | 182 | 1.4% |
Other values (369) | 8778 |
ASCII
Value | Count | Frequency (%) |
417 | ||
0 | 55 | 7.9% |
2 | 26 | 3.7% |
) | 20 | 2.9% |
9 | 20 | 2.9% |
( | 20 | 2.9% |
1 | 17 | 2.4% |
4 | 9 | 1.3% |
, | 8 | 1.1% |
5 | 8 | 1.1% |
Other values (27) | 96 | 13.8% |
None
Value | Count | Frequency (%) |
」 | 4 | |
「 | 4 | |
· | 3 | |
3 | ||
『 | 1 | 6.2% |
』 | 1 | 6.2% |
영문통계조사명
Text
MISSING
 
Distinct | 807 |
---|---|
Distinct (%) | 65.6% |
Missing | 31 |
Missing (%) | 2.5% |
Memory size | 10.0 KiB |
Length
Max length | 80 |
---|---|
Median length | 71 |
Mean length | 25.760357 |
Min length | 1 |
Characters and Unicode
Total characters | 31711 |
---|---|
Distinct characters | 72 |
Distinct categories | 9 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 789 ? |
---|---|
Unique (%) | 64.1% |
Sample
1st row | Survey of the Competitiveness of Major Imported Goods |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row | Export Industry Survey |
Value | Count | Frequency (%) |
of | 354 | 8.1% |
survey | 286 | 6.6% |
statistics | 185 | 4.3% |
on | 163 | 3.7% |
the | 143 | 3.3% |
and | 123 | 2.8% |
in | 87 | 2.0% |
status | 53 | 1.2% |
for | 46 | 1.1% |
46 | 1.1% | |
Other values (1079) | 2866 |
Most occurring characters
Value | Count | Frequency (%) |
3897 | ||
e | 2561 | 8.1% |
t | 2475 | 7.8% |
i | 2195 | 6.9% |
n | 2128 | 6.7% |
o | 2040 | 6.4% |
a | 1937 | 6.1% |
s | 1908 | 6.0% |
r | 1676 | 5.3% |
u | 1175 | 3.7% |
Other values (62) | 9719 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 24484 | |
Space Separator | 3897 | 12.3% |
Uppercase Letter | 3114 | 9.8% |
Other Punctuation | 106 | 0.3% |
Dash Punctuation | 63 | 0.2% |
Decimal Number | 42 | 0.1% |
Open Punctuation | 2 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 2561 | |
t | 2475 | |
i | 2195 | |
n | 2128 | 8.7% |
o | 2040 | 8.3% |
a | 1937 | 7.9% |
s | 1908 | 7.8% |
r | 1676 | 6.8% |
u | 1175 | 4.8% |
c | 1035 | 4.2% |
Other values (16) | 5354 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 719 | |
C | 283 | 9.1% |
P | 228 | 7.3% |
R | 204 | 6.6% |
I | 186 | 6.0% |
E | 185 | 5.9% |
T | 161 | 5.2% |
M | 155 | 5.0% |
F | 142 | 4.6% |
A | 139 | 4.5% |
Other values (15) | 712 |
Decimal Number
Value | Count | Frequency (%) |
0 | 12 | |
1 | 11 | |
2 | 6 | |
9 | 4 | 9.5% |
4 | 2 | 4.8% |
5 | 2 | 4.8% |
3 | 2 | 4.8% |
8 | 2 | 4.8% |
6 | 1 | 2.4% |
Other Punctuation
Value | Count | Frequency (%) |
& | 47 | |
' | 25 | |
. | 17 | 16.0% |
, | 9 | 8.5% |
& | 4 | 3.8% |
/ | 3 | 2.8% |
: | 1 | 0.9% |
Space Separator
Value | Count | Frequency (%) |
3897 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 63 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 27598 | |
Common | 4113 | 13.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 2561 | 9.3% |
t | 2475 | 9.0% |
i | 2195 | 8.0% |
n | 2128 | 7.7% |
o | 2040 | 7.4% |
a | 1937 | 7.0% |
s | 1908 | 6.9% |
r | 1676 | 6.1% |
u | 1175 | 4.3% |
c | 1035 | 3.8% |
Other values (41) | 8468 |
Common
Value | Count | Frequency (%) |
3897 | ||
- | 63 | 1.5% |
& | 47 | 1.1% |
' | 25 | 0.6% |
. | 17 | 0.4% |
0 | 12 | 0.3% |
1 | 11 | 0.3% |
, | 9 | 0.2% |
2 | 6 | 0.1% |
& | 4 | 0.1% |
Other values (11) | 22 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 31707 | |
None | 4 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3897 | ||
e | 2561 | 8.1% |
t | 2475 | 7.8% |
i | 2195 | 6.9% |
n | 2128 | 6.7% |
o | 2040 | 6.4% |
a | 1937 | 6.1% |
s | 1908 | 6.0% |
r | 1676 | 5.3% |
u | 1175 | 3.7% |
Other values (61) | 9715 |
None
Value | Count | Frequency (%) |
& | 4 |
통계조사ID | 최초실시년도 | 통계조사명 | 영문통계조사명 | |
---|---|---|---|---|
0 | 1991016 | 1991 | 주요수입상품의경쟁력실태조사 | Survey of the Competitiveness of Major Imported Goods |
1 | 1992001 | 1992 | 임대공단및장기분할상환공단에대한수요조사 | |
2 | 1992002 | 1992 | 수입의파급효과와기업의대응방안조사 | |
3 | 1992003 | 1992 | 고밀주택단지내시설배치에대한의식조사 | |
4 | 1992004 | 1985 | 수출산업실태조사 | Export Industry Survey |
5 | 1992005 | 1992 | 산업내근로행태의변화와근로의질제고방안조사 | |
6 | 1992006 | 1992 | 주택청약관련저축자의의식조사 | |
7 | 1992007 | 1992 | 폐기물재자원화실태조사 | |
8 | 1992008 | 1992 | 주공이미지조사 | |
9 | 1992009 | 1992 | 세무행정에관한의견조사 |
통계조사ID | 최초실시년도 | 통계조사명 | 영문통계조사명 | |
---|---|---|---|---|
1252 | 2020049 | <NA> | 경기도경기종합지수 | <NA> |
1253 | 1993010 | <NA> | 경기도기본통계 | <NA> |
1254 | 2017058 | <NA> | 경기도청년통계 | <NA> |
1255 | 2020040 | <NA> | 경기도특별사법경찰범죄통계 | <NA> |
1256 | 2022001 | <NA> | 경기도주요관광지방문객실태조사 | <NA> |
1257 | 2020032 | <NA> | 경기도아동가구주거실태조사 | Survey on Residential Conditions of Households with children in Gyeonggi-do |
1258 | B21020180713171220 | <NA> | 경기도장래인구추계 | <NA> |
1259 | B21020200327123408 | <NA> | 과학기술정보통신부 통계자료(유선통신서비스 가입자 현황) | <NA> |
1260 | B21020210209150152 | <NA> | 경기도영유아통계 | <NA> |
1261 | B21020210218161736 | <NA> | 경기동행종합지수 | <NA> |