Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 69 |
Missing cells | 67 |
Missing cells (%) | 19.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.0 KiB |
Average record size in memory | 44.9 B |
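The missing-cell percentage can be checked from the counts in this table. A minimal sketch, assuming the percentage is missing cells divided by total cells (variables × observations); the variable names are illustrative and do not come from the report:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 69
missing_cells = 67

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations

# Share of all cells that are empty, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing = {missing_pct:.1f}%")
```

Under that assumption the result agrees with the 19.4% reported above.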
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Categorical | 1 |
Dataset
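Description | A data file on certificate issuance fees at unmanned civil-service kiosks (무인민원발급기) in Gyeyang-gu, Incheon Metropolitan City. It lists each certificate type together with its in-district fee (관내) and out-of-district fee (관외). |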
---|---|
Author | 인천광역시 계양구 (Gyeyang-gu, Incheon Metropolitan City) |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15101766&srcSe=7661IVAWM27C61E190 |
비고 has constant value "" | Constant |
수수료_관외 is highly overall correlated with 수수료_관내 | High correlation |
수수료_관내 is highly overall correlated with 수수료_관외 | High correlation |
수수료_관내 is highly imbalanced (59.9%) | Imbalance |
비고 has 67 (97.1%) missing values | Missing |
연번 has unique values | Unique |
증명서종류 has unique values | Unique |
수수료_관외 has 50 (72.5%) zeros | Zeros |
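The 59.9% imbalance figure for 수수료_관내 is consistent with a normalized-entropy score, 1 − H / log2(k), where H is the Shannon entropy of the value counts and k is the number of distinct values. That formula is an assumption here, since the report does not state how the alert is computed. The sketch below applies it to the value counts listed for 수수료_관내 (58, 4, 3, 3, 1); `imbalance_score` is a hypothetical helper name.

```python
import math

# Value counts for 수수료_관내 as listed in its frequency table.
counts = {"0": 58, "500": 4, "800": 3, "1000": 3, "300": 1}


def imbalance_score(value_counts):
    """Return 1 - H / log2(k): 0 for a uniform column, approaching 1 when one value dominates."""
    total = sum(value_counts)
    k = len(value_counts)
    if k < 2:
        return 0.0  # a single class has no meaningful imbalance
    # Shannon entropy in bits.
    entropy = -sum((c / total) * math.log2(c / total) for c in value_counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

A score near 0 would mean the five fee levels are evenly used. Here the zero fee accounts for 58 of 69 rows, which pushes the score up.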
Reproduction
Analysis started | 2024-04-21 16:01:49.589659 |
---|---|
Analysis finished | 2024-04-21 16:01:50.721502 |
Duration | 1.13 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
UNIQUE
 
Distinct | 69 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 35 |
Minimum | 1 |
---|---|
Maximum | 69 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 749.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4.4 |
Q1 | 18 |
median | 35 |
Q3 | 52 |
95-th percentile | 65.6 |
Maximum | 69 |
Range | 68 |
Interquartile range (IQR) | 34 |
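The fractional percentiles (4.4 and 65.6) suggest linear interpolation between neighbouring order statistics. The sketch below reproduces the table under two assumptions that the report does not state outright: that 연번 holds the integers 1 through 69 (consistent with 69 distinct values, minimum 1, maximum 69), and that quantiles are interpolated linearly. `quantile_linear` is a hypothetical helper.

```python
import math


def quantile_linear(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)          # fractional index into the sorted data
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: 연번 is the running number 1..69.
values = list(range(1, 70))

for label, q in [("5-th percentile", 0.05), ("Q1", 0.25), ("median", 0.50),
                 ("Q3", 0.75), ("95-th percentile", 0.95)]:
    print(label, round(quantile_linear(values, q), 1))

iqr = quantile_linear(values, 0.75) - quantile_linear(values, 0.25)
print("IQR", iqr)
```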
Descriptive statistics
Standard deviation | 20.062403 |
---|---|
Coefficient of variation (CV) | 0.5732115 |
Kurtosis | -1.2 |
Mean | 35 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 0 |
Sum | 2415 |
Variance | 402.5 |
Monotonicity | Strictly increasing |
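These figures can be cross-checked with the Python standard library. The sketch assumes 연번 is the sequence 1 through 69 and that the report uses the sample variance (divisor n − 1); a variance of 402.5 is consistent with that convention.

```python
import statistics

# Assumption: 연번 is the running number 1..69.
values = list(range(1, 70))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median of the distances from the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 7), mad)
```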
Value | Count | Frequency (%) |
1 | 1 | 1.4% |
45 | 1 | 1.4% |
51 | 1 | 1.4% |
50 | 1 | 1.4% |
49 | 1 | 1.4% |
48 | 1 | 1.4% |
47 | 1 | 1.4% |
46 | 1 | 1.4% |
44 | 1 | 1.4% |
53 | 1 | 1.4% |
Other values (59) | 59 | 85.5% |
Value | Count | Frequency (%) |
1 | 1 | 1.4% |
2 | 1 | 1.4% |
3 | 1 | 1.4% |
4 | 1 | 1.4% |
5 | 1 | 1.4% |
6 | 1 | 1.4% |
7 | 1 | 1.4% |
8 | 1 | 1.4% |
9 | 1 | 1.4% |
10 | 1 | 1.4% |
Value | Count | Frequency (%) |
69 | 1 | 1.4% |
68 | 1 | 1.4% |
67 | 1 | 1.4% |
66 | 1 | 1.4% |
65 | 1 | 1.4% |
64 | 1 | 1.4% |
63 | 1 | 1.4% |
62 | 1 | 1.4% |
61 | 1 | 1.4% |
60 | 1 | 1.4% |
증명서종류
Text
UNIQUE
 
Distinct | 69 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 680.0 B |
Length
Max length | 29 |
---|---|
Median length | 23 |
Mean length | 13.826087 |
Min length | 4 |
Characters and Unicode
Total characters | 954 |
---|---|
Distinct characters | 147 |
Distinct categories | 6 |
Distinct scripts | 2 |
Distinct blocks | 3 |
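The category names used in the tables that follow (Other Letter, Space Separator, Open Punctuation, and so on) are Unicode general categories. A minimal sketch of how such counts can be obtained with Python's `unicodedata` module, applied to the fourth sample row of 증명서종류; whether the report counts characters exactly this way is an assumption.

```python
import unicodedata
from collections import Counter

# Fourth sample row of 증명서종류, normalized to NFC so each Hangul syllable is one code point.
sample = unicodedata.normalize("NFC", "토지(임야)대장등본, 대지권등록부")

# Two-letter general category codes: Lo = Other Letter, Zs = Space Separator,
# Ps = Open Punctuation, Pe = Close Punctuation, Po = Other Punctuation.
categories = Counter(unicodedata.category(ch) for ch in sample)

print(len(sample), dict(categories))
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category counts for this column.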
Unique
Unique | 69 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 주민등록 등초본 |
---|---|
2nd row | 개별공시지가확인서 |
3rd row | 토지이용계획확인서 |
4th row | 토지(임야)대장등본, 대지권등록부 |
5th row | 건축물대장 |
Value | Count | Frequency (%) |
증명 | 4 | 2.9% |
납부확인서 | 4 | 2.9% |
졸업자 | 4 | 2.9% |
지역가입자 | 3 | 2.1% |
건강장기요양보험료 | 3 | 2.1% |
검정고시 | 3 | 2.1% |
고용보험 | 3 | 2.1% |
산재보험 | 2 | 1.4% |
2003년이후 | 2 | 1.4% |
직장가입자 | 2 | 1.4% |
Other values (100) | 110 | 78.6% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 71 | 7.4% |
서 | 51 | 5.3% |
증 | 44 | 4.6% |
명 | 44 | 4.6% |
인 | 30 | 3.1% |
( | 30 | 3.1% |
) | 30 | 3.1% |
자 | 29 | 3.0% |
부 | 18 | 1.9% |
용 | 17 | 1.8% |
Other values (137) | 590 | 61.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 786 | 82.4% |
Space Separator | 71 | 7.4% |
Open Punctuation | 30 | 3.1% |
Close Punctuation | 30 | 3.1% |
Other Punctuation | 21 | 2.2% |
Decimal Number | 16 | 1.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 51 | 6.5% |
증 | 44 | 5.6% |
명 | 44 | 5.6% |
인 | 30 | 3.8% |
자 | 29 | 3.7% |
부 | 18 | 2.3% |
용 | 17 | 2.2% |
보 | 17 | 2.2% |
험 | 16 | 2.0% |
업 | 16 | 2.0% |
Other values (125) | 504 | 64.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 6 | 37.5% |
2 | 4 | 25.0% |
3 | 3 | 18.8% |
1 | 1 | 6.2% |
9 | 1 | 6.2% |
8 | 1 | 6.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 13 | 61.9% |
/ | 5 | 23.8% |
· | 3 | 14.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 71 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 30 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 30 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 786 | 82.4% |
Common | 168 | 17.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 51 | 6.5% |
증 | 44 | 5.6% |
명 | 44 | 5.6% |
인 | 30 | 3.8% |
자 | 29 | 3.7% |
부 | 18 | 2.3% |
용 | 17 | 2.2% |
보 | 17 | 2.2% |
험 | 16 | 2.0% |
업 | 16 | 2.0% |
Other values (125) | 504 | 64.1% |
Common
Value | Count | Frequency (%) |
(space) | 71 | 42.3% |
( | 30 | 17.9% |
) | 30 | 17.9% |
, | 13 | 7.7% |
0 | 6 | 3.6% |
/ | 5 | 3.0% |
2 | 4 | 2.4% |
· | 3 | 1.8% |
3 | 3 | 1.8% |
1 | 1 | 0.6% |
Other values (2) | 2 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 786 | 82.4% |
ASCII | 165 | 17.3% |
None | 3 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 71 | 43.0% |
( | 30 | 18.2% |
) | 30 | 18.2% |
, | 13 | 7.9% |
0 | 6 | 3.6% |
/ | 5 | 3.0% |
2 | 4 | 2.4% |
3 | 3 | 1.8% |
1 | 1 | 0.6% |
9 | 1 | 0.6% |
Hangul
Value | Count | Frequency (%) |
서 | 51 | 6.5% |
증 | 44 | 5.6% |
명 | 44 | 5.6% |
인 | 30 | 3.8% |
자 | 29 | 3.7% |
부 | 18 | 2.3% |
용 | 17 | 2.2% |
보 | 17 | 2.2% |
험 | 16 | 2.0% |
업 | 16 | 2.0% |
Other values (125) | 504 | 64.1% |
None
Value | Count | Frequency (%) |
· | 3 | 100.0% |
수수료_관내
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 7.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 680.0 B |
0 | 58 |
---|---|
500 | 4 |
800 | 3 |
1000 | 3 |
300 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.3623188 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.4% |
Sample
1st row | 0 |
---|---|
2nd row | 800 |
3rd row | 1000 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
0 | 58 | 84.1% |
500 | 4 | 5.8% |
800 | 3 | 4.3% |
1000 | 3 | 4.3% |
300 | 1 | 1.4% |
수수료_관외
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 160.86957 |
Minimum | 0 |
---|---|
Maximum | 1000 |
Zeros | 50 |
Zeros (%) | 72.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 749.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 300 |
95-th percentile | 800 |
Maximum | 1000 |
Range | 1000 |
Interquartile range (IQR) | 300 |
Descriptive statistics
Standard deviation | 287.5952 |
---|---|
Coefficient of variation (CV) | 1.7877539 |
Kurtosis | 1.5911934 |
Mean | 160.86957 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.6282324 |
Sum | 11100 |
Variance | 82710.997 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 50 | 72.5% |
500 | 12 | 17.4% |
1000 | 3 | 4.3% |
800 | 2 | 2.9% |
200 | 1 | 1.4% |
300 | 1 | 1.4% |
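Because 수수료_관외 takes only six distinct values, its summary statistics can be rebuilt from this frequency table alone. A minimal sketch, assuming the sample variance (divisor n − 1) is the convention in use; the dictionary simply restates the value and count columns.

```python
import statistics

# Value -> count, copied from the frequency table for 수수료_관외.
freq = {0: 50, 500: 12, 1000: 3, 800: 2, 200: 1, 300: 1}

# Expand the table back into one fee per observation.
values = [value for value, count in freq.items() for _ in range(count)]

n = len(values)
total = sum(values)
mean = total / n
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)
median = statistics.median(values)
zeros_pct = 100 * freq[0] / n

print(n, total, round(mean, 5), round(variance, 3), round(std, 4), median, f"{zeros_pct:.1f}%")
```

The median is 0 because more than half of the certificates carry no out-of-district fee. The same fact explains why the median absolute deviation is 0.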
Value | Count | Frequency (%) |
0 | 50 | 72.5% |
200 | 1 | 1.4% |
300 | 1 | 1.4% |
500 | 12 | 17.4% |
800 | 2 | 2.9% |
1000 | 3 | 4.3% |
Value | Count | Frequency (%) |
1000 | 3 | 4.3% |
800 | 2 | 2.9% |
500 | 12 | 17.4% |
300 | 1 | 1.4% |
200 | 1 | 1.4% |
0 | 50 | 72.5% |
비고
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 50.0% |
Missing | 67 |
Missing (%) | 97.1% |
Memory size | 680.0 B |
Value | Count | Frequency (%) |
관외 | 2 | 50.0% |
불가 | 2 | 50.0% |
Most occurring characters
Value | Count | Frequency (%) |
관 | 2 | 20.0% |
외 | 2 | 20.0% |
(space) | 2 | 20.0% |
불 | 2 | 20.0% |
가 | 2 | 20.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8 | 80.0% |
Space Separator | 2 | 20.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
관 | 2 | 25.0% |
외 | 2 | 25.0% |
불 | 2 | 25.0% |
가 | 2 | 25.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8 | 80.0% |
Common | 2 | 20.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
관 | 2 | 25.0% |
외 | 2 | 25.0% |
불 | 2 | 25.0% |
가 | 2 | 25.0% |
Common
Value | Count | Frequency (%) |
(space) | 2 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8 | 80.0% |
ASCII | 2 | 20.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
관 | 2 | 25.0% |
외 | 2 | 25.0% |
불 | 2 | 25.0% |
가 | 2 | 25.0% |
ASCII
Value | Count | Frequency (%) |
(space) | 2 | 100.0% |
Variable | 연번 | 증명서종류 | 수수료_관내 | 수수료_관외 |
---|---|---|---|---|
연번 | 1.000 | 1.000 | 0.610 | 0.622 |
증명서종류 | 1.000 | 1.000 | 1.000 | 1.000 |
수수료_관내 | 0.610 | 1.000 | 1.000 | 0.888 |
수수료_관외 | 0.622 | 1.000 | 0.888 | 1.000 |
Variable | 연번 | 수수료_관외 | 수수료_관내 |
---|---|---|---|
연번 | 1.000 | -0.295 | 0.283 |
수수료_관외 | -0.295 | 1.000 | 0.815 |
수수료_관내 | 0.283 | 0.815 | 1.000 |
Index | 연번 | 증명서종류 | 수수료_관내 | 수수료_관외 | 비고 |
---|---|---|---|---|---|
0 | 1 | 주민등록 등초본 | 0 | 200 | <NA> |
1 | 2 | 개별공시지가확인서 | 800 | 800 | <NA> |
2 | 3 | 토지이용계획확인서 | 1000 | 1000 | <NA> |
3 | 4 | 토지(임야)대장등본, 대지권등록부 | 500 | 500 | <NA> |
4 | 5 | 건축물대장 | 500 | 500 | <NA> |
5 | 6 | 자동차등록원부(갑, 을) | 300 | 300 | <NA> |
6 | 7 | 건설기계등록원부(갑, 을) | 500 | 500 | <NA> |
7 | 8 | 수급자증명서 | 0 | 0 | <NA> |
8 | 9 | 장애인증명서 | 0 | 0 | <NA> |
9 | 10 | 한부모가족증명서 | 0 | 0 | <NA> |
Index | 연번 | 증명서종류 | 수수료_관내 | 수수료_관외 | 비고 |
---|---|---|---|---|---|
59 | 60 | 산재보험 자격이력내역서(근로자용) | 0 | 0 | <NA> |
60 | 61 | 산재보험 일용근로내역서(근로자용) | 0 | 0 | <NA> |
61 | 62 | 보험급여 지급확인원(근로자용) | 0 | 0 | <NA> |
62 | 63 | 고용·산재보험가입증명원(법인/개인) | 0 | 0 | <NA> |
63 | 64 | 고용·산재보험 신고 및 완납여부 증명원(법인/개인) | 0 | 0 | <NA> |
64 | 65 | 산재요양승인반려여부확인서(법인/개인) | 0 | 0 | <NA> |
65 | 66 | 여권발급기록증명서(국/영) | 0 | 0 | <NA> |
66 | 67 | 여권발급신청서류증명서 | 0 | 0 | <NA> |
67 | 68 | 여권실효확인서(국/영) | 0 | 0 | <NA> |
68 | 69 | 여권정보증명서 | 0 | 0 | <NA> |