Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 33 |
Missing cells | 37 |
Missing cells (%) | 18.7% |
Duplicate rows | 1 |
Duplicate rows (%) | 3.0% |
Total size in memory | 1.8 KiB |
Average record size in memory | 57.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 1 |
Categorical | 3 |
Unsupported | 1 |
Dataset
Description | 광주광역시 동부소방서 1급선임대상 현황에 대한 데이터로 동부소방서 관내 각 안전센터별 구분한 수치자료를 제공합니다. |
---|---|
Author | 광주광역시 |
URL | https://www.data.go.kr/data/15054938/fileData.do |
Dataset has 1 (3.0%) duplicate rows | Duplicates |
지산 is highly overall correlated with 대인 | High correlation |
용산 is highly overall correlated with 합계 and 1 other fields | High correlation |
대인 is highly overall correlated with 합계 and 2 other fields | High correlation |
합계 is highly overall correlated with 대인 and 1 other fields | High correlation |
용산 is highly imbalanced (60.2%) | Imbalance |
종류 has 2 (6.1%) missing values | Missing |
합계 has 2 (6.1%) missing values | Missing |
Unnamed: 5 has 33 (100.0%) missing values | Missing |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
합계 has 20 (60.6%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 08:49:23.335710 |
---|---|
Analysis finished | 2023-12-12 08:49:24.040570 |
Duration | 0.7 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
종류
Text
MISSING
 
Distinct | 31 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 6.1% |
Memory size | 396.0 B |
Value | Count | Frequency (%) |
및 | 6 | 13.3% |
공동주택(기숙사 | 1 | 2.2% |
방송통신시설 | 1 | 2.2% |
동물 | 1 | 2.2% |
식물 | 1 | 2.2% |
관련 | 1 | 2.2% |
분뇨 | 1 | 2.2% |
쓰레기 | 1 | 2.2% |
교정 | 1 | 2.2% |
군사 | 1 | 2.2% |
Other values (30) | 30 |
Most occurring characters
Value | Count | Frequency (%) |
설 | 17 | 10.1% |
시 | 17 | 10.1% |
14 | 8.3% | |
및 | 6 | 3.6% |
동 | 5 | 3.0% |
공 | 4 | 2.4% |
물 | 4 | 2.4% |
장 | 4 | 2.4% |
련 | 3 | 1.8% |
지 | 3 | 1.8% |
Other values (71) | 92 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 151 | |
Space Separator | 14 | 8.3% |
Open Punctuation | 2 | 1.2% |
Close Punctuation | 2 | 1.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
설 | 17 | 11.3% |
시 | 17 | 11.3% |
및 | 6 | 4.0% |
동 | 5 | 3.3% |
공 | 4 | 2.6% |
물 | 4 | 2.6% |
장 | 4 | 2.6% |
련 | 3 | 2.0% |
지 | 3 | 2.0% |
교 | 3 | 2.0% |
Other values (68) | 85 |
Space Separator
Value | Count | Frequency (%) |
14 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 151 | |
Common | 18 | 10.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
설 | 17 | 11.3% |
시 | 17 | 11.3% |
및 | 6 | 4.0% |
동 | 5 | 3.3% |
공 | 4 | 2.6% |
물 | 4 | 2.6% |
장 | 4 | 2.6% |
련 | 3 | 2.0% |
지 | 3 | 2.0% |
교 | 3 | 2.0% |
Other values (68) | 85 |
Common
Value | Count | Frequency (%) |
14 | ||
( | 2 | 11.1% |
) | 2 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 151 | |
ASCII | 18 | 10.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
설 | 17 | 11.3% |
시 | 17 | 11.3% |
및 | 6 | 4.0% |
동 | 5 | 3.3% |
공 | 4 | 2.6% |
물 | 4 | 2.6% |
장 | 4 | 2.6% |
련 | 3 | 2.0% |
지 | 3 | 2.0% |
교 | 3 | 2.0% |
Other values (68) | 85 |
ASCII
Value | Count | Frequency (%) |
14 | ||
( | 2 | 11.1% |
) | 2 | 11.1% |
합계
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 7 |
---|---|
Distinct (%) | 22.6% |
Missing | 2 |
Missing (%) | 6.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6451613 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 20 |
Zeros (%) | 60.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 429.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1.5 |
95-th percentile | 8.5 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 1.5 |
Descriptive statistics
Standard deviation | 4.0045673 |
---|---|
Coefficient of variation (CV) | 2.4341487 |
Kurtosis | 12.9082 |
Mean | 1.6451613 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.4947345 |
Sum | 51 |
Variance | 16.036559 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 20 | |
2 | 4 | 12.1% |
1 | 3 | 9.1% |
4 | 1 | 3.0% |
5 | 1 | 3.0% |
12 | 1 | 3.0% |
19 | 1 | 3.0% |
(Missing) | 2 | 6.1% |
Value | Count | Frequency (%) |
0 | 20 | |
1 | 3 | 9.1% |
2 | 4 | 12.1% |
4 | 1 | 3.0% |
5 | 1 | 3.0% |
12 | 1 | 3.0% |
19 | 1 | 3.0% |
Value | Count | Frequency (%) |
19 | 1 | 3.0% |
12 | 1 | 3.0% |
5 | 1 | 3.0% |
4 | 1 | 3.0% |
2 | 4 | 12.1% |
1 | 3 | 9.1% |
0 | 20 |
대인
Categorical
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 18.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 396.0 B |
<NA> | |
---|---|
1 | |
2 | 2 |
4 | 1 |
10 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.3333333 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 9.1% |
Sample
1st row | 1 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | 2 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 25 | |
1 | 3 | 9.1% |
2 | 2 | 6.1% |
4 | 1 | 3.0% |
10 | 1 | 3.0% |
15 | 1 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 25 | |
1 | 3 | 9.1% |
2 | 2 | 6.1% |
4 | 1 | 3.0% |
10 | 1 | 3.0% |
15 | 1 | 3.0% |
용산
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 9.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 396.0 B |
<NA> | |
---|---|
1 | |
3 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6363636 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | 1 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 29 | |
1 | 3 | 9.1% |
3 | 1 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 29 | |
1 | 3 | 9.1% |
3 | 1 | 3.0% |
지산
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 9.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 396.0 B |
<NA> | |
---|---|
1 | |
2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.4545455 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 27 | |
1 | 3 | 9.1% |
2 | 3 | 9.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 27 | |
1 | 3 | 9.1% |
2 | 3 | 9.1% |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 33 |
---|---|
Missing (%) | 100.0% |
Memory size | 429.0 B |
종류 | 합계 | 대인 | 용산 | 지산 | |
---|---|---|---|---|---|
종류 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
합계 | 1.000 | 1.000 | 0.936 | 1.000 | 0.000 |
대인 | 1.000 | 0.936 | 1.000 | 1.000 | 1.000 |
용산 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
지산 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 |
지산 | 용산 | 대인 | |
---|---|---|---|
지산 | 1.000 | 0.000 | 1.000 |
용산 | 0.000 | 1.000 | 1.000 |
대인 | 1.000 | 1.000 | 1.000 |
합계 | 대인 | 용산 | 지산 | |
---|---|---|---|---|
합계 | 1.000 | 0.565 | 0.707 | 0.000 |
대인 | 0.565 | 1.000 | 1.000 | 1.000 |
용산 | 0.707 | 1.000 | 1.000 | 0.000 |
지산 | 0.000 | 1.000 | 0.000 | 1.000 |
종류 | 합계 | 대인 | 용산 | 지산 | Unnamed: 5 | |
---|---|---|---|---|---|---|
0 | 공동주택(아파트) | 2 | 1 | 1 | <NA> | <NA> |
1 | 공동주택(기숙사) | 0 | <NA> | <NA> | <NA> | <NA> |
2 | 근린생활 | 0 | <NA> | <NA> | <NA> | <NA> |
3 | 문화 및 집회시설 | 2 | 2 | <NA> | <NA> | <NA> |
4 | 종교시설 | 0 | <NA> | <NA> | <NA> | <NA> |
5 | 판매시설 | 4 | 4 | <NA> | <NA> | <NA> |
6 | 운수시설 | 0 | <NA> | <NA> | <NA> | <NA> |
7 | 의료시설 | 2 | <NA> | 1 | 1 | <NA> |
8 | 교육연구시설 | 5 | 2 | 1 | 2 | <NA> |
9 | 노유자시설 | 0 | <NA> | <NA> | <NA> | <NA> |
종류 | 합계 | 대인 | 용산 | 지산 | Unnamed: 5 | |
---|---|---|---|---|---|---|
23 | 발전시설 | 0 | <NA> | <NA> | <NA> | <NA> |
24 | 묘지관련시설 | 0 | <NA> | <NA> | <NA> | <NA> |
25 | 관광휴게시설 | 0 | <NA> | <NA> | <NA> | <NA> |
26 | 장례식장 | 0 | <NA> | <NA> | <NA> | <NA> |
27 | 지하가 | 1 | 1 | <NA> | <NA> | <NA> |
28 | 지하구 | 0 | <NA> | <NA> | <NA> | <NA> |
29 | 문화재 | 0 | <NA> | <NA> | <NA> | <NA> |
30 | 복합건축물 | 19 | 15 | 3 | 1 | <NA> |
31 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
32 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
종류 | 합계 | 대인 | 용산 | 지산 | # duplicates | |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |