Overview

Dataset statistics

Number of variables: 11
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 96.4 B

Variable types

DateTime: 2
Categorical: 5
Numeric: 4

Dataset

Description: Sample data
Author: 한국평가데이터㈜
URL: https://bigdata-region.kr/#/dataset/4576ac79-26f5-4c51-98ad-01c3149bc75b

Alerts

기준년월 has a constant value [Constant]
외국인구분명 has a constant value [Constant]
등록일자 has a constant value [Constant]
작업자명 has a constant value [Constant]
성별코드 is highly overall correlated with 평균신용점수 and 1 other field [High correlation]
성별명 is highly overall correlated with 평균신용점수 and 1 other field [High correlation]
총인원수 is highly overall correlated with 평균신용점수 and 2 other fields [High correlation]
평균신용점수 is highly overall correlated with 총인원수 and 3 other fields [High correlation]
신용점수중위점수 is highly overall correlated with 총인원수 and 1 other field [High correlation]
시도명 is highly overall correlated with 총인원수 [High correlation]
총인원수 has unique values [Unique]
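
The Constant and Unique alerts are consistent with the distinct counts reported per variable. The sketch below illustrates one plausible rule: a column with a single distinct value is constant, and a column with as many distinct values as rows is unique. The `classify` helper and the rule itself are assumptions for illustration, not the profiler's documented implementation; the counts are copied from the Variables section.

```python
# Distinct counts as reported per variable; the dataset has 30 observations.
N_OBSERVATIONS = 30
DISTINCT = {
    "기준년월": 1, "시도명": 13, "성별코드": 2, "성별명": 2, "외국인구분명": 1,
    "연령": 25, "총인원수": 30, "평균신용점수": 24, "신용점수중위점수": 15,
    "등록일자": 1, "작업자명": 1,
}


def classify(n_distinct, n_rows):
    """Hypothetical rule: 1 distinct value -> Constant, all distinct -> Unique."""
    if n_distinct == 1:
        return "Constant"
    if n_distinct == n_rows:
        return "Unique"
    return None


alerts = {name: classify(d, N_OBSERVATIONS) for name, d in DISTINCT.items()}
constant = [name for name, alert in alerts.items() if alert == "Constant"]
unique = [name for name, alert in alerts.items() if alert == "Unique"]
print("Constant:", constant)
print("Unique:", unique)
```

Under this rule the four constant columns and the single unique column match the alerts listed above.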

Reproduction

Analysis started: 2023-12-10 13:55:07.688185
Analysis finished: 2023-12-10 13:55:11.488508
Duration: 3.8 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
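
The reported duration can be cross-checked from the two timestamps. This is a small sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-10 13:55:07.688185")
finished = datetime.fromisoformat("2023-12-10 13:55:11.488508")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```

The difference is about 3.800 seconds, which rounds to the 3.8 seconds shown in the report.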

Variables

기준년월
Date

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2020-05-01 00:00:00
Maximum: 2020-05-01 00:00:00
[Figure: histogram with fixed size bins (bins=1)]

시도명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 43.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 3
Median length: 2
Mean length: 2.0666667
Min length: 2

Unique

Unique: 6
Unique (%): 20.0%

Sample

1st row: 강원
2nd row: 강원
3rd row: 경북
4th row: 강원
5th row: 경북

Common Values

Value | Count | Frequency (%)
강원 | 5 | 16.7%
경북 | 5 | 16.7%
충남 | 5 | 16.7%
세종 | 3 | 10.0%
전남 | 2 | 6.7%
광주 | 2 | 6.7%
미분류 | 2 | 6.7%
울산 | 1 | 3.3%
대구 | 1 | 3.3%
경기 | 1 | 3.3%
Other values (3) | 3 | 10.0%
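
The frequency column is each count divided by the 30 observations, rounded to one decimal place. The sketch below recomputes the percentages for 시도명 from the counts listed above; the `pct` helper is a hypothetical name introduced for illustration.

```python
# Counts for the ten listed 시도명 values, copied from the Common Values table.
N_OBSERVATIONS = 30
COUNTS = {
    "강원": 5, "경북": 5, "충남": 5, "세종": 3, "전남": 2,
    "광주": 2, "미분류": 2, "울산": 1, "대구": 1, "경기": 1,
}


def pct(count, total):
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


frequencies = {value: pct(count, N_OBSERVATIONS) for value, count in COUNTS.items()}
# Rows not listed individually fall into the "Other values" bucket.
other_count = N_OBSERVATIONS - sum(COUNTS.values())
# Distinct (%) uses the same formula with the number of distinct values (13).
distinct_pct = pct(13, N_OBSERVATIONS)

for value, freq in frequencies.items():
    print(value, COUNTS[value], freq)
print("Other values", other_count, pct(other_count, N_OBSERVATIONS))
```

The same formula accounts for the Distinct (%) figure of 43.3% (13 distinct values out of 30 rows).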

Length

[Figure: histogram of lengths of the category]

성별코드
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: M
4th row: F
5th row: M

Common Values

Value | Count | Frequency (%)
F | 19 | 63.3%
M | 11 | 36.7%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

성별명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 여성
2nd row: 여성
3rd row: 남성
4th row: 여성
5th row: 남성

Common Values

Value | Count | Frequency (%)
여성 | 19 | 63.3%
남성 | 11 | 36.7%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

외국인구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내인
2nd row: 국내인
3rd row: 국내인
4th row: 국내인
5th row: 국내인

Common Values

Value | Count | Frequency (%)
국내인 | 30 | 100.0%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

연령
Real number (ℝ)

Distinct: 25
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 54.2
Minimum: 20
Maximum: 97
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 20
5-th percentile: 27.9
Q1: 37
Median: 47
Q3: 72.75
95-th percentile: 95.65
Maximum: 97
Range: 77
Interquartile range (IQR): 35.75

Descriptive statistics

Standard deviation: 22.305713
Coefficient of variation (CV): 0.41154452
Kurtosis: -0.71590322
Mean: 54.2
Median Absolute Deviation (MAD): 15
Skewness: 0.59511121
Sum: 1626
Variance: 497.54483
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=25)]
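
Several of the 연령 statistics are derived from one another, so they can be cross-checked without the raw data. The following sketch recomputes the mean, range, IQR, coefficient of variation and variance from the other reported figures; the variable names are illustrative and the inputs are copied from the report.

```python
# Reported values for 연령.
n = 30
total = 1626
std = 22.305713
q1, q3 = 37, 72.75
minimum, maximum = 20, 97

mean = total / n                # Sum / number of observations
value_range = maximum - minimum
iqr = q3 - q1                   # interquartile range
cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance is the squared standard deviation

print(f"mean={mean:.1f} range={value_range} iqr={iqr}")
print(f"cv={cv:.8f} variance={variance:.5f}")
```

Each recomputed value agrees with the report to the displayed precision.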
Common values

Value | Count | Frequency (%)
47 | 3 | 10.0%
36 | 2 | 6.7%
32 | 2 | 6.7%
97 | 2 | 6.7%
29 | 1 | 3.3%
41 | 1 | 3.3%
46 | 1 | 3.3%
55 | 1 | 3.3%
56 | 1 | 3.3%
78 | 1 | 3.3%
Other values (15) | 15 | 50.0%

Minimum 10 values

Value | Count | Frequency (%)
20 | 1 | 3.3%
27 | 1 | 3.3%
29 | 1 | 3.3%
30 | 1 | 3.3%
32 | 2 | 6.7%
36 | 2 | 6.7%
40 | 1 | 3.3%
41 | 1 | 3.3%
42 | 1 | 3.3%
45 | 1 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
97 | 2 | 6.7%
94 | 1 | 3.3%
89 | 1 | 3.3%
79 | 1 | 3.3%
78 | 1 | 3.3%
76 | 1 | 3.3%
74 | 1 | 3.3%
69 | 1 | 3.3%
62 | 1 | 3.3%
56 | 1 | 3.3%

총인원수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16896.1
Minimum: 115
Maximum: 65451
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 115
5-th percentile: 139.75
Q1: 6620.25
Median: 11445
Q3: 24843.75
95-th percentile: 43768.95
Maximum: 65451
Range: 65336
Interquartile range (IQR): 18223.5

Descriptive statistics

Standard deviation: 15554.732
Coefficient of variation (CV): 0.92061077
Kurtosis: 1.9590443
Mean: 16896.1
Median Absolute Deviation (MAD): 10486
Skewness: 1.2729461
Sum: 506883
Variance: 2.4194968 × 10^8
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=30)]
Common values

Value | Count | Frequency (%)
9029 | 1 | 3.3%
1652 | 1 | 3.3%
133 | 1 | 3.3%
148 | 1 | 3.3%
8403 | 1 | 3.3%
32749 | 1 | 3.3%
24456 | 1 | 3.3%
28368 | 1 | 3.3%
944 | 1 | 3.3%
115 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values

Value | Count | Frequency (%)
115 | 1 | 3.3%
133 | 1 | 3.3%
148 | 1 | 3.3%
217 | 1 | 3.3%
944 | 1 | 3.3%
974 | 1 | 3.3%
1652 | 1 | 3.3%
6026 | 1 | 3.3%
8403 | 1 | 3.3%
9029 | 1 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
65451 | 1 | 3.3%
46572 | 1 | 3.3%
40343 | 1 | 3.3%
32749 | 1 | 3.3%
31373 | 1 | 3.3%
28368 | 1 | 3.3%
25992 | 1 | 3.3%
24973 | 1 | 3.3%
24456 | 1 | 3.3%
24003 | 1 | 3.3%

평균신용점수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 24
Distinct (%): 80.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 835.3
Minimum: 813
Maximum: 868
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 813
5-th percentile: 815.9
Q1: 821.25
Median: 831.5
Q3: 846.5
95-th percentile: 864.1
Maximum: 868
Range: 55
Interquartile range (IQR): 25.25

Descriptive statistics

Standard deviation: 17.248988
Coefficient of variation (CV): 0.020650051
Kurtosis: -0.95960008
Mean: 835.3
Median Absolute Deviation (MAD): 12.5
Skewness: 0.59699247
Sum: 25059
Variance: 297.52759
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=24)]
Common values

Value | Count | Frequency (%)
836 | 2 | 6.7%
822 | 2 | 6.7%
828 | 2 | 6.7%
863 | 2 | 6.7%
817 | 2 | 6.7%
818 | 2 | 6.7%
813 | 1 | 3.3%
845 | 1 | 3.3%
862 | 1 | 3.3%
842 | 1 | 3.3%
Other values (14) | 14 | 46.7%

Minimum 10 values

Value | Count | Frequency (%)
813 | 1 | 3.3%
815 | 1 | 3.3%
817 | 2 | 6.7%
818 | 2 | 6.7%
820 | 1 | 3.3%
821 | 1 | 3.3%
822 | 2 | 6.7%
823 | 1 | 3.3%
824 | 1 | 3.3%
828 | 2 | 6.7%

Maximum 10 values

Value | Count | Frequency (%)
868 | 1 | 3.3%
865 | 1 | 3.3%
863 | 2 | 6.7%
862 | 1 | 3.3%
857 | 1 | 3.3%
852 | 1 | 3.3%
847 | 1 | 3.3%
845 | 1 | 3.3%
842 | 1 | 3.3%
839 | 1 | 3.3%

신용점수중위점수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 15
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 854.46667
Minimum: 819
Maximum: 906
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 819
5-th percentile: 819
Q1: 819
Median: 863
Q3: 878.75
95-th percentile: 899.55
Maximum: 906
Range: 87
Interquartile range (IQR): 59.75

Descriptive statistics

Standard deviation: 32.135203
Coefficient of variation (CV): 0.037608492
Kurtosis: -1.6067879
Mean: 854.46667
Median Absolute Deviation (MAD): 36
Skewness: 0.078723788
Sum: 25634
Variance: 1032.6713
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=15)]
Common values

Value | Count | Frequency (%)
819 | 11 | 36.7%
863 | 3 | 10.0%
899 | 2 | 6.7%
878 | 2 | 6.7%
869 | 2 | 6.7%
823 | 1 | 3.3%
852 | 1 | 3.3%
870 | 1 | 3.3%
892 | 1 | 3.3%
879 | 1 | 3.3%
Other values (5) | 5 | 16.7%

Minimum 10 values

Value | Count | Frequency (%)
819 | 11 | 36.7%
823 | 1 | 3.3%
845 | 1 | 3.3%
852 | 1 | 3.3%
863 | 3 | 10.0%
869 | 2 | 6.7%
870 | 1 | 3.3%
878 | 2 | 6.7%
879 | 1 | 3.3%
886 | 1 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
906 | 1 | 3.3%
900 | 1 | 3.3%
899 | 2 | 6.7%
892 | 1 | 3.3%
891 | 1 | 3.3%
886 | 1 | 3.3%
879 | 1 | 3.3%
878 | 2 | 6.7%
870 | 1 | 3.3%
869 | 2 | 6.7%

등록일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2021-11-21 00:00:00
Maximum: 2021-11-21 00:00:00
[Figure: histogram with fixed size bins (bins=1)]

작업자명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: KEDSYSTEM
2nd row: KEDSYSTEM
3rd row: KEDSYSTEM
4th row: KEDSYSTEM
5th row: KEDSYSTEM

Common Values

Value | Count | Frequency (%)
KEDSYSTEM | 30 | 100.0%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

Interactions

[Figures: 16 interaction plots; images not recoverable]

Correlations

[Figure: correlation heatmap]
Variable | 시도명 | 성별코드 | 성별명 | 연령 | 총인원수 | 평균신용점수 | 신용점수중위점수
시도명 | 1.000 | 0.596 | 0.596 | 0.241 | 0.855 | 0.657 | 0.632
성별코드 | 0.596 | 1.000 | 0.993 | 0.701 | 0.000 | 0.944 | 0.910
성별명 | 0.596 | 0.993 | 1.000 | 0.701 | 0.000 | 0.944 | 0.910
연령 | 0.241 | 0.701 | 0.701 | 1.000 | 0.646 | 0.000 | 0.197
총인원수 | 0.855 | 0.000 | 0.000 | 0.646 | 1.000 | 0.488 | 0.658
평균신용점수 | 0.657 | 0.944 | 0.944 | 0.000 | 0.488 | 1.000 | 0.571
신용점수중위점수 | 0.632 | 0.910 | 0.910 | 0.197 | 0.658 | 0.571 | 1.000

[Figure: correlation heatmap]
Variable | 성별코드 | 성별명 | 시도명
성별코드 | 1.000 | 0.926 | 0.425
성별명 | 0.926 | 1.000 | 0.425
시도명 | 0.425 | 0.425 | 1.000

[Figure: correlation heatmap]
Variable | 연령 | 총인원수 | 평균신용점수 | 신용점수중위점수 | 시도명 | 성별코드 | 성별명
연령 | 1.000 | -0.223 | 0.223 | -0.224 | 0.000 | 0.454 | 0.454
총인원수 | -0.223 | 1.000 | 0.615 | 0.681 | 0.530 | 0.000 | 0.000
평균신용점수 | 0.223 | 0.615 | 1.000 | 0.707 | 0.258 | 0.665 | 0.665
신용점수중위점수 | -0.224 | 0.681 | 0.707 | 1.000 | 0.476 | 0.423 | 0.423
시도명 | 0.000 | 0.530 | 0.258 | 0.476 | 1.000 | 0.425 | 0.425
성별코드 | 0.454 | 0.000 | 0.665 | 0.423 | 0.425 | 1.000 | 0.926
성별명 | 0.454 | 0.000 | 0.665 | 0.423 | 0.425 | 0.926 | 1.000
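
The High correlation alerts in the Overview can be related to the last of the three matrices above (the one whose first column is 연령). The sketch below lists, for each variable, the other variables whose absolute coefficient is at least 0.5. The 0.5 cut-off and the `highly_correlated` helper are assumptions made for illustration; the report itself does not state the threshold it used.

```python
# Last correlation matrix in the report, transcribed row by row.
NAMES = ["연령", "총인원수", "평균신용점수", "신용점수중위점수", "시도명", "성별코드", "성별명"]
MATRIX = [
    [1.000, -0.223, 0.223, -0.224, 0.000, 0.454, 0.454],
    [-0.223, 1.000, 0.615, 0.681, 0.530, 0.000, 0.000],
    [0.223, 0.615, 1.000, 0.707, 0.258, 0.665, 0.665],
    [-0.224, 0.681, 0.707, 1.000, 0.476, 0.423, 0.423],
    [0.000, 0.530, 0.258, 0.476, 1.000, 0.425, 0.425],
    [0.454, 0.000, 0.665, 0.423, 0.425, 1.000, 0.926],
    [0.454, 0.000, 0.665, 0.423, 0.425, 0.926, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off, not stated in the report


def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            result[name] = partners
    return result


high = highly_correlated(NAMES, MATRIX, THRESHOLD)
for name, partners in high.items():
    print(name, "->", ", ".join(partners))
```

With this assumed threshold, the partner counts per variable line up with the alert texts (for example, 평균신용점수 has four partners and 시도명 has one), and 연령 produces no alert.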

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

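First rows

Index | 기준년월 | 시도명 | 성별코드 | 성별명 | 외국인구분명 | 연령 | 총인원수 | 평균신용점수 | 신용점수중위점수 | 등록일자 | 작업자명
0 | 2020-05 | 강원 | F | 여성 | 국내인 | 29 | 9029 | 813 | 823 | 2021-11-21 | KEDSYSTEM
1 | 2020-05 | 강원 | F | 여성 | 국내인 | 32 | 9357 | 821 | 852 | 2021-11-21 | KEDSYSTEM
2 | 2020-05 | 경북 | M | 남성 | 국내인 | 62 | 46572 | 868 | 899 | 2021-11-21 | KEDSYSTEM
3 | 2020-05 | 강원 | F | 여성 | 국내인 | 36 | 12082 | 824 | 863 | 2021-11-21 | KEDSYSTEM
4 | 2020-05 | 경북 | M | 남성 | 국내인 | 69 | 24973 | 865 | 878 | 2021-11-21 | KEDSYSTEM
5 | 2020-05 | 충남 | F | 여성 | 국내인 | 36 | 10405 | 830 | 869 | 2021-11-21 | KEDSYSTEM
6 | 2020-05 | 전남 | F | 여성 | 국내인 | 89 | 6026 | 822 | 819 | 2021-11-21 | KEDSYSTEM
7 | 2020-05 | 강원 | F | 여성 | 국내인 | 40 | 18732 | 828 | 870 | 2021-11-21 | KEDSYSTEM
8 | 2020-05 | 경북 | M | 남성 | 국내인 | 74 | 16955 | 852 | 819 | 2021-11-21 | KEDSYSTEM
9 | 2020-05 | 충남 | M | 남성 | 국내인 | 47 | 24003 | 857 | 892 | 2021-11-21 | KEDSYSTEM

Last rows

Index | 기준년월 | 시도명 | 성별코드 | 성별명 | 외국인구분명 | 연령 | 총인원수 | 평균신용점수 | 신용점수중위점수 | 등록일자 | 작업자명
20 | 2020-05 | 강원 | F | 여성 | 국내인 | 49 | 25992 | 828 | 863 | 2021-11-21 | KEDSYSTEM
21 | 2020-05 | 경북 | M | 남성 | 국내인 | 76 | 10735 | 847 | 819 | 2021-11-21 | KEDSYSTEM
22 | 2020-05 | 미분류 | F | 여성 | 국내인 | 79 | 115 | 815 | 819 | 2021-11-21 | KEDSYSTEM
23 | 2020-05 | 세종 | M | 남성 | 국내인 | 78 | 944 | 842 | 819 | 2021-11-21 | KEDSYSTEM
24 | 2020-05 | 전북 | F | 여성 | 국내인 | 47 | 28368 | 836 | 878 | 2021-11-21 | KEDSYSTEM
25 | 2020-05 | 충남 | M | 남성 | 국내인 | 56 | 24456 | 863 | 900 | 2021-11-21 | KEDSYSTEM
26 | 2020-05 | 경남 | F | 여성 | 국내인 | 55 | 32749 | 836 | 869 | 2021-11-21 | KEDSYSTEM
27 | 2020-05 | 광주 | M | 남성 | 국내인 | 46 | 8403 | 862 | 906 | 2021-11-21 | KEDSYSTEM
28 | 2020-05 | 미분류 | M | 남성 | 국내인 | 47 | 148 | 818 | 819 | 2021-11-21 | KEDSYSTEM
29 | 2020-05 | 세종 | F | 여성 | 국내인 | 97 | 133 | 822 | 819 | 2021-11-21 | KEDSYSTEM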