Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory683.6 KiB
Average record size in memory70.0 B

Variable types

Numeric4
Categorical3

Dataset

Description한국부동산원(구.한국감정원)에서 제공하는 부동산 거래 통계를 조회할 수 있는 서비스로 충남의 외국인거래 건수 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2566

Alerts

지역명 is highly overall correlated with 번호 and 2 other fieldsHigh correlation
지역구분 레벨 is highly overall correlated with 번호 and 2 other fieldsHigh correlation
번호 is highly overall correlated with 지역코드 and 2 other fieldsHigh correlation
지역코드 is highly overall correlated with 번호 and 2 other fieldsHigh correlation
번호 has unique valuesUnique
외국인거래_건수 has 5962 (59.6%) zerosZeros

Reproduction

Analysis started2024-01-09 21:04:51.813662
Analysis finished2024-01-09 21:04:54.070066
Duration2.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6587.4185
Minimum1
Maximum13087
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:04:54.134635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile685.95
Q13348.75
median6596
Q39862.25
95-th percentile12441.1
Maximum13087
Range13086
Interquartile range (IQR)6513.5

Descriptive statistics

Standard deviation3768.9587
Coefficient of variation (CV)0.57214503
Kurtosis-1.1986498
Mean6587.4185
Median Absolute Deviation (MAD)3258.5
Skewness-0.012925052
Sum65874185
Variance14205050
MonotonicityNot monotonic
2024-01-10T06:04:54.264140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9556 1
 
< 0.1%
5794 1
 
< 0.1%
10387 1
 
< 0.1%
750 1
 
< 0.1%
5326 1
 
< 0.1%
12909 1
 
< 0.1%
7572 1
 
< 0.1%
12952 1
 
< 0.1%
741 1
 
< 0.1%
4012 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
13087 1
< 0.1%
13085 1
< 0.1%
13084 1
< 0.1%
13083 1
< 0.1%
13082 1
< 0.1%
13078 1
< 0.1%
13075 1
< 0.1%
13074 1
< 0.1%
13073 1
< 0.1%
13072 1
< 0.1%

지역코드
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44305.649
Minimum44000
Maximum44790
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:04:54.378679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum44000
5-th percentile44000
Q144133
median44210
Q344270
95-th percentile44770
Maximum44790
Range790
Interquartile range (IQR)137

Descriptive statistics

Standard deviation255.3372
Coefficient of variation (CV)0.0057630845
Kurtosis-0.51343521
Mean44305.649
Median Absolute Deviation (MAD)77
Skewness1.0512526
Sum4.4305649 × 108
Variance65197.086
MonotonicityNot monotonic
2024-01-10T06:04:54.490633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
44180 761
 
7.6%
44200 750
 
7.5%
44250 745
 
7.4%
44000 734
 
7.3%
44210 734
 
7.3%
44130 727
 
7.3%
44150 726
 
7.3%
44230 719
 
7.2%
44710 696
 
7.0%
44131 640
 
6.4%
Other values (5) 2768
27.7%
ValueCountFrequency (%)
44000 734
7.3%
44130 727
7.3%
44131 640
6.4%
44133 638
6.4%
44150 726
7.3%
44180 761
7.6%
44200 750
7.5%
44210 734
7.3%
44230 719
7.2%
44250 745
7.4%
ValueCountFrequency (%)
44790 476
4.8%
44770 598
6.0%
44760 563
5.6%
44710 696
7.0%
44270 493
4.9%
44250 745
7.4%
44230 719
7.2%
44210 734
7.3%
44200 750
7.5%
44180 761
7.6%

지역명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보령시
761 
아산시
750 
계룡시
745 
충남
734 
서산시
734 
Other values (10)
6276 

Length

Max length3
Median length3
Mean length2.9266
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row당진시
2nd row서북구
3rd row공주시
4th row보령시
5th row청양군

Common Values

ValueCountFrequency (%)
보령시 761
 
7.6%
아산시 750
 
7.5%
계룡시 745
 
7.4%
충남 734
 
7.3%
서산시 734
 
7.3%
천안시 727
 
7.3%
공주시 726
 
7.3%
논산시 719
 
7.2%
금산군 696
 
7.0%
동남구 640
 
6.4%
Other values (5) 2768
27.7%

Length

2024-01-10T06:04:54.628980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보령시 761
 
7.6%
아산시 750
 
7.5%
계룡시 745
 
7.4%
충남 734
 
7.3%
서산시 734
 
7.3%
천안시 727
 
7.3%
공주시 726
 
7.3%
논산시 719
 
7.2%
금산군 696
 
7.0%
동남구 640
 
6.4%
Other values (5) 2768
27.7%

조사분기
Real number (ℝ)

Distinct198
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201425.59
Minimum200601
Maximum202206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:04:55.032593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum200601
5-th percentile200705
Q1201011
median201409
Q3201808
95-th percentile202109
Maximum202206
Range1605
Interquartile range (IQR)797

Descriptive statistics

Standard deviation457.35431
Coefficient of variation (CV)0.0022705869
Kurtosis-1.1356907
Mean201425.59
Median Absolute Deviation (MAD)398
Skewness-0.039263361
Sum2.0142559 × 109
Variance209172.97
MonotonicityNot monotonic
2024-01-10T06:04:55.167532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201306 63
 
0.6%
201311 62
 
0.6%
202109 62
 
0.6%
201307 62
 
0.6%
201506 62
 
0.6%
201303 61
 
0.6%
201111 61
 
0.6%
201211 61
 
0.6%
202106 61
 
0.6%
201308 60
 
0.6%
Other values (188) 9385
93.8%
ValueCountFrequency (%)
200601 12
 
0.1%
200602 23
0.2%
200603 23
0.2%
200604 26
0.3%
200605 24
0.2%
200606 31
0.3%
200607 28
0.3%
200608 32
0.3%
200609 30
0.3%
200610 39
0.4%
ValueCountFrequency (%)
202206 55
0.5%
202205 55
0.5%
202204 53
0.5%
202203 52
0.5%
202202 57
0.6%
202201 58
0.6%
202112 56
0.6%
202111 58
0.6%
202110 49
0.5%
202109 62
0.6%

거래유형
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
2076 
3
2053 
2
2019 
4
2001 
5
1851 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row5
3rd row1
4th row4
5th row4

Common Values

ValueCountFrequency (%)
1 2076
20.8%
3 2053
20.5%
2 2019
20.2%
4 2001
20.0%
5 1851
18.5%

Length

2024-01-10T06:04:55.312456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:04:55.433687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 2076
20.8%
3 2053
20.5%
2 2019
20.2%
4 2001
20.0%
5 1851
18.5%

외국인거래_건수
Real number (ℝ)

ZEROS 

Distinct139
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.8004
Minimum0
Maximum224
Zeros5962
Zeros (%)59.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:04:55.585844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile25
Maximum224
Range224
Interquartile range (IQR)3

Descriptive statistics

Standard deviation14.821495
Coefficient of variation (CV)3.0875543
Kurtosis55.467458
Mean4.8004
Median Absolute Deviation (MAD)0
Skewness6.3858916
Sum48004
Variance219.67673
MonotonicityNot monotonic
2024-01-10T06:04:55.750641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 5962
59.6%
1 973
 
9.7%
2 531
 
5.3%
3 347
 
3.5%
4 275
 
2.8%
5 225
 
2.2%
6 147
 
1.5%
7 125
 
1.2%
8 117
 
1.2%
10 91
 
0.9%
Other values (129) 1207
 
12.1%
ValueCountFrequency (%)
0 5962
59.6%
1 973
 
9.7%
2 531
 
5.3%
3 347
 
3.5%
4 275
 
2.8%
5 225
 
2.2%
6 147
 
1.5%
7 125
 
1.2%
8 117
 
1.2%
9 85
 
0.9%
ValueCountFrequency (%)
224 1
< 0.1%
213 1
< 0.1%
200 1
< 0.1%
199 1
< 0.1%
196 1
< 0.1%
191 1
< 0.1%
188 1
< 0.1%
185 2
< 0.1%
179 1
< 0.1%
176 1
< 0.1%

지역구분 레벨
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
7988 
2
1278 
0
 
734

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 7988
79.9%
2 1278
 
12.8%
0 734
 
7.3%

Length

2024-01-10T06:04:55.894052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:04:55.993248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 7988
79.9%
2 1278
 
12.8%
0 734
 
7.3%

Interactions

2024-01-10T06:04:53.543506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.391023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.745954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.136918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.626472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.474887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.840428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.237582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.719594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.568428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.938827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.352735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.812496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:52.665143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.042564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:04:53.458104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:04:56.067048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호지역코드지역명조사분기거래유형외국인거래_건수지역구분 레벨
번호1.0000.9800.9820.2940.1550.4600.884
지역코드0.9801.0001.0000.1380.1100.1620.510
지역명0.9821.0001.0000.2030.0850.4381.000
조사분기0.2940.1380.2031.0000.0000.1750.131
거래유형0.1550.1100.0850.0001.0000.2590.000
외국인거래_건수0.4600.1620.4380.1750.2591.0000.521
지역구분 레벨0.8840.5101.0000.1310.0000.5211.000
2024-01-10T06:04:56.180385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역명지역구분 레벨거래유형
지역명1.0000.9990.036
지역구분 레벨0.9991.0000.000
거래유형0.0360.0001.000
2024-01-10T06:04:56.277615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호지역코드조사분기외국인거래_건수지역명거래유형지역구분 레벨
번호1.0000.9980.055-0.3070.8610.0650.824
지역코드0.9981.000-0.003-0.3091.0000.0390.831
조사분기0.055-0.0031.0000.1680.0770.0000.079
외국인거래_건수-0.307-0.3090.1681.0000.1790.1100.366
지역명0.8611.0000.0770.1791.0000.0360.999
거래유형0.0650.0390.0000.1100.0361.0000.000
지역구분 레벨0.8240.8310.0790.3660.9990.0001.000

Missing values

2024-01-10T06:04:53.916938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:04:54.023080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호지역코드지역명조사분기거래유형외국인거래_건수지역구분 레벨
9555955644270당진시201402401
3472347344133서북구201911502
4285428644150공주시201801151
5027502844180보령시201403401
128031280444790청양군200603401
17918044000충남2012073110
124271242844770서천군202112311
3431343244133서북구201811402
7657765844230논산시201210111
3485348644133서북구2019053112
번호지역코드지역명조사분기거래유형외국인거래_건수지역구분 레벨
110061100744760부여군201204301
6025602644200아산시201402201
112461124744760부여군201010401
129111291244790청양군200608501
5878587944200아산시200901501
2519252044131동남구201704352
1167116844130천안시201004341
1492149344130천안시201011501
123541235544770서천군202105311
102561025744710금산군201306201