Overview

Dataset statistics

Number of variables: 9
Number of observations: 1920
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 242
Duplicate rows (%): 12.6%
Total size in memory: 140.8 KiB
Average record size in memory: 75.1 B
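
As a rough illustration of how counts like these can be derived, the sketch below summarizes a table held as a list of tuples. The function name `dataset_statistics` and the choice to treat `None` and the empty string as missing are assumptions made for this example. The duplicate definition used here (rows beyond the first occurrence of each distinct row) may differ from the one the profiling tool applies.

```python
from collections import Counter


def dataset_statistics(rows, missing=(None, "")):
    """Summarize a table given as a list of equal-length tuples (hypothetical helper)."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    n_cells = n_obs * n_vars

    # A cell counts as missing when it equals one of the assumed markers.
    missing_cells = sum(1 for row in rows for cell in row if cell in missing)

    # Assumed definition: every repeat after the first occurrence is a duplicate.
    duplicate_rows = sum(count - 1 for count in Counter(rows).values())

    def pct(part, whole):
        return round(100 * part / whole, 1) if whole else 0.0

    return {
        "n_vars": n_vars,
        "n_obs": n_obs,
        "missing_cells": missing_cells,
        "missing_pct": pct(missing_cells, n_cells),
        "duplicate_rows": duplicate_rows,
        "duplicate_pct": pct(duplicate_rows, n_obs),
    }


example_rows = [(1, "a"), (1, "a"), (2, None), (3, "b")]
example_stats = dataset_statistics(example_rows)
print(example_stats)
```

The percentages in the report follow the same arithmetic: 242 of 1920 rows is 12.6%.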

Variable types

Numeric: 1
Categorical: 8

Dataset

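Description: LiDAR deliverables records (LIDAR성과내역) from the aerial-photography metadata of the National Geographic Information Institute. Fields include LIDAR자료ID (LiDAR data ID), 명칭 (name), 자료형식 (data format), and 촬영년도 (capture year), among others.
Author: 국토교통부 국토지리정보원 (Ministry of Land, Infrastructure and Transport, National Geographic Information Institute)
URL: https://www.data.go.kr/data/15067646/fileData.do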

Alerts

명칭 has constant value "수치표고자료" [Constant]
지리좌표계 has constant value "1" [Constant]
Dataset has 242 (12.6%) duplicate rows [Duplicates]
제작기관 is highly overall correlated with 촬영년도 and 3 other fields [High correlation]
자료형식 is highly overall correlated with 촬영년도 and 2 other fields [High correlation]
참고사항 is highly overall correlated with LIDAR자료ID and 5 other fields [High correlation]
좌표계 is highly overall correlated with 촬영년도 and 3 other fields [High correlation]
예비 is highly overall correlated with LIDAR자료ID and 5 other fields [High correlation]
촬영년도 is highly overall correlated with 자료형식 and 4 other fields [High correlation]
LIDAR자료ID is highly overall correlated with 참고사항 and 1 other field [High correlation]
자료형식 is highly imbalanced (87.2%) [Imbalance]
촬영년도 is highly imbalanced (63.9%) [Imbalance]
좌표계 is highly imbalanced (58.1%) [Imbalance]
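
The three imbalance percentages can be reproduced from the category counts listed later in this report with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category shares and k the number of categories. The sketch below is a minimal version of that calculation. That the profiling tool computes the score exactly this way is an assumption, and `imbalance_score` is a name chosen for this example.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a perfectly balanced column, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        # Assumption: the score is not meaningful for constant or empty columns.
        return 0.0
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# Category counts copied from the variable sections of this report.
reported_counts = {
    "자료형식": [1869, 29, 22],
    "촬영년도": [1694, 207, 19],
    "좌표계": [1757, 163],
}
scores = {name: imbalance_score(counts) for name, counts in reported_counts.items()}
for name, score in scores.items():
    print(f"{name}: {score:.1%}")
```

Under this formula the scores come out at about 87.2%, 63.9%, and 58.1%, matching the alerts.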

Reproduction

Analysis started: 2023-12-12 09:22:05.377880
Analysis finished: 2023-12-12 09:22:06.696507
Duration: 1.32 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

LIDAR자료ID (LiDAR data ID)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 277
Distinct (%): 14.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 97.966667
Minimum: 1
Maximum: 356
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 11
Q1: 48
Median: 93
Q3: 135
95th percentile: 212
Maximum: 356
Range: 355
Interquartile range (IQR): 87

Descriptive statistics

Standard deviation: 62.686021
Coefficient of variation (CV): 0.63987092
Kurtosis: 0.57890301
Mean: 97.966667
Median Absolute Deviation (MAD): 44
Skewness: 0.72102299
Sum: 188096
Variance: 3929.5372
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]
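
Several of the summary figures for LIDAR자료ID are derived from others, so they can be cross-checked with a few lines of arithmetic. The sketch below uses only values printed in this report; the variable names are chosen for illustration.

```python
from math import isclose

# Values copied from the statistics reported for LIDAR자료ID.
n = 1920
mean = 97.966667
std = 62.686021
q1, q3 = 48, 135
minimum, maximum = 1, 356

derived = {
    "range": maximum - minimum,   # reported: 355
    "iqr": q3 - q1,               # reported: 87
    "cv": std / mean,             # reported: 0.63987092
    "variance": std ** 2,         # reported: 3929.5372
    "sum": mean * n,              # reported: 188096
}

checks = {
    "range": derived["range"] == 355,
    "iqr": derived["iqr"] == 87,
    "cv": isclose(derived["cv"], 0.63987092, abs_tol=1e-6),
    "variance": isclose(derived["variance"], 3929.5372, abs_tol=1e-3),
    "sum": isclose(derived["sum"], 188096, abs_tol=0.01),
}
print(checks)
```

Each check holds within rounding, so the reported statistics are consistent with one another.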
Common values

Value               Count  Frequency (%)
110                 12     0.6%
125                 12     0.6%
126                 12     0.6%
95                  12     0.6%
97                  12     0.6%
102                 12     0.6%
104                 12     0.6%
115                 12     0.6%
117                 12     0.6%
120                 12     0.6%
Other values (267)  1800   93.8%

Minimum 10 values

Value  Count  Frequency (%)
1      9      0.5%
2      9      0.5%
3      9      0.5%
4      9      0.5%
5      9      0.5%
6      9      0.5%
7      9      0.5%
8      9      0.5%
9      9      0.5%
10     9      0.5%

Maximum 10 values

Value  Count  Frequency (%)
356    1      0.1%
355    1      0.1%
354    1      0.1%
353    1      0.1%
352    1      0.1%
351    1      0.1%
350    1      0.1%
349    1      0.1%
348    1      0.1%
347    1      0.1%
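
The "Other values (267)" row in the first frequency table for LIDAR자료ID aggregates everything outside the ten most common values. A minimal sketch of building such a table follows; `common_values` is a hypothetical helper, not a function of the profiling library.

```python
from collections import Counter


def common_values(values, top=10):
    """Return (label, count, percent) rows for the top values plus an 'Other values' row."""
    counts = Counter(values)
    total = len(values)
    most_common = counts.most_common(top)
    rows = [(str(value), count, 100 * count / total) for value, count in most_common]

    n_other = len(counts) - len(most_common)
    if n_other > 0:
        other_count = total - sum(count for _, count in most_common)
        rows.append((f"Other values ({n_other})", other_count, 100 * other_count / total))
    return rows


example_values = [1] * 5 + [2] * 3 + [3, 4]
example_table = common_values(example_values, top=2)
for label, count, percent in example_table:
    print(f"{label}  {count}  {percent:.1f}%")
```

Applied to this column, the same arithmetic gives 277 - 10 = 267 remaining distinct values and 1920 - 10 × 12 = 1800 remaining observations, or 93.75% (shown as 93.8%).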

명칭 (name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

수치표고자료 (digital elevation data): 1920

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수치표고자료
2nd row: 수치표고자료
3rd row: 수치표고자료
4th row: 수치표고자료
5th row: 수치표고자료

Common Values

Value         Count  Frequency (%)
수치표고자료  1920   100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value         Count  Frequency (%)
수치표고자료  1920   100.0%

자료형식 (data format)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

ASCII: 1869
SCN: 29
Ascii: 22

Length

Max length: 5
Median length: 5
Mean length: 4.9697917
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ASCII
2nd row: ASCII
3rd row: ASCII
4th row: ASCII
5th row: ASCII

Common Values

Value  Count  Frequency (%)
ASCII  1869   97.3%
SCN    29     1.5%
Ascii  22     1.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value  Count  Frequency (%)
ascii  1891   98.5%
scn    29     1.5%
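
The values "ASCII" (1869) and "Ascii" (22) in 자료형식 differ only in letter case, and the lowercased word counts already combine them into 1891. If they are meant to denote the same format, which is an assumption about the data, the categories could be merged before further analysis. A minimal sketch, with `merge_case_variants` as a hypothetical helper:

```python
from collections import Counter

# Category counts copied from the Common Values table for 자료형식.
reported_formats = Counter({"ASCII": 1869, "SCN": 29, "Ascii": 22})


def merge_case_variants(counts):
    """Merge labels that differ only in letter case or surrounding whitespace."""
    merged = Counter()
    for label, count in counts.items():
        merged[label.strip().upper()] += count
    return merged


merged_formats = merge_case_variants(reported_formats)
print(merged_formats)
```

After merging, the column has two categories instead of three, which would also change its distinct count and imbalance score.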

촬영년도 (capture year)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

2009: 1694
2010: 207
2008: 19

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2009
2nd row: 2009
3rd row: 2009
4th row: 2009
5th row: 2009

Common Values

Value  Count  Frequency (%)
2009   1694   88.2%
2010   207    10.8%
2008   19     1.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value  Count  Frequency (%)
2009   1694   88.2%
2010   207    10.8%
2008   19     1.0%

좌표계 (coordinate system)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

평면직각좌표계 (plane rectangular coordinate system): 1757
세계측지계 (world geodetic system): 163

Length

Max length: 7
Median length: 7
Mean length: 6.8302083
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 평면직각좌표계
2nd row: 평면직각좌표계
3rd row: 평면직각좌표계
4th row: 평면직각좌표계
5th row: 평면직각좌표계

Common Values

Value           Count  Frequency (%)
평면직각좌표계  1757   91.5%
세계측지계      163    8.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value           Count  Frequency (%)
평면직각좌표계  1757   91.5%
세계측지계      163    8.5%

제작기관 (producing organization)
Categorical

HIGH CORRELATION

Distinct: 13
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

새한항업(주) (02-3439-2310): 379
(주)신한항업 (02-2108-3700): 334
한진정보통신(주) 공간영상개발팀(02-2166-7436): 277
새한항업(주): 155
㈜범아엔지니어링 (02-3487-0011): 151
Other values (8): 624

Length

Max length: 31
Median length: 28
Mean length: 21.149479
Min length: 4

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 한진정보통신(주) 공간영상개발팀(02-2166-7436)
2nd row: 한진정보통신(주) 공간영상개발팀(02-2166-7436)
3rd row: 한진정보통신(주) 공간영상개발팀(02-2166-7436)
4th row: 한진정보통신(주) 공간영상개발팀(02-2166-7436)
5th row: 한진정보통신(주) 공간영상개발팀(02-2166-7436)

Common Values

Value  Count  Frequency (%)
새한항업(주) (02-3439-2310)  379  19.7%
(주)신한항업 (02-2108-3700)  334  17.4%
한진정보통신(주) 공간영상개발팀(02-2166-7436)  277  14.4%
새한항업(주)  155  8.1%
㈜범아엔지니어링 (02-3487-0011)  151  7.9%
국토지리정보원 공간영상과(031-210-2675)  151  7.9%
중앙항업(주) (02-730-0018)  134  7.0%
중앙항업㈜ (02-730-0018)  111  5.8%
<NA>  106  5.5%
(주)아세아항측 (02-3660-6441)  86  4.5%
Other values (3)  36  1.9%

Length

[Figure: Histogram of lengths of the category]

Words

Value  Count  Frequency (%)
새한항업(주  534  15.0%
02-3439-2310  379  10.7%
주)신한항업  334  9.4%
02-2108-3700  334  9.4%
한진정보통신(주  288  8.1%
공간영상개발팀(02-2166-7436  277  7.8%
02-730-0018  245  6.9%
공간영상과(031-210-2675  152  4.3%
국토지리정보원  151  4.2%
02-3487-0011  151  4.2%
Other values (9)  710  20.0%
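
Several 제작기관 labels appear to be spelling variants of one another: the same name with and without a phone number, and "(주)" written either as three characters or as the single character "㈜". Treating these as the same organization is an assumption about the data, not something the report states. Under that assumption, a minimal normalization sketch might look like the following, where `normalize_org` and the phone-number pattern are illustrative choices.

```python
import re
import unicodedata
from collections import Counter

# Trailing parenthesized phone number such as "(02-730-0018)"; pattern assumed from the labels above.
PHONE_SUFFIX = re.compile(r"\s*\(\d{2,4}-\d{3,4}-\d{4}\)\s*$")


def normalize_org(label):
    """Unify compatibility characters (e.g. "㈜" becomes "(주)") and drop a trailing phone number."""
    text = unicodedata.normalize("NFKC", label)
    text = PHONE_SUFFIX.sub("", text)
    return text.strip()


# Counts copied from the Common Values table for 제작기관.
reported_orgs = {
    "새한항업(주) (02-3439-2310)": 379,
    "새한항업(주)": 155,
    "중앙항업(주) (02-730-0018)": 134,
    "중앙항업㈜ (02-730-0018)": 111,
}
merged_orgs = Counter()
for label, count in reported_orgs.items():
    merged_orgs[normalize_org(label)] += count
print(merged_orgs)
```

The merged totals (534 and 245) agree with the word counts reported for "새한항업(주" and "02-730-0018", which suggests the variants do refer to the same entries.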

참고사항 (remarks)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

<NA>: 1586
없음. ("None."): 334

Length

Max length: 4
Median length: 4
Mean length: 3.8260417
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>   1586   82.6%
없음.  334    17.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value  Count  Frequency (%)
na     1586   82.6%
없음   334    17.4%

예비 (reserved field)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

<NA>: 1591
없음. ("None."): 329

Length

Max length: 4
Median length: 4
Mean length: 3.8286458
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>   1591   82.9%
없음.  329    17.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value  Count  Frequency (%)
na     1591   82.9%
없음   329    17.1%
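
In both 참고사항 and 예비, "<NA>" is counted as an ordinary category, which is why the report shows 0 missing cells. The reported mean lengths are consistent with "<NA>" being a literal four-character string. The sketch below checks that and shows what the missing rate would be if "<NA>" were treated as a missing-value marker. Treating it that way is an assumption, and the helper names are illustrative.

```python
def mean_length(counts):
    """Mean string length of a column described by {label: count}."""
    total = sum(counts.values())
    return sum(len(label) * count for label, count in counts.items()) / total


def count_missing(counts, sentinels=("<NA>",)):
    """Return (missing count, missing percent) when sentinel labels are treated as missing."""
    total = sum(counts.values())
    missing = sum(count for label, count in counts.items() if label.strip() in sentinels)
    return missing, 100 * missing / total


# Counts copied from the Common Values tables.
remarks = {"<NA>": 1586, "없음.": 334}   # 참고사항
reserve = {"<NA>": 1591, "없음.": 329}   # 예비

for name, counts in (("참고사항", remarks), ("예비", reserve)):
    missing, percent = count_missing(counts)
    print(f"{name}: mean length {mean_length(counts):.7f}, missing {missing} ({percent:.1f}%)")
```

With this reading, about 83% of each column would be missing, and the dataset-level "Missing cells: 0" would no longer hold.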

지리좌표계 (geographic coordinate system)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.1 KiB

1: 1920

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
1      1920   100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value  Count  Frequency (%)
1      1920   100.0%

Interactions

[Figure]

Correlations

             LIDAR자료ID  자료형식  촬영년도  좌표계  제작기관
LIDAR자료ID  1.000        0.224     0.254     0.138   0.449
자료형식     0.224        1.000     0.878     0.023   0.598
촬영년도     0.254        0.878     1.000     0.607   0.908
좌표계       0.138        0.023     0.607     1.000   1.000
제작기관     0.449        0.598     0.908     1.000   1.000

          제작기관  자료형식  참고사항  좌표계  예비   촬영년도
제작기관  1.000     0.332     1.000     0.997   1.000  0.664
자료형식  0.332     1.000     1.000     0.039   1.000  0.577
참고사항  1.000     1.000     1.000     1.000   1.000  1.000
좌표계    0.997     0.039     1.000     1.000   1.000  0.876
예비      1.000     1.000     1.000     1.000   1.000  1.000
촬영년도  0.664     0.577     1.000     0.876   1.000  1.000

             LIDAR자료ID  자료형식  촬영년도  좌표계  제작기관  참고사항  예비
LIDAR자료ID  1.000        0.102     0.116     0.137   0.208     1.000     1.000
자료형식     0.102        1.000     0.577     0.039   0.332     1.000     1.000
촬영년도     0.116        0.577     1.000     0.876   0.664     1.000     1.000
좌표계       0.137        0.039     0.876     1.000   0.997     1.000     1.000
제작기관     0.208        0.332     0.664     0.997   1.000     1.000     1.000
참고사항     1.000        1.000     1.000     1.000   1.000     1.000     1.000
예비         1.000        1.000     1.000     1.000   1.000     1.000     1.000
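
The report does not state which association measure each matrix uses. For pairs of categorical columns, one common choice is Cramér's V, sketched below in its plain (uncorrected) form. It is shown as background only and is not claimed to be the method behind the numbers above.

```python
from collections import Counter
from math import isnan, sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of category labels."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))

    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k == 0:
        # Undefined when either variable is constant.
        return float("nan")

    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
constant = cramers_v(["a", "a", "b", "b"], ["x", "x", "x", "x"])
print(perfect, independent, constant)
```

The measure is undefined for a constant column, which fits the fact that the two constant columns (명칭 and 지리좌표계) do not appear in any of the matrices.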

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

index  LIDAR자료ID  명칭  자료형식  촬영년도  좌표계  제작기관  참고사항  예비  지리좌표계
0  90  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
1  92  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
2  94  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
3  96  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
4  98  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
5  99  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
6  101  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
7  103  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
8  105  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1
9  107  수치표고자료  ASCII  2009  평면직각좌표계  한진정보통신(주) 공간영상개발팀(02-2166-7436)  <NA>  <NA>  1

Last rows

index  LIDAR자료ID  명칭  자료형식  촬영년도  좌표계  제작기관  참고사항  예비  지리좌표계
1910  164  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1911  166  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1912  168  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1913  170  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1914  179  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1915  181  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1916  183  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1917  185  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1918  187  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1
1919  189  수치표고자료  ASCII  2010  세계측지계  국토지리정보원 공간영상과(031-210-2675)  <NA>  <NA>  1

Duplicate rows

Most frequently occurring

index  LIDAR자료ID  명칭  자료형식  촬영년도  좌표계  제작기관  참고사항  예비  지리좌표계  # duplicates
0  1  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
1  2  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
2  3  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
3  4  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
4  5  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
5  5  수치표고자료  ASCII  2009  평면직각좌표계  새한항업(주) (02-3439-2310)  <NA>  <NA>  1  2
6  6  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
7  6  수치표고자료  ASCII  2009  평면직각좌표계  새한항업(주) (02-3439-2310)  <NA>  <NA>  1  2
8  7  수치표고자료  ASCII  2009  평면직각좌표계  (주)신한항업 (02-2108-3700)  없음.  없음.  1  2
9  7  수치표고자료  ASCII  2009  평면직각좌표계  새한항업(주) (02-3439-2310)  <NA>  <NA>  1  2
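
A listing like the one above can be produced by counting identical rows and keeping those that occur more than once. The following is a minimal sketch with rows held as tuples; `most_frequent_duplicates` is a hypothetical helper and not part of the profiling library.

```python
from collections import Counter


def most_frequent_duplicates(rows, top=10):
    """Return up to `top` (row, occurrences) pairs for rows seen more than once, most frequent first."""
    counts = Counter(rows)
    duplicates = [(row, n) for row, n in counts.items() if n > 1]
    # sorted() is stable, so ties keep their first-seen order.
    duplicates = sorted(duplicates, key=lambda item: item[1], reverse=True)
    return duplicates[:top]


example_rows = [("a", 1), ("b", 2), ("a", 1), ("c", 3), ("b", 2), ("a", 1)]
example_duplicates = most_frequent_duplicates(example_rows)
for row, n in example_duplicates:
    print(row, n)
```

In the table above, LIDAR자료ID values 5, 6, and 7 each appear in two different duplicated rows with different producers, so the ID alone does not identify a record.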