Overview

Dataset statistics

Number of variables11
Number of observations2335
Missing cells20
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory207.6 KiB
Average record size in memory91.1 B

Variable types

Numeric2
Categorical4
Text5

Dataset

Description울산의 도시경관 사업을 수행할 때 경관 사진을 촬영시 반드시 촬영해야 하는 주요 촬영지점 정보에 대한 색인데이터입니다.
URLhttps://www.data.go.kr/data/15095639/fileData.do

Alerts

사업회차 has constant value ""Constant
관리번호 is highly overall correlated with 구-군High correlation
구-군 is highly overall correlated with 관리번호High correlation
대분류 is highly overall correlated with 시점High correlation
시점 is highly overall correlated with 대분류High correlation
시점 is highly imbalanced (84.6%)Imbalance
관리번호 has unique valuesUnique
표준기록점 코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:25:42.157460
Analysis finished2023-12-12 06:25:43.943261
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2335
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4811
Minimum3644
Maximum5978
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.7 KiB
2023-12-12T15:25:44.029894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3644
5-th percentile3760.7
Q14227.5
median4811
Q35394.5
95-th percentile5861.3
Maximum5978
Range2334
Interquartile range (IQR)1167

Descriptive statistics

Standard deviation674.20076
Coefficient of variation (CV)0.14013734
Kurtosis-1.2
Mean4811
Median Absolute Deviation (MAD)584
Skewness0
Sum11233685
Variance454546.67
MonotonicityNot monotonic
2023-12-12T15:25:44.196845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5596 1
 
< 0.1%
4795 1
 
< 0.1%
4779 1
 
< 0.1%
4780 1
 
< 0.1%
4781 1
 
< 0.1%
4782 1
 
< 0.1%
4783 1
 
< 0.1%
4784 1
 
< 0.1%
4785 1
 
< 0.1%
4786 1
 
< 0.1%
Other values (2325) 2325
99.6%
ValueCountFrequency (%)
3644 1
< 0.1%
3645 1
< 0.1%
3646 1
< 0.1%
3647 1
< 0.1%
3648 1
< 0.1%
3649 1
< 0.1%
3650 1
< 0.1%
3651 1
< 0.1%
3652 1
< 0.1%
3653 1
< 0.1%
ValueCountFrequency (%)
5978 1
< 0.1%
5977 1
< 0.1%
5976 1
< 0.1%
5975 1
< 0.1%
5974 1
< 0.1%
5973 1
< 0.1%
5972 1
< 0.1%
5971 1
< 0.1%
5970 1
< 0.1%
5969 1
< 0.1%

사업회차
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
2
2335 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 2335
100.0%

Length

2023-12-12T15:25:44.356188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:25:44.482788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 2335
100.0%

구-군
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
울주군
825 
남구
591 
중구
439 
북구
285 
동구
195 

Length

Max length3
Median length2
Mean length2.3533191
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울주군
2nd row울주군
3rd row울주군
4th row울주군
5th row울주군

Common Values

ValueCountFrequency (%)
울주군 825
35.3%
남구 591
25.3%
중구 439
18.8%
북구 285
 
12.2%
동구 195
 
8.4%

Length

2023-12-12T15:25:44.585167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:25:44.702075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울주군 825
35.3%
남구 591
25.3%
중구 439
18.8%
북구 285
 
12.2%
동구 195
 
8.4%

대분류
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
시가지경관
561 
도시기반시설경관
397 
건축물
394 
자연경관
229 
농산어촌경관
202 
Other values (9)
552 

Length

Max length9
Median length7
Mean length5.4376874
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지역상징경관
2nd row지역상징경관
3rd row지역상징경관
4th row지역상징경관
5th row지역상징경관

Common Values

ValueCountFrequency (%)
시가지경관 561
24.0%
도시기반시설경관 397
17.0%
건축물 394
16.9%
자연경관 229
9.8%
농산어촌경관 202
 
8.7%
신개발지 107
 
4.6%
역사문화경관 100
 
4.3%
도시지형지물라인 89
 
3.8%
산업지역경관 88
 
3.8%
자연의 변화 55
 
2.4%
Other values (4) 113
 
4.8%

Length

2023-12-12T15:25:44.849225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
시가지경관 561
22.8%
도시기반시설경관 397
16.2%
건축물 394
16.0%
자연경관 229
9.3%
농산어촌경관 202
 
8.2%
신개발지 107
 
4.4%
역사문화경관 100
 
4.1%
도시지형지물라인 89
 
3.6%
산업지역경관 88
 
3.6%
자연의 55
 
2.2%
Other values (7) 236
9.6%
Distinct52
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
2023-12-12T15:25:45.074306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length4.8882227
Min length1

Characters and Unicode

Total characters11414
Distinct characters97
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row12경
2nd row12경
3rd row12경
4th row12경
5th row12경
ValueCountFrequency (%)
시가지전경(항공 328
 
13.0%
도로 185
 
7.3%
공공시설 151
 
6.0%
농촌 104
 
4.1%
문화재 100
 
4.0%
주거지(고층 99
 
3.9%
하천 88
 
3.5%
주거지(저층 87
 
3.4%
교육시설 81
 
3.2%
어촌(항,포구 79
 
3.1%
Other values (51) 1225
48.5%
2023-12-12T15:25:45.448879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
845
 
7.4%
817
 
7.2%
683
 
6.0%
( 660
 
5.8%
) 660
 
5.8%
443
 
3.9%
441
 
3.9%
434
 
3.8%
390
 
3.4%
361
 
3.2%
Other values (87) 5680
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9711
85.1%
Open Punctuation 660
 
5.8%
Close Punctuation 660
 
5.8%
Space Separator 192
 
1.7%
Other Punctuation 107
 
0.9%
Decimal Number 84
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
845
 
8.7%
817
 
8.4%
683
 
7.0%
443
 
4.6%
441
 
4.5%
434
 
4.5%
390
 
4.0%
361
 
3.7%
238
 
2.5%
233
 
2.4%
Other values (81) 4826
49.7%
Decimal Number
ValueCountFrequency (%)
2 42
50.0%
1 42
50.0%
Open Punctuation
ValueCountFrequency (%)
( 660
100.0%
Close Punctuation
ValueCountFrequency (%)
) 660
100.0%
Space Separator
ValueCountFrequency (%)
192
100.0%
Other Punctuation
ValueCountFrequency (%)
, 107
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9711
85.1%
Common 1703
 
14.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
845
 
8.7%
817
 
8.4%
683
 
7.0%
443
 
4.6%
441
 
4.5%
434
 
4.5%
390
 
4.0%
361
 
3.7%
238
 
2.5%
233
 
2.4%
Other values (81) 4826
49.7%
Common
ValueCountFrequency (%)
( 660
38.8%
) 660
38.8%
192
 
11.3%
, 107
 
6.3%
2 42
 
2.5%
1 42
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9711
85.1%
ASCII 1703
 
14.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
845
 
8.7%
817
 
8.4%
683
 
7.0%
443
 
4.6%
441
 
4.5%
434
 
4.5%
390
 
4.0%
361
 
3.7%
238
 
2.5%
233
 
2.4%
Other values (81) 4826
49.7%
ASCII
ValueCountFrequency (%)
( 660
38.8%
) 660
38.8%
192
 
11.3%
, 107
 
6.3%
2 42
 
2.5%
1 42
 
2.5%
Distinct691
Distinct (%)29.8%
Missing18
Missing (%)0.8%
Memory size18.4 KiB
2023-12-12T15:25:45.785283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length5.8567113
Min length1

Characters and Unicode

Total characters13570
Distinct characters353
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique296 ?
Unique (%)12.8%

Sample

1st row외고산 옹기마을
2nd row외고산 옹기마을
3rd row외고산 옹기마을
4th row신불산
5th row신불산 억새평원
ValueCountFrequency (%)
남구 178
 
5.3%
중구 163
 
4.9%
기타 146
 
4.4%
북구 109
 
3.3%
동구 57
 
1.7%
울주 51
 
1.5%
울주범서 48
 
1.4%
울산미포국가산단 37
 
1.1%
태화강 37
 
1.1%
울산전경 33
 
1.0%
Other values (724) 2469
74.2%
2023-12-12T15:25:46.339155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1011
 
7.5%
684
 
5.0%
668
 
4.9%
586
 
4.3%
507
 
3.7%
293
 
2.2%
288
 
2.1%
224
 
1.7%
204
 
1.5%
201
 
1.5%
Other values (343) 8904
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11885
87.6%
Space Separator 1011
 
7.5%
Uppercase Letter 230
 
1.7%
Decimal Number 190
 
1.4%
Other Punctuation 159
 
1.2%
Connector Punctuation 50
 
0.4%
Close Punctuation 18
 
0.1%
Open Punctuation 18
 
0.1%
Dash Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
684
 
5.8%
668
 
5.6%
586
 
4.9%
507
 
4.3%
293
 
2.5%
288
 
2.4%
224
 
1.9%
204
 
1.7%
201
 
1.7%
190
 
1.6%
Other values (317) 8040
67.6%
Uppercase Letter
ValueCountFrequency (%)
C 88
38.3%
K 50
21.7%
T 30
 
13.0%
X 29
 
12.6%
I 13
 
5.7%
B 12
 
5.2%
U 3
 
1.3%
S 2
 
0.9%
N 1
 
0.4%
L 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 72
37.9%
3 38
20.0%
0 18
 
9.5%
7 15
 
7.9%
2 14
 
7.4%
4 11
 
5.8%
9 9
 
4.7%
8 9
 
4.7%
5 4
 
2.1%
Space Separator
ValueCountFrequency (%)
1011
100.0%
Other Punctuation
ValueCountFrequency (%)
, 159
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 50
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11885
87.6%
Common 1455
 
10.7%
Latin 230
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
684
 
5.8%
668
 
5.6%
586
 
4.9%
507
 
4.3%
293
 
2.5%
288
 
2.4%
224
 
1.9%
204
 
1.7%
201
 
1.7%
190
 
1.6%
Other values (317) 8040
67.6%
Common
ValueCountFrequency (%)
1011
69.5%
, 159
 
10.9%
1 72
 
4.9%
_ 50
 
3.4%
3 38
 
2.6%
) 18
 
1.2%
( 18
 
1.2%
0 18
 
1.2%
7 15
 
1.0%
2 14
 
1.0%
Other values (5) 42
 
2.9%
Latin
ValueCountFrequency (%)
C 88
38.3%
K 50
21.7%
T 30
 
13.0%
X 29
 
12.6%
I 13
 
5.7%
B 12
 
5.2%
U 3
 
1.3%
S 2
 
0.9%
N 1
 
0.4%
L 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11885
87.6%
ASCII 1685
 
12.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1011
60.0%
, 159
 
9.4%
C 88
 
5.2%
1 72
 
4.3%
K 50
 
3.0%
_ 50
 
3.0%
3 38
 
2.3%
T 30
 
1.8%
X 29
 
1.7%
) 18
 
1.1%
Other values (16) 140
 
8.3%
Hangul
ValueCountFrequency (%)
684
 
5.8%
668
 
5.6%
586
 
4.9%
507
 
4.3%
293
 
2.5%
288
 
2.4%
224
 
1.9%
204
 
1.7%
201
 
1.7%
190
 
1.6%
Other values (317) 8040
67.6%

촬영지점
Real number (ℝ)

Distinct44
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.2475375
Minimum1
Maximum54
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.7 KiB
2023-12-12T15:25:46.781636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q36
95-th percentile20
Maximum54
Range53
Interquartile range (IQR)5

Descriptive statistics

Standard deviation6.6343128
Coefficient of variation (CV)1.2642716
Kurtosis8.8597918
Mean5.2475375
Median Absolute Deviation (MAD)2
Skewness2.7072902
Sum12253
Variance44.014106
MonotonicityNot monotonic
2023-12-12T15:25:46.978262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
1 696
29.8%
2 424
18.2%
3 267
 
11.4%
4 183
 
7.8%
5 115
 
4.9%
6 101
 
4.3%
7 82
 
3.5%
8 55
 
2.4%
9 53
 
2.3%
10 43
 
1.8%
Other values (34) 316
13.5%
ValueCountFrequency (%)
1 696
29.8%
2 424
18.2%
3 267
 
11.4%
4 183
 
7.8%
5 115
 
4.9%
6 101
 
4.3%
7 82
 
3.5%
8 55
 
2.4%
9 53
 
2.3%
10 43
 
1.8%
ValueCountFrequency (%)
54 1
 
< 0.1%
53 1
 
< 0.1%
48 1
 
< 0.1%
44 1
 
< 0.1%
40 1
 
< 0.1%
39 1
 
< 0.1%
38 2
0.1%
37 2
0.1%
36 3
0.1%
35 4
0.2%

시점
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
현재
2198 
미래
 
122
과거
 
11
<NA>
 
2
주요울산
 
2

Length

Max length4
Median length2
Mean length2.0034261
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row현재
2nd row현재
3rd row현재
4th row현재
5th row현재

Common Values

ValueCountFrequency (%)
현재 2198
94.1%
미래 122
 
5.2%
과거 11
 
0.5%
<NA> 2
 
0.1%
주요울산 2
 
0.1%

Length

2023-12-12T15:25:47.148966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:25:47.251971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
현재 2198
94.1%
미래 122
 
5.2%
과거 11
 
0.5%
na 2
 
0.1%
주요울산 2
 
0.1%
Distinct2335
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
2023-12-12T15:25:47.453981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length18.000857
Min length18

Characters and Unicode

Total characters42032
Distinct characters33
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2335 ?
Unique (%)100.0%

Sample

1st row02-U-LS-LU-013-013
2nd row02-U-LS-LU-013-009
3rd row02-U-LS-LU-013-008
4th row02-U-LS-LU-011-003
5th row02-U-LS-LU-011-002
ValueCountFrequency (%)
02-u-ls-lu-013-013 1
 
< 0.1%
02-n-fn-f4-001-012 1
 
< 0.1%
02-n-al-lp-004-001 1
 
< 0.1%
02-n-cl-cs-001-004 1
 
< 0.1%
02-n-al-lp-005-001 1
 
< 0.1%
02-n-al-lp-005-002 1
 
< 0.1%
02-n-al-lp-005-003 1
 
< 0.1%
02-n-al-lp-005-004 1
 
< 0.1%
02-n-cl-cs-001-002 1
 
< 0.1%
02-n-cl-cs-001-003 1
 
< 0.1%
Other values (2327) 2327
99.6%
2023-12-12T15:25:47.851720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 11675
27.8%
0 10403
24.8%
2 3324
 
7.9%
1 2198
 
5.2%
U 2143
 
5.1%
L 1764
 
4.2%
N 1280
 
3.0%
A 880
 
2.1%
I 850
 
2.0%
3 719
 
1.7%
Other values (23) 6796
16.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18809
44.7%
Dash Punctuation 11675
27.8%
Uppercase Letter 11546
27.5%
Space Separator 2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 2143
18.6%
L 1764
15.3%
N 1280
11.1%
A 880
7.6%
I 850
 
7.4%
C 696
 
6.0%
P 674
 
5.8%
R 480
 
4.2%
F 439
 
3.8%
J 438
 
3.8%
Other values (11) 1902
16.5%
Decimal Number
ValueCountFrequency (%)
0 10403
55.3%
2 3324
 
17.7%
1 2198
 
11.7%
3 719
 
3.8%
4 579
 
3.1%
5 450
 
2.4%
6 375
 
2.0%
7 308
 
1.6%
9 227
 
1.2%
8 226
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 11675
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30486
72.5%
Latin 11546
 
27.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 2143
18.6%
L 1764
15.3%
N 1280
11.1%
A 880
7.6%
I 850
 
7.4%
C 696
 
6.0%
P 674
 
5.8%
R 480
 
4.2%
F 439
 
3.8%
J 438
 
3.8%
Other values (11) 1902
16.5%
Common
ValueCountFrequency (%)
- 11675
38.3%
0 10403
34.1%
2 3324
 
10.9%
1 2198
 
7.2%
3 719
 
2.4%
4 579
 
1.9%
5 450
 
1.5%
6 375
 
1.2%
7 308
 
1.0%
9 227
 
0.7%
Other values (2) 228
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42032
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 11675
27.8%
0 10403
24.8%
2 3324
 
7.9%
1 2198
 
5.2%
U 2143
 
5.1%
L 1764
 
4.2%
N 1280
 
3.0%
A 880
 
2.1%
I 850
 
2.0%
3 719
 
1.7%
Other values (23) 6796
16.2%
Distinct2331
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size18.4 KiB
2023-12-12T15:25:48.245939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length11.538758
Min length3

Characters and Unicode

Total characters26943
Distinct characters448
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2327 ?
Unique (%)99.7%

Sample

1st row천전리각석 전경
2nd row선바위와 선암사
3rd row선바위
4th row옹기마을_전경
5th row옹기마을_상징조형물
ValueCountFrequency (%)
417
 
8.3%
전경 261
 
5.2%
상공에서 194
 
3.9%
전경_2 89
 
1.8%
전경_2020 75
 
1.5%
태화강 38
 
0.8%
주변 36
 
0.7%
주거지 32
 
0.6%
시가지 28
 
0.6%
고층주거군 26
 
0.5%
Other values (2583) 3800
76.1%
2023-12-12T15:25:48.801792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2662
 
9.9%
_ 1101
 
4.1%
2 1001
 
3.7%
752
 
2.8%
695
 
2.6%
694
 
2.6%
598
 
2.2%
596
 
2.2%
572
 
2.1%
496
 
1.8%
Other values (438) 17776
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20633
76.6%
Space Separator 2662
 
9.9%
Decimal Number 1826
 
6.8%
Connector Punctuation 1101
 
4.1%
Open Punctuation 198
 
0.7%
Close Punctuation 195
 
0.7%
Uppercase Letter 172
 
0.6%
Other Punctuation 95
 
0.4%
Dash Punctuation 60
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
752
 
3.6%
695
 
3.4%
694
 
3.4%
598
 
2.9%
596
 
2.9%
572
 
2.8%
496
 
2.4%
464
 
2.2%
440
 
2.1%
365
 
1.8%
Other values (405) 14961
72.5%
Uppercase Letter
ValueCountFrequency (%)
C 59
34.3%
K 23
 
13.4%
T 19
 
11.0%
B 16
 
9.3%
I 14
 
8.1%
X 14
 
8.1%
S 6
 
3.5%
D 5
 
2.9%
G 4
 
2.3%
U 3
 
1.7%
Other values (6) 9
 
5.2%
Decimal Number
ValueCountFrequency (%)
2 1001
54.8%
0 474
26.0%
1 228
 
12.5%
3 46
 
2.5%
4 38
 
2.1%
5 14
 
0.8%
7 9
 
0.5%
6 8
 
0.4%
8 6
 
0.3%
9 2
 
0.1%
Space Separator
ValueCountFrequency (%)
2662
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 198
100.0%
Close Punctuation
ValueCountFrequency (%)
) 195
100.0%
Other Punctuation
ValueCountFrequency (%)
, 95
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20633
76.6%
Common 6137
 
22.8%
Latin 173
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
752
 
3.6%
695
 
3.4%
694
 
3.4%
598
 
2.9%
596
 
2.9%
572
 
2.8%
496
 
2.4%
464
 
2.2%
440
 
2.1%
365
 
1.8%
Other values (405) 14961
72.5%
Latin
ValueCountFrequency (%)
C 59
34.1%
K 23
 
13.3%
T 19
 
11.0%
B 16
 
9.2%
I 14
 
8.1%
X 14
 
8.1%
S 6
 
3.5%
D 5
 
2.9%
G 4
 
2.3%
U 3
 
1.7%
Other values (7) 10
 
5.8%
Common
ValueCountFrequency (%)
2662
43.4%
_ 1101
17.9%
2 1001
 
16.3%
0 474
 
7.7%
1 228
 
3.7%
( 198
 
3.2%
) 195
 
3.2%
, 95
 
1.5%
- 60
 
1.0%
3 46
 
0.7%
Other values (6) 77
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20633
76.6%
ASCII 6310
 
23.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2662
42.2%
_ 1101
17.4%
2 1001
 
15.9%
0 474
 
7.5%
1 228
 
3.6%
( 198
 
3.1%
) 195
 
3.1%
, 95
 
1.5%
- 60
 
1.0%
C 59
 
0.9%
Other values (23) 237
 
3.8%
Hangul
ValueCountFrequency (%)
752
 
3.6%
695
 
3.4%
694
 
3.4%
598
 
2.9%
596
 
2.9%
572
 
2.8%
496
 
2.4%
464
 
2.2%
440
 
2.1%
365
 
1.8%
Other values (405) 14961
72.5%
Distinct1642
Distinct (%)70.4%
Missing2
Missing (%)0.1%
Memory size18.4 KiB
2023-12-12T15:25:49.215441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length16.474068
Min length9

Characters and Unicode

Total characters38434
Distinct characters162
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1272 ?
Unique (%)54.5%

Sample

1st row울산 울주군 두동면 천전리 산 210-2
2nd row울산 울주군 범서읍 구영리 1073-1
3rd row울산 울주군 범서읍 구영리 1073-1
4th row울산 울주군 온양읍 고산리 491-2
5th row울산 울주군 온양읍 고산리 470-14
ValueCountFrequency (%)
울산 2324
22.2%
울주군 807
 
7.7%
남구 607
 
5.8%
중구 436
 
4.2%
311
 
3.0%
북구 274
 
2.6%
동구 201
 
1.9%
신정동 126
 
1.2%
범서읍 117
 
1.1%
무거동 96
 
0.9%
Other values (1697) 5153
49.3%
2023-12-12T15:25:49.721756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8119
21.1%
3132
 
8.1%
2948
 
7.7%
1873
 
4.9%
1 1849
 
4.8%
1593
 
4.1%
- 1578
 
4.1%
2 1134
 
3.0%
3 1012
 
2.6%
4 849
 
2.2%
Other values (152) 14347
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19783
51.5%
Decimal Number 8954
23.3%
Space Separator 8119
21.1%
Dash Punctuation 1578
 
4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3132
15.8%
2948
14.9%
1873
 
9.5%
1593
 
8.1%
823
 
4.2%
821
 
4.2%
812
 
4.1%
810
 
4.1%
499
 
2.5%
457
 
2.3%
Other values (140) 6015
30.4%
Decimal Number
ValueCountFrequency (%)
1 1849
20.6%
2 1134
12.7%
3 1012
11.3%
4 849
9.5%
6 765
8.5%
8 715
 
8.0%
7 682
 
7.6%
5 667
 
7.4%
9 647
 
7.2%
0 634
 
7.1%
Space Separator
ValueCountFrequency (%)
8119
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1578
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19783
51.5%
Common 18651
48.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3132
15.8%
2948
14.9%
1873
 
9.5%
1593
 
8.1%
823
 
4.2%
821
 
4.2%
812
 
4.1%
810
 
4.1%
499
 
2.5%
457
 
2.3%
Other values (140) 6015
30.4%
Common
ValueCountFrequency (%)
8119
43.5%
1 1849
 
9.9%
- 1578
 
8.5%
2 1134
 
6.1%
3 1012
 
5.4%
4 849
 
4.6%
6 765
 
4.1%
8 715
 
3.8%
7 682
 
3.7%
5 667
 
3.6%
Other values (2) 1281
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19783
51.5%
ASCII 18651
48.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8119
43.5%
1 1849
 
9.9%
- 1578
 
8.5%
2 1134
 
6.1%
3 1012
 
5.4%
4 849
 
4.6%
6 765
 
4.1%
8 715
 
3.8%
7 682
 
3.7%
5 667
 
3.6%
Other values (2) 1281
 
6.9%
Hangul
ValueCountFrequency (%)
3132
15.8%
2948
14.9%
1873
 
9.5%
1593
 
8.1%
823
 
4.2%
821
 
4.2%
812
 
4.1%
810
 
4.1%
499
 
2.5%
457
 
2.3%
Other values (140) 6015
30.4%

Interactions

2023-12-12T15:25:43.308413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:43.128014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:43.420117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:43.218559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:25:49.814931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리번호구-군대분류중분류촬영지점시점
관리번호1.0000.9740.7430.8410.2340.322
구-군0.9741.0000.4170.6210.1990.081
대분류0.7430.4171.0001.0000.3920.783
중분류0.8410.6211.0001.0000.5410.849
촬영지점0.2340.1990.3920.5411.0000.275
시점0.3220.0810.7830.8490.2751.000
2023-12-12T15:25:49.902280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류시점구-군
대분류1.0000.5750.232
시점0.5751.0000.066
구-군0.2320.0661.000
2023-12-12T15:25:49.986784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리번호촬영지점구-군대분류시점
관리번호1.000-0.0910.7730.4160.197
촬영지점-0.0911.0000.0840.1690.167
구-군0.7730.0841.0000.2320.066
대분류0.4160.1690.2321.0000.575
시점0.1970.1670.0660.5751.000

Missing values

2023-12-12T15:25:43.570989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:25:43.746337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:25:43.876554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

관리번호사업회차구-군대분류중분류경관기록대상촬영지점시점표준기록점 코드표준기록점명표준기록점주소
055962울주군지역상징경관12경<NA>13현재02-U-LS-LU-013-013천전리각석 전경울산 울주군 두동면 천전리 산 210-2
155952울주군지역상징경관12경<NA>9현재02-U-LS-LU-013-009선바위와 선암사울산 울주군 범서읍 구영리 1073-1
255942울주군지역상징경관12경<NA>8현재02-U-LS-LU-013-008선바위울산 울주군 범서읍 구영리 1073-1
355932울주군지역상징경관12경외고산 옹기마을3현재02-U-LS-LU-011-003옹기마을_전경울산 울주군 온양읍 고산리 491-2
455922울주군지역상징경관12경외고산 옹기마을2현재02-U-LS-LU-011-002옹기마을_상징조형물울산 울주군 온양읍 고산리 470-14
555912울주군지역상징경관12경외고산 옹기마을1현재02-U-LS-LU-011-001옹기마을울산 울주군 온양읍 고산리 산 122-1
655902울주군자연경관신불산1현재02-U-LS-LU-004-005신불산가는길울산 울주군 상북면 등억알프스리 산 181
755892울주군지역상징경관12경신불산 억새평원4현재02-U-LS-LU-004-004신불산억새평원1_2울산 울주군 상북면 이천리 산 2
855882울주군지역상징경관12경신불산 억새평원3현재02-U-LS-LU-004-003신불산 능선1_2경북 청도군 운문면 신원리 산 29-6
955872울주군지역상징경관12경신불산 억새평원2현재02-U-LS-LU-004-002신불산억세평원울산 울주군 상북면 등억알프스리 산 181
관리번호사업회차구-군대분류중분류경관기록대상촬영지점시점표준기록점 코드표준기록점명표준기록점주소
232559692울주군시가지경관시가지전경(항공)울주삼남 방기리4현재02-U-UL-UP-041-004삼남 방기리_2울산 울주군 삼남읍 방기리 67-1
232659702울주군시가지경관시가지전경(항공)울주삼동 하잠리1현재02-U-UL-UP-042-001보은천과 하잠마을울산 울주군 삼동면 하잠리 924
232759712울주군시가지경관시가지전경(항공)울주삼동 하잠리2현재02-U-UL-UP-042-002청보리밭 상공에서 본 하잠마을울산 울주군 삼동면 하잠리 393
232859722울주군시가지경관시가지전경(항공)울주삼동 보은리2현재02-U-UL-UP-043-002왕방마을에서 본 보은마을울산 울주군 삼동면 보은리 676
232959732울주군시가지경관시가지전경(항공)울주삼동 조일리1현재02-U-UL-UP-044-001삼동조일리 보삼마을울산 울주군 삼동면 조일리 산 358
233059742울주군시가지경관시가지전경(항공)울산시 파노라마1현재02-U-UL-UP-045-001가지산 능선에서 본 남측전경울산 울주군 상북면 등억알프스리 산 180
233159752울주군시가지경관시가지전경(항공)울산시 파노라마7현재02-U-UL-UP-045-007봉화산 상공에서 본 동측전경울산 울주군 삼남읍 교동리 산 86-17
233259762울주군시가지경관시가지전경(항공)울산시 파노라마9현재02-U-UL-UP-045-009언양구수리에서 본 서측전경울산 울주군 언양읍 구수리 813-4
233359772울주군시가지경관시가지전경(항공)울산시 파노라마12현재02-U-UL-UP-045-012장검마을 상공에서 본 북측전경울산 울주군 범서읍 굴화리 산 66-2
233459782울주군시가지경관시가지전경(항공)울산시 파노라마13현재02-U-UL-UP-045-013천량읍 상공에서 본 동측전경울산 울주군 청량읍 상남리 482-1