Overview

Dataset statistics

Number of variables17
Number of observations10000
Missing cells13139
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 MiB
Average record size in memory147.0 B

Variable types

Text5
Numeric3
Categorical9

Dataset

Description제주특별자치도개발공사가 관리하는 제주삼다수공장 주변 관측소 정보(관측소명, 주소, 위도, 경도, 표고, 심도, 케이싱구경, 측정항목) 입니다.
Author제주특별자치도개발공사
URLhttps://www.data.go.kr/data/15062324/fileData.do

Alerts

이름 is highly overall correlated with 층계단위 and 1 other fieldsHigh correlation
풍화정도 is highly overall correlated with 이름 and 5 other fieldsHigh correlation
그래픽정보 is highly overall correlated with 층계단위 and 4 other fieldsHigh correlation
암석영문3 is highly overall correlated with 층계단위 and 4 other fieldsHigh correlation
암석한글3 is highly overall correlated with 층계단위 and 4 other fieldsHigh correlation
암석코드 is highly overall correlated with 층계단위 and 4 other fieldsHigh correlation
밀도정도 is highly overall correlated with 층계단위 and 1 other fieldsHigh correlation
층계단위 is highly overall correlated with 깊이 and 8 other fieldsHigh correlation
깊이 is highly overall correlated with 끝깊이 and 1 other fieldsHigh correlation
끝깊이 is highly overall correlated with 깊이 and 1 other fieldsHigh correlation
단위두께 is highly overall correlated with 층계단위 and 1 other fieldsHigh correlation
화학조성정도 is highly overall correlated with 단위두께High correlation
층계단위 is highly imbalanced (72.3%)Imbalance
이름 is highly imbalanced (53.0%)Imbalance
밀도정도 is highly imbalanced (65.8%)Imbalance
화학조성정도 is highly imbalanced (99.1%)Imbalance
풍화정도 is highly imbalanced (99.3%)Imbalance
암석한글1 has 1290 (12.9%) missing valuesMissing
암석영문1 has 2313 (23.1%) missing valuesMissing
단위비고 has 4280 (42.8%) missing valuesMissing
레이어색상 has 5256 (52.6%) missing valuesMissing
깊이 has 383 (3.8%) zerosZeros
단위두께 has 9410 (94.1%) zerosZeros

Reproduction

Analysis started2023-12-12 17:41:10.390099
Analysis finished2023-12-12 17:41:14.272081
Duration3.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1186
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:41:14.536405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length6
Mean length6.0931
Min length4

Characters and Unicode

Total characters60931
Distinct characters152
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)0.3%

Sample

1st rowF-0234
2nd rowF-0485
3rd rowF-0348
4th rowF-0407
5th rowD-0084
ValueCountFrequency (%)
f-0512 36
 
0.4%
f-0542 36
 
0.4%
m1-7 36
 
0.4%
f-0594 33
 
0.3%
감시1-7 29
 
0.3%
f-0541 29
 
0.3%
f-0544 29
 
0.3%
회수지구1호공 29
 
0.3%
f-0485 29
 
0.3%
서귀포시자연휴양림 28
 
0.3%
Other values (1180) 9755
96.9%
2023-12-13T02:41:14.979335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11217
18.4%
- 8221
13.5%
F 5408
 
8.9%
2 3736
 
6.1%
1 3637
 
6.0%
5 2886
 
4.7%
3 2772
 
4.5%
4 2740
 
4.5%
D 2502
 
4.1%
6 1874
 
3.1%
Other values (142) 15938
26.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 33555
55.1%
Uppercase Letter 10556
 
17.3%
Dash Punctuation 8221
 
13.5%
Other Letter 7960
 
13.1%
Open Punctuation 285
 
0.5%
Close Punctuation 285
 
0.5%
Space Separator 69
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
750
 
9.4%
657
 
8.3%
653
 
8.2%
626
 
7.9%
596
 
7.5%
559
 
7.0%
169
 
2.1%
157
 
2.0%
146
 
1.8%
119
 
1.5%
Other values (117) 3528
44.3%
Uppercase Letter
ValueCountFrequency (%)
F 5408
51.2%
D 2502
23.7%
J 1003
 
9.5%
M 412
 
3.9%
R 396
 
3.8%
N 307
 
2.9%
S 143
 
1.4%
W 121
 
1.1%
P 98
 
0.9%
H 83
 
0.8%
Decimal Number
ValueCountFrequency (%)
0 11217
33.4%
2 3736
 
11.1%
1 3637
 
10.8%
5 2886
 
8.6%
3 2772
 
8.3%
4 2740
 
8.2%
6 1874
 
5.6%
7 1613
 
4.8%
9 1565
 
4.7%
8 1515
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 8221
100.0%
Open Punctuation
ValueCountFrequency (%)
( 285
100.0%
Close Punctuation
ValueCountFrequency (%)
) 285
100.0%
Space Separator
ValueCountFrequency (%)
69
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 42415
69.6%
Latin 10556
 
17.3%
Hangul 7960
 
13.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
750
 
9.4%
657
 
8.3%
653
 
8.2%
626
 
7.9%
596
 
7.5%
559
 
7.0%
169
 
2.1%
157
 
2.0%
146
 
1.8%
119
 
1.5%
Other values (117) 3528
44.3%
Common
ValueCountFrequency (%)
0 11217
26.4%
- 8221
19.4%
2 3736
 
8.8%
1 3637
 
8.6%
5 2886
 
6.8%
3 2772
 
6.5%
4 2740
 
6.5%
6 1874
 
4.4%
7 1613
 
3.8%
9 1565
 
3.7%
Other values (4) 2154
 
5.1%
Latin
ValueCountFrequency (%)
F 5408
51.2%
D 2502
23.7%
J 1003
 
9.5%
M 412
 
3.9%
R 396
 
3.8%
N 307
 
2.9%
S 143
 
1.4%
W 121
 
1.1%
P 98
 
0.9%
H 83
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52971
86.9%
Hangul 7960
 
13.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11217
21.2%
- 8221
15.5%
F 5408
10.2%
2 3736
 
7.1%
1 3637
 
6.9%
5 2886
 
5.4%
3 2772
 
5.2%
4 2740
 
5.2%
D 2502
 
4.7%
6 1874
 
3.5%
Other values (15) 7978
15.1%
Hangul
ValueCountFrequency (%)
750
 
9.4%
657
 
8.3%
653
 
8.2%
626
 
7.9%
596
 
7.5%
559
 
7.0%
169
 
2.1%
157
 
2.0%
146
 
1.8%
119
 
1.5%
Other values (117) 3528
44.3%

깊이
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1190
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.95576
Minimum0
Maximum646
Zeros383
Zeros (%)3.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:41:15.116393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q128
median64
Q3115
95-th percentile240
Maximum646
Range646
Interquartile range (IQR)87

Descriptive statistics

Standard deviation77.671344
Coefficient of variation (CV)0.9251461
Kurtosis3.9687836
Mean83.95576
Median Absolute Deviation (MAD)40.65
Skewness1.7091717
Sum839557.6
Variance6032.8376
MonotonicityNot monotonic
2023-12-13T02:41:15.300294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 383
 
3.8%
1.0 98
 
1.0%
5.0 81
 
0.8%
50.0 74
 
0.7%
40.0 66
 
0.7%
27.0 65
 
0.7%
60.0 65
 
0.7%
24.0 63
 
0.6%
2.0 63
 
0.6%
42.0 63
 
0.6%
Other values (1180) 8979
89.8%
ValueCountFrequency (%)
0.0 383
3.8%
0.2 4
 
< 0.1%
0.3 8
 
0.1%
0.4 1
 
< 0.1%
0.5 52
 
0.5%
0.6 3
 
< 0.1%
0.7 2
 
< 0.1%
0.8 5
 
0.1%
1.0 98
 
1.0%
1.1 1
 
< 0.1%
ValueCountFrequency (%)
646.0 1
< 0.1%
556.0 1
< 0.1%
552.0 1
< 0.1%
521.0 1
< 0.1%
520.0 2
< 0.1%
510.0 1
< 0.1%
509.0 1
< 0.1%
508.0 1
< 0.1%
507.0 1
< 0.1%
503.0 1
< 0.1%

끝깊이
Real number (ℝ)

HIGH CORRELATION 

Distinct1201
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89.93726
Minimum0
Maximum876
Zeros54
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:41:15.432917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q133
median70
Q3122
95-th percentile251
Maximum876
Range876
Interquartile range (IQR)89

Descriptive statistics

Standard deviation79.54282
Coefficient of variation (CV)0.88442565
Kurtosis4.6847035
Mean89.93726
Median Absolute Deviation (MAD)42
Skewness1.7626484
Sum899372.6
Variance6327.0602
MonotonicityNot monotonic
2023-12-13T02:41:15.559337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 106
 
1.1%
5.0 85
 
0.9%
50.0 74
 
0.7%
80.0 71
 
0.7%
60.0 67
 
0.7%
10.0 63
 
0.6%
30.0 63
 
0.6%
27.0 63
 
0.6%
100.0 62
 
0.6%
90.0 61
 
0.6%
Other values (1191) 9285
92.8%
ValueCountFrequency (%)
0.0 54
0.5%
0.3 9
 
0.1%
0.4 4
 
< 0.1%
0.5 58
0.6%
0.6 2
 
< 0.1%
0.7 3
 
< 0.1%
0.8 4
 
< 0.1%
1.0 106
1.1%
1.2 1
 
< 0.1%
1.5 23
 
0.2%
ValueCountFrequency (%)
876.0 1
 
< 0.1%
646.0 1
 
< 0.1%
615.0 1
 
< 0.1%
560.0 1
 
< 0.1%
525.0 3
< 0.1%
521.0 1
 
< 0.1%
520.0 1
 
< 0.1%
510.0 1
 
< 0.1%
509.0 2
< 0.1%
507.0 1
 
< 0.1%

단위두께
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct111
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.42815
Minimum0
Maximum117
Zeros9410
Zeros (%)94.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:41:15.707006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1.905
Maximum117
Range117
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.9129835
Coefficient of variation (CV)6.8036519
Kurtosis407.56139
Mean0.42815
Median Absolute Deviation (MAD)0
Skewness15.936818
Sum4281.5
Variance8.4854731
MonotonicityNot monotonic
2023-12-13T02:41:15.833601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 9410
94.1%
2.0 62
 
0.6%
3.0 48
 
0.5%
4.0 42
 
0.4%
1.0 40
 
0.4%
5.0 32
 
0.3%
7.0 29
 
0.3%
1.5 27
 
0.3%
6.0 25
 
0.2%
2.5 21
 
0.2%
Other values (101) 264
 
2.6%
ValueCountFrequency (%)
0.0 9410
94.1%
0.3 1
 
< 0.1%
0.4 1
 
< 0.1%
0.5 12
 
0.1%
1.0 40
 
0.4%
1.1 1
 
< 0.1%
1.3 2
 
< 0.1%
1.4 1
 
< 0.1%
1.5 27
 
0.3%
1.6 1
 
< 0.1%
ValueCountFrequency (%)
117.0 1
< 0.1%
74.0 1
< 0.1%
70.0 1
< 0.1%
54.2 1
< 0.1%
52.0 1
< 0.1%
47.0 1
< 0.1%
46.0 1
< 0.1%
44.5 1
< 0.1%
44.0 1
< 0.1%
43.8 1
< 0.1%

층계단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9523 
서귀포층
 
477

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9523
95.2%
서귀포층 477
 
4.8%

Length

2023-12-13T02:41:15.950782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:16.037126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9523
95.2%
서귀포층 477
 
4.8%

이름
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5698 
연암
1219 
보통암
885 
사력층
822 
풍화암
749 
Other values (18)
627 

Length

Max length7
Median length4
Mean length3.4149
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row보통암
4th row보통암
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5698
57.0%
연암 1219
 
12.2%
보통암 885
 
8.8%
사력층 822
 
8.2%
풍화암 749
 
7.5%
토사 236
 
2.4%
토사층 190
 
1.9%
사력 52
 
0.5%
경암 43
 
0.4%
점토층 39
 
0.4%
Other values (13) 67
 
0.7%

Length

2023-12-13T02:41:16.131996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5698
57.0%
연암 1219
 
12.2%
보통암 885
 
8.8%
사력층 822
 
8.2%
풍화암 749
 
7.5%
토사 236
 
2.4%
토사층 190
 
1.9%
사력 52
 
0.5%
경암 43
 
0.4%
점토층 39
 
0.4%
Other values (13) 67
 
0.7%

암석코드
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
AFOB
2431 
CL
1614 
APB
1383 
FB
779 
M
503 
Other values (30)
3290 

Length

Max length7
Median length4
Mean length2.969
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowTOPSOIL
2nd rowAFB
3rd rowAPB
4th rowAFOB
5th rowPAFB

Common Values

ValueCountFrequency (%)
AFOB 2431
24.3%
CL 1614
16.1%
APB 1383
13.8%
FB 779
 
7.8%
M 503
 
5.0%
<NA> 427
 
4.3%
PAB 316
 
3.2%
AFB 309
 
3.1%
TOPSOIL 283
 
2.8%
PFB 223
 
2.2%
Other values (25) 1732
17.3%

Length

2023-12-13T02:41:16.264657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
afob 2431
24.3%
cl 1614
16.1%
apb 1383
13.8%
fb 779
 
7.8%
m 503
 
5.0%
na 427
 
4.3%
pab 316
 
3.2%
afb 309
 
3.1%
topsoil 283
 
2.8%
pfb 223
 
2.2%
Other values (25) 1732
17.3%

암석한글1
Text

MISSING 

Distinct149
Distinct (%)1.7%
Missing1290
Missing (%)12.9%
Memory size156.2 KiB
2023-12-13T02:41:16.521290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length6.9354765
Min length2

Characters and Unicode

Total characters60408
Distinct characters101
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)0.5%

Sample

1st row표토층
2nd row비현정질현무암
3rd row침상장석감람석현무암
4th row반상휘석현무암-1
5th row화산쇄설층
ValueCountFrequency (%)
침상장석감람석현무암 1828
20.9%
장석현무암 734
 
8.4%
비현정질현무암 674
 
7.7%
화산쇄설층 519
 
5.9%
함침상장석감람석현무암 502
 
5.8%
반상휘석현무암 284
 
3.3%
화성쇄설층 267
 
3.1%
점토층 251
 
2.9%
표토 244
 
2.8%
장석휘석현무암 214
 
2.5%
Other values (141) 3209
36.8%
2023-12-13T02:41:16.934567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8446
14.0%
6716
11.1%
6558
10.9%
5952
 
9.9%
4183
 
6.9%
3100
 
5.1%
2956
 
4.9%
2956
 
4.9%
2343
 
3.9%
1601
 
2.7%
Other values (91) 15597
25.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59385
98.3%
Decimal Number 491
 
0.8%
Dash Punctuation 199
 
0.3%
Other Punctuation 106
 
0.2%
Open Punctuation 105
 
0.2%
Close Punctuation 105
 
0.2%
Space Separator 16
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8446
14.2%
6716
11.3%
6558
11.0%
5952
10.0%
4183
 
7.0%
3100
 
5.2%
2956
 
5.0%
2956
 
5.0%
2343
 
3.9%
1601
 
2.7%
Other values (79) 14574
24.5%
Decimal Number
ValueCountFrequency (%)
1 171
34.8%
2 111
22.6%
0 104
21.2%
5 104
21.2%
3 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
% 104
98.1%
, 2
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 199
100.0%
Open Punctuation
ValueCountFrequency (%)
( 105
100.0%
Close Punctuation
ValueCountFrequency (%)
) 105
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59385
98.3%
Common 1022
 
1.7%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8446
14.2%
6716
11.3%
6558
11.0%
5952
10.0%
4183
 
7.0%
3100
 
5.2%
2956
 
5.0%
2956
 
5.0%
2343
 
3.9%
1601
 
2.7%
Other values (79) 14574
24.5%
Common
ValueCountFrequency (%)
- 199
19.5%
1 171
16.7%
2 111
10.9%
( 105
10.3%
) 105
10.3%
0 104
10.2%
5 104
10.2%
% 104
10.2%
16
 
1.6%
, 2
 
0.2%
Latin
ValueCountFrequency (%)
U 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59385
98.3%
ASCII 1023
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8446
14.2%
6716
11.3%
6558
11.0%
5952
10.0%
4183
 
7.0%
3100
 
5.2%
2956
 
5.0%
2956
 
5.0%
2343
 
3.9%
1601
 
2.7%
Other values (79) 14574
24.5%
ASCII
ValueCountFrequency (%)
- 199
19.5%
1 171
16.7%
2 111
10.9%
( 105
10.3%
) 105
10.3%
0 104
10.2%
5 104
10.2%
% 104
10.2%
16
 
1.6%
, 2
 
0.2%
Other values (2) 2
 
0.2%

암석영문1
Text

MISSING 

Distinct150
Distinct (%)2.0%
Missing2313
Missing (%)23.1%
Memory size156.2 KiB
2023-12-13T02:41:17.164554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length3.6439443
Min length1

Characters and Unicode

Total characters28011
Distinct characters53
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)0.5%

Sample

1st rowsoil
2nd rowA-pFB
3rd rowAPB
4th rowFOB
5th rowPAB-1
ValueCountFrequency (%)
fob 1509
19.5%
fb 720
 
9.3%
acf-ob 547
 
7.1%
f.o.b 511
 
6.6%
apb 444
 
5.7%
pab 286
 
3.7%
ap-b 271
 
3.5%
tb 226
 
2.9%
pfb 204
 
2.6%
fab 187
 
2.4%
Other values (107) 2831
36.6%
2023-12-13T02:41:17.771707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
B 6256
22.3%
F 4631
16.5%
O 3196
11.4%
A 2438
 
8.7%
. 1473
 
5.3%
P 1229
 
4.4%
- 1224
 
4.4%
a 846
 
3.0%
c 700
 
2.5%
T 560
 
2.0%
Other values (43) 5458
19.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 19781
70.6%
Lowercase Letter 5112
 
18.2%
Other Punctuation 1474
 
5.3%
Dash Punctuation 1224
 
4.4%
Open Punctuation 137
 
0.5%
Close Punctuation 137
 
0.5%
Decimal Number 96
 
0.3%
Space Separator 49
 
0.2%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 846
16.5%
c 700
13.7%
p 467
9.1%
l 397
7.8%
i 361
 
7.1%
s 334
 
6.5%
o 326
 
6.4%
d 285
 
5.6%
u 250
 
4.9%
m 188
 
3.7%
Other values (12) 958
18.7%
Uppercase Letter
ValueCountFrequency (%)
B 6256
31.6%
F 4631
23.4%
O 3196
16.2%
A 2438
 
12.3%
P 1229
 
6.2%
T 560
 
2.8%
S 538
 
2.7%
C 182
 
0.9%
G 179
 
0.9%
I 126
 
0.6%
Other values (11) 446
 
2.3%
Decimal Number
ValueCountFrequency (%)
1 68
70.8%
2 27
 
28.1%
3 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 1473
99.9%
, 1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1224
100.0%
Open Punctuation
ValueCountFrequency (%)
( 137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 137
100.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 24893
88.9%
Common 3118
 
11.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 6256
25.1%
F 4631
18.6%
O 3196
12.8%
A 2438
 
9.8%
P 1229
 
4.9%
a 846
 
3.4%
c 700
 
2.8%
T 560
 
2.2%
S 538
 
2.2%
p 467
 
1.9%
Other values (33) 4032
16.2%
Common
ValueCountFrequency (%)
. 1473
47.2%
- 1224
39.3%
( 137
 
4.4%
) 137
 
4.4%
1 68
 
2.2%
49
 
1.6%
2 27
 
0.9%
~ 1
 
< 0.1%
3 1
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28011
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 6256
22.3%
F 4631
16.5%
O 3196
11.4%
A 2438
 
8.7%
. 1473
 
5.3%
P 1229
 
4.4%
- 1224
 
4.4%
a 846
 
3.0%
c 700
 
2.5%
T 560
 
2.0%
Other values (43) 5458
19.5%

암석한글3
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
침상장석감람석현무암
2431 
클링커
1614 
비반상현무암
1383 
장석현무암
779 
머드
503 
Other values (28)
3290 

Length

Max length10
Median length8
Mean length5.9879
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row표토
2nd row휘석장석현무암
3rd row비반상현무암
4th row침상장석감람석현무암
5th row반상휘석장석현무암

Common Values

ValueCountFrequency (%)
침상장석감람석현무암 2431
24.3%
클링커 1614
16.1%
비반상현무암 1383
13.8%
장석현무암 779
 
7.8%
머드 503
 
5.0%
<NA> 427
 
4.3%
반상휘석현무암 316
 
3.2%
휘석장석현무암 309
 
3.1%
표토 293
 
2.9%
응회암 225
 
2.2%
Other values (23) 1720
17.2%

Length

2023-12-13T02:41:17.933146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
침상장석감람석현무암 2431
24.3%
클링커 1614
16.1%
비반상현무암 1383
13.8%
장석현무암 779
 
7.8%
머드 503
 
5.0%
na 427
 
4.3%
반상휘석현무암 316
 
3.2%
휘석장석현무암 309
 
3.1%
표토 293
 
2.9%
응회암 225
 
2.2%
Other values (23) 1720
17.2%

암석영문3
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
aFOB
2431 
CL
1614 
ApB
1383 
FB
779 
M
523 
Other values (28)
3270 

Length

Max length9
Median length7
Mean length2.9644
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowTopsoil
2nd rowAFB
3rd rowApB
4th rowaFOB
5th rowPAFB

Common Values

ValueCountFrequency (%)
aFOB 2431
24.3%
CL 1614
16.1%
ApB 1383
13.8%
FB 779
 
7.8%
M 523
 
5.2%
<NA> 427
 
4.3%
PAB 316
 
3.2%
AFB 309
 
3.1%
Topsoil 283
 
2.8%
PFB 223
 
2.2%
Other values (23) 1712
17.1%

Length

2023-12-13T02:41:18.062869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
afob 2431
24.3%
cl 1614
16.1%
apb 1383
13.8%
fb 779
 
7.8%
m 523
 
5.2%
na 427
 
4.3%
pab 316
 
3.2%
afb 309
 
3.1%
topsoil 283
 
2.8%
pfb 223
 
2.2%
Other values (22) 1712
17.1%

그래픽정보
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
침상장석감람석현무암 (AFOB)
2433 
클링커 (CL)
1615 
비반상현무암 (APB)
1382 
장석현무암 (FB)
777 
머드 (M)
509 
Other values (34)
3284 

Length

Max length17
Median length14
Mean length11.5099
Min length4

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row표토 (TS)
2nd row휘석장석현무암 (AFB)
3rd row비반상현무암 (APB)
4th row침상장석감람석현무암 (AFOB)
5th row반상휘석장석현무암 (PAFB)

Common Values

ValueCountFrequency (%)
침상장석감람석현무암 (AFOB) 2433
24.3%
클링커 (CL) 1615
16.2%
비반상현무암 (APB) 1382
13.8%
장석현무암 (FB) 777
 
7.8%
머드 (M) 509
 
5.1%
<NA> 409
 
4.1%
반상휘석현무암 (PAB) 316
 
3.2%
휘석장석현무암 (AFB) 309
 
3.1%
표토 (TS) 284
 
2.8%
반상장석현무암 (PFB) 223
 
2.2%
Other values (29) 1743
17.4%

Length

2023-12-13T02:41:18.185605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
침상장석감람석현무암 2433
 
12.4%
afob 2433
 
12.4%
클링커 1615
 
8.2%
cl 1615
 
8.2%
비반상현무암 1382
 
7.0%
apb 1382
 
7.0%
장석현무암 777
 
4.0%
fb 777
 
4.0%
m 523
 
2.7%
머드 509
 
2.6%
Other values (52) 6209
31.6%

단위비고
Text

MISSING 

Distinct3490
Distinct (%)61.0%
Missing4280
Missing (%)42.8%
Memory size156.2 KiB
2023-12-13T02:41:18.437516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length255
Median length186
Mean length22.226049
Min length1

Characters and Unicode

Total characters127133
Distinct characters499
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2806 ?
Unique (%)49.1%

Sample

1st row6개의 단위층으로 구성
2nd row절리 다소
3rd row절리대
4th row코아내부 적갈색, 기공무, 암석조밀
5th row역 혼재
ValueCountFrequency (%)
1098
 
4.2%
발달 602
 
2.3%
균열 522
 
2.0%
치밀견고 482
 
1.8%
구성 461
 
1.8%
기공 403
 
1.5%
clinker 399
 
1.5%
단위층으로 344
 
1.3%
파쇄 342
 
1.3%
다공질 311
 
1.2%
Other values (4697) 21236
81.1%
2023-12-13T02:41:18.825285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20668
 
16.3%
. 6201
 
4.9%
1 2855
 
2.2%
, 2670
 
2.1%
0 2456
 
1.9%
m 2400
 
1.9%
~ 2171
 
1.7%
1828
 
1.4%
: 1827
 
1.4%
1667
 
1.3%
Other values (489) 82390
64.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66074
52.0%
Space Separator 20668
 
16.3%
Decimal Number 14190
 
11.2%
Other Punctuation 10928
 
8.6%
Lowercase Letter 10337
 
8.1%
Math Symbol 2296
 
1.8%
Dash Punctuation 1382
 
1.1%
Uppercase Letter 905
 
0.7%
Open Punctuation 167
 
0.1%
Close Punctuation 165
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1828
 
2.8%
1667
 
2.5%
1471
 
2.2%
1459
 
2.2%
1348
 
2.0%
1347
 
2.0%
1299
 
2.0%
1299
 
2.0%
1282
 
1.9%
1246
 
1.9%
Other values (410) 51828
78.4%
Lowercase Letter
ValueCountFrequency (%)
m 2400
23.2%
i 956
 
9.2%
e 880
 
8.5%
l 858
 
8.3%
n 748
 
7.2%
r 667
 
6.5%
s 543
 
5.3%
a 510
 
4.9%
c 500
 
4.8%
k 472
 
4.6%
Other values (16) 1803
17.4%
Uppercase Letter
ValueCountFrequency (%)
C 432
47.7%
S 67
 
7.4%
A 66
 
7.3%
M 59
 
6.5%
B 42
 
4.6%
F 38
 
4.2%
V 34
 
3.8%
O 24
 
2.7%
H 20
 
2.2%
U 16
 
1.8%
Other values (13) 107
 
11.8%
Decimal Number
ValueCountFrequency (%)
1 2855
20.1%
0 2456
17.3%
5 1600
11.3%
2 1569
11.1%
3 1162
8.2%
4 1031
 
7.3%
6 917
 
6.5%
8 901
 
6.3%
7 895
 
6.3%
9 804
 
5.7%
Other Punctuation
ValueCountFrequency (%)
. 6201
56.7%
, 2670
24.4%
: 1827
 
16.7%
/ 206
 
1.9%
% 16
 
0.1%
* 6
 
0.1%
· 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 2171
94.6%
88
 
3.8%
+ 28
 
1.2%
= 5
 
0.2%
3
 
0.1%
> 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
17
81.0%
3
 
14.3%
° 1
 
4.8%
Space Separator
ValueCountFrequency (%)
20668
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1382
100.0%
Open Punctuation
ValueCountFrequency (%)
( 167
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66073
52.0%
Common 49817
39.2%
Latin 11242
 
8.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1828
 
2.8%
1667
 
2.5%
1471
 
2.2%
1459
 
2.2%
1348
 
2.0%
1347
 
2.0%
1299
 
2.0%
1299
 
2.0%
1282
 
1.9%
1246
 
1.9%
Other values (409) 51827
78.4%
Latin
ValueCountFrequency (%)
m 2400
21.3%
i 956
 
8.5%
e 880
 
7.8%
l 858
 
7.6%
n 748
 
6.7%
r 667
 
5.9%
s 543
 
4.8%
a 510
 
4.5%
c 500
 
4.4%
k 472
 
4.2%
Other values (39) 2708
24.1%
Common
ValueCountFrequency (%)
20668
41.5%
. 6201
 
12.4%
1 2855
 
5.7%
, 2670
 
5.4%
0 2456
 
4.9%
~ 2171
 
4.4%
: 1827
 
3.7%
5 1600
 
3.2%
2 1569
 
3.1%
- 1382
 
2.8%
Other values (20) 6418
 
12.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66070
52.0%
ASCII 60945
47.9%
None 91
 
0.1%
CJK Compat 20
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Math Operators 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20668
33.9%
. 6201
 
10.2%
1 2855
 
4.7%
, 2670
 
4.4%
0 2456
 
4.0%
m 2400
 
3.9%
~ 2171
 
3.6%
: 1827
 
3.0%
5 1600
 
2.6%
2 1569
 
2.6%
Other values (63) 16528
27.1%
Hangul
ValueCountFrequency (%)
1828
 
2.8%
1667
 
2.5%
1471
 
2.2%
1459
 
2.2%
1348
 
2.0%
1347
 
2.0%
1299
 
2.0%
1299
 
2.0%
1282
 
1.9%
1246
 
1.9%
Other values (408) 51824
78.4%
None
ValueCountFrequency (%)
88
96.7%
· 2
 
2.2%
° 1
 
1.1%
CJK Compat
ValueCountFrequency (%)
17
85.0%
3
 
15.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Math Operators
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

밀도정도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7041 
다공질
1707 
치밀질
1026 
부분 기공발달
 
207
약간 기공발달
 
6
Other values (8)
 
13

Length

Max length7
Median length4
Mean length3.7912
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row다공질
2nd row다공질
3rd row치밀질
4th row치밀질
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7041
70.4%
다공질 1707
 
17.1%
치밀질 1026
 
10.3%
부분 기공발달 207
 
2.1%
약간 기공발달 6
 
0.1%
부분기공발달 4
 
< 0.1%
치밀 2
 
< 0.1%
미고결 2
 
< 0.1%
하부다공질 1
 
< 0.1%
약간기공발달 1
 
< 0.1%
Other values (3) 3
 
< 0.1%

Length

2023-12-13T02:41:18.963664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 7041
68.9%
다공질 1707
 
16.7%
치밀질 1027
 
10.1%
기공발달 213
 
2.1%
부분 207
 
2.0%
약간 6
 
0.1%
부분기공발달 4
 
< 0.1%
치밀 2
 
< 0.1%
미고결 2
 
< 0.1%
하부다공질 1
 
< 0.1%
Other values (4) 4
 
< 0.1%

화학조성정도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9981 
조금 불량
 
8
매우 불량
 
5
매우 양호
 
4
조금 양호
 
1

Length

Max length10
Median length4
Mean length4.0024
Min length4

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9981
99.8%
조금 불량 8
 
0.1%
매우 불량 5
 
0.1%
매우 양호 4
 
< 0.1%
조금 양호 1
 
< 0.1%
olivine 반정 1
 
< 0.1%

Length

2023-12-13T02:41:19.078111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:19.180775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9981
99.6%
불량 13
 
0.1%
조금 9
 
0.1%
매우 9
 
0.1%
양호 5
 
< 0.1%
olivine 1
 
< 0.1%
반정 1
 
< 0.1%

레이어색상
Text

MISSING 

Distinct307
Distinct (%)6.5%
Missing5256
Missing (%)52.6%
Memory size156.2 KiB
2023-12-13T02:41:19.419238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.4030354
Min length2

Characters and Unicode

Total characters16144
Distinct characters86
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)3.5%

Sample

1st row회색-암갈색
2nd row암회색
3rd row암회색
4th row담홍색
5th row암회색
ValueCountFrequency (%)
암회색 1298
27.0%
담회색 570
11.8%
회색 398
 
8.3%
적갈색 369
 
7.7%
암갈색 237
 
4.9%
회갈색 145
 
3.0%
황갈색 116
 
2.4%
암적갈색 92
 
1.9%
적회색 88
 
1.8%
갈색 83
 
1.7%
Other values (285) 1420
29.5%
2023-12-13T02:41:19.840273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5216
32.3%
3559
22.0%
2284
14.1%
1451
 
9.0%
860
 
5.3%
782
 
4.8%
400
 
2.5%
- 366
 
2.3%
~ 161
 
1.0%
123
 
0.8%
Other values (76) 942
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15267
94.6%
Dash Punctuation 366
 
2.3%
Lowercase Letter 245
 
1.5%
Math Symbol 175
 
1.1%
Space Separator 72
 
0.4%
Other Punctuation 13
 
0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5216
34.2%
3559
23.3%
2284
15.0%
1451
 
9.5%
860
 
5.6%
782
 
5.1%
400
 
2.6%
123
 
0.8%
99
 
0.6%
97
 
0.6%
Other values (49) 396
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
r 39
15.9%
g 36
14.7%
a 36
14.7%
y 24
9.8%
l 17
6.9%
i 16
6.5%
d 15
 
6.1%
h 15
 
6.1%
t 12
 
4.9%
e 10
 
4.1%
Other values (7) 25
10.2%
Math Symbol
ValueCountFrequency (%)
~ 161
92.0%
11
 
6.3%
3
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
R 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 366
100.0%
Space Separator
ValueCountFrequency (%)
72
100.0%
Other Punctuation
ValueCountFrequency (%)
, 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15267
94.6%
Common 630
 
3.9%
Latin 247
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5216
34.2%
3559
23.3%
2284
15.0%
1451
 
9.5%
860
 
5.6%
782
 
5.1%
400
 
2.6%
123
 
0.8%
99
 
0.6%
97
 
0.6%
Other values (49) 396
 
2.6%
Latin
ValueCountFrequency (%)
r 39
15.8%
g 36
14.6%
a 36
14.6%
y 24
9.7%
l 17
6.9%
i 16
6.5%
d 15
 
6.1%
h 15
 
6.1%
t 12
 
4.9%
e 10
 
4.0%
Other values (9) 27
10.9%
Common
ValueCountFrequency (%)
- 366
58.1%
~ 161
25.6%
72
 
11.4%
, 13
 
2.1%
11
 
1.7%
3
 
0.5%
) 2
 
0.3%
( 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15267
94.6%
ASCII 863
 
5.3%
None 11
 
0.1%
Math Operators 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5216
34.2%
3559
23.3%
2284
15.0%
1451
 
9.5%
860
 
5.6%
782
 
5.1%
400
 
2.6%
123
 
0.8%
99
 
0.6%
97
 
0.6%
Other values (49) 396
 
2.6%
ASCII
ValueCountFrequency (%)
- 366
42.4%
~ 161
18.7%
72
 
8.3%
r 39
 
4.5%
g 36
 
4.2%
a 36
 
4.2%
y 24
 
2.8%
l 17
 
2.0%
i 16
 
1.9%
d 15
 
1.7%
Other values (15) 81
 
9.4%
None
ValueCountFrequency (%)
11
100.0%
Math Operators
ValueCountFrequency (%)
3
100.0%

풍화정도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9984 
조금풍화
 
7
매우 풍화
 
3
조금 풍화
 
2
조금신선
 
1
Other values (3)
 
3

Length

Max length5
Median length4
Mean length4.0003
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9984
99.8%
조금풍화 7
 
0.1%
매우 풍화 3
 
< 0.1%
조금 풍화 2
 
< 0.1%
조금신선 1
 
< 0.1%
매우 신선 1
 
< 0.1%
보통 1
 
< 0.1%
치밀질 1
 
< 0.1%

Length

2023-12-13T02:41:19.998796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:20.117606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9984
99.8%
조금풍화 7
 
0.1%
풍화 5
 
< 0.1%
매우 4
 
< 0.1%
조금 2
 
< 0.1%
조금신선 1
 
< 0.1%
신선 1
 
< 0.1%
보통 1
 
< 0.1%
치밀질 1
 
< 0.1%

Interactions

2023-12-13T02:41:13.351273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:12.698024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:13.011454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:13.447708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:12.808705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:13.139164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:13.553174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:12.911991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:13.256577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:41:20.212173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
깊이끝깊이단위두께이름암석코드암석한글3암석영문3그래픽정보밀도정도화학조성정도풍화정도
깊이1.0000.9240.0320.3200.3600.3600.3550.3620.1380.0000.587
끝깊이0.9241.0000.0300.2770.3440.3430.3410.3460.1550.0000.345
단위두께0.0320.0301.0000.0000.0340.0550.0340.0540.000NaN0.000
이름0.3200.2770.0001.0000.7390.7300.8100.7340.6350.3800.756
암석코드0.3600.3440.0340.7391.0001.0000.9991.0000.2590.6300.911
암석한글30.3600.3430.0550.7301.0001.0001.0000.9990.2400.6300.911
암석영문30.3550.3410.0340.8100.9991.0001.0001.0000.2590.6300.911
그래픽정보0.3620.3460.0540.7341.0000.9991.0001.0000.2590.6300.911
밀도정도0.1380.1550.0000.6350.2590.2400.2590.2591.0000.0001.000
화학조성정도0.0000.000NaN0.3800.6300.6300.6300.6300.0001.000NaN
풍화정도0.5870.3450.0000.7560.9110.9110.9110.9111.000NaN1.000
2023-12-13T02:41:20.329829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이름풍화정도그래픽정보화학조성정도암석영문3암석한글3암석코드밀도정도층계단위
이름1.0000.5400.2680.2870.3400.2690.2730.3311.000
풍화정도0.5401.0000.528NaN0.5280.5280.5281.000NaN
그래픽정보0.2680.5281.0000.4150.9840.9820.9830.0891.000
화학조성정도0.287NaN0.4151.0000.4150.4150.4150.000NaN
암석영문30.3400.5280.9840.4151.0000.9510.9850.0891.000
암석한글30.2690.5280.9820.4150.9511.0001.0000.0831.000
암석코드0.2730.5280.9830.4150.9851.0001.0000.0891.000
밀도정도0.3311.0000.0890.0000.0890.0830.0891.0001.000
층계단위1.000NaN1.000NaN1.0001.0001.0001.0001.000
2023-12-13T02:41:20.449860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
깊이끝깊이단위두께층계단위이름암석코드암석한글3암석영문3그래픽정보밀도정도화학조성정도풍화정도
깊이1.0000.980-0.0161.0000.1300.1340.1350.1330.1340.0670.0000.370
끝깊이0.9801.000-0.0081.0000.1300.1340.1340.1330.1340.0860.0000.254
단위두께-0.016-0.0081.0001.0000.0000.0130.0190.0120.0210.0001.0000.000
층계단위1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
이름0.1300.1300.0001.0001.0000.2730.2690.3400.2680.3310.2870.540
암석코드0.1340.1340.0131.0000.2731.0001.0000.9850.9830.0890.4150.528
암석한글30.1350.1340.0191.0000.2691.0001.0000.9510.9820.0830.4150.528
암석영문30.1330.1330.0121.0000.3400.9850.9511.0000.9840.0890.4150.528
그래픽정보0.1340.1340.0211.0000.2680.9830.9820.9841.0000.0890.4150.528
밀도정도0.0670.0860.0001.0000.3310.0890.0830.0890.0891.0000.0001.000
화학조성정도0.0000.0001.0000.0000.2870.4150.4150.4150.4150.0001.0000.000
풍화정도0.3700.2540.0000.0000.5400.5280.5280.5280.5281.0000.0001.000

Missing values

2023-12-13T02:41:13.709588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:41:13.919960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:41:14.143193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

포인트아이디깊이끝깊이단위두께층계단위이름암석코드암석한글1암석영문1암석한글3암석영문3그래픽정보단위비고밀도정도화학조성정도레이어색상풍화정도
10591F-02340.03.00.0<NA><NA>TOPSOIL표토층soil표토Topsoil표토 (TS)<NA>다공질<NA><NA><NA>
17261F-0485169.0180.00.0<NA><NA>AFB<NA>A-pFB휘석장석현무암AFB휘석장석현무암 (AFB)6개의 단위층으로 구성다공질<NA><NA><NA>
13320F-03487.522.00.0<NA>보통암APB비현정질현무암APB비반상현무암ApB비반상현무암 (APB)절리 다소치밀질<NA>회색-암갈색<NA>
14819F-040745.047.00.0<NA>보통암AFOB침상장석감람석현무암FOB침상장석감람석현무암aFOB침상장석감람석현무암 (AFOB)<NA>치밀질<NA>암회색<NA>
1140D-008467.069.00.0<NA><NA>PAFB반상휘석현무암-1PAB-1반상휘석장석현무암PAFB반상휘석장석현무암 (PAFB)절리대<NA><NA><NA><NA>
11551F-027034.038.00.0<NA>사력층CL화산쇄설층<NA>클링커CL클링커 (CL)<NA><NA><NA>암회색<NA>
1696D-0133111.0113.00.0<NA><NA>M점토층Clay머드M머드 (M)<NA><NA><NA><NA><NA>
19474F-0546171.0182.00.0<NA>보통암FB장석현무암FB장석현무암FB장석현무암 (FB)코아내부 적갈색, 기공무, 암석조밀<NA><NA><NA><NA>
8293F-012916.022.06.0<NA><NA>CL화산쇄설성퇴적암<NA>클링커CL클링커 (CL)역 혼재다공질<NA>담홍색<NA>
12436F-03073.011.00.0<NA>풍화암FB장석현무암FB장석현무암FB장석현무암 (FB)<NA><NA><NA><NA><NA>
포인트아이디깊이끝깊이단위두께층계단위이름암석코드암석한글1암석영문1암석한글3암석영문3그래픽정보단위비고밀도정도화학조성정도레이어색상풍화정도
16273F-0453101.0102.00.0<NA><NA>OB감람석현무암OB감람석현무암OB감람석현무암 (OB)<NA>치밀질<NA><NA><NA>
25041JR어음2154.0157.00.0<NA><NA>CL퇴적층<NA>클링커CL클링커 (CL)화산회토 퇴적층, 반고결, 실트질이 주를 이룸<NA><NA>암황색<NA>
19451F-054671.074.00.0<NA>연암FB장석현무암FB장석현무암FB장석현무암 (FB)기공크고, 넓게 분포<NA><NA><NA><NA>
3768D-024715.019.00.0<NA>사력CL<NA><NA>클링커CL클링커 (CL)10cm 내외 역다공질<NA><NA><NA>
2157D-017012.019.00.0<NA><NA>AFOB침상장석감람석현무암F.O.B침상장석감람석현무암aFOB침상장석감람석현무암 (AFOB)<NA><NA><NA><NA><NA>
30318회수지구1호공42.047.00.0<NA><NA>APB조면암질현무암TB비반상현무암ApB비반상현무암 (APB)최하부구간 파쇄대치밀질<NA>암회색<NA>
6518F-0044(N)6.07.00.0<NA>연암TUFF응회암Tuff응회암TUFF응회암 TF<NA><NA><NA>갈색<NA>
21319F-0594251.5253.50.0<NA><NA>OB감람석현무암OB감람석현무암OB감람석현무암 (OB)치밀견고 간혹 대형기공<NA><NA>회색<NA>
19472F-0546161.0165.00.0<NA>사력층CL스코리아<NA>클링커CL클링커 (CL)기공이 작고, 넓게 분포, 코아회수율 저조<NA><NA>연적색<NA>
5229D-03095.010.00.0<NA>연암<NA><NA><NA><NA><NA><NA><NA>다공질<NA>암갈색<NA>