Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory859.4 KiB
Average record size in memory88.0 B

Variable types

Categorical5
Text5

Dataset

Description창원시 관내에 관리되고 등록되어있는 개, 고양이의 동물품종, 동물연령, 해당동물의 주소, 중성화 여부, 소유 상태 등을 상세하게 나타낸 데이터입니다.
URLhttps://www.data.go.kr/data/15114483/fileData.do

Alerts

Dataset has 2 (< 0.1%) duplicate rowsDuplicates
전자태그(RFID) 종류 is highly overall correlated with 축종 and 3 other fieldsHigh correlation
축종 is highly overall correlated with 전자태그(RFID) 종류 and 3 other fieldsHigh correlation
성별 is highly overall correlated with 전자태그(RFID) 종류 and 3 other fieldsHigh correlation
중성화여부 is highly overall correlated with 전자태그(RFID) 종류 and 3 other fieldsHigh correlation
소유정보 is highly overall correlated with 전자태그(RFID) 종류 and 3 other fieldsHigh correlation
축종 is highly imbalanced (96.8%)Imbalance
소유정보 is highly imbalanced (83.6%)Imbalance

Reproduction

Analysis started2023-12-12 13:33:37.636658
Analysis finished2023-12-12 13:33:39.060007
Duration1.42 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

전자태그(RFID) 종류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
삽입형
5021 
인식표
2582 
외장형
2396 
RFID종류
 
1

Length

Max length6
Median length3
Mean length3.0003
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row인식표
2nd row삽입형
3rd row인식표
4th row외장형
5th row외장형

Common Values

ValueCountFrequency (%)
삽입형 5021
50.2%
인식표 2582
25.8%
외장형 2396
24.0%
RFID종류 1
 
< 0.1%

Length

2023-12-12T22:33:39.124959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:33:39.220078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
삽입형 5021
50.2%
인식표 2582
25.8%
외장형 2396
24.0%
rfid종류 1
 
< 0.1%
Distinct3268
Distinct (%)32.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:33:39.604098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length2
Mean length2.195
Min length1

Characters and Unicode

Total characters21950
Distinct characters711
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2260 ?
Unique (%)22.6%

Sample

1st row대박
2nd row루피
3rd row토니
4th row빅토리
5th row똘이
ValueCountFrequency (%)
보리 167
 
1.7%
콩이 134
 
1.3%
코코 126
 
1.3%
초코 109
 
1.1%
사랑이 95
 
0.9%
똘이 91
 
0.9%
몽이 88
 
0.9%
까미 85
 
0.8%
별이 83
 
0.8%
두부 78
 
0.8%
Other values (3147) 8946
89.4%
2023-12-12T22:33:40.142022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2901
 
13.2%
793
 
3.6%
453
 
2.1%
453
 
2.1%
371
 
1.7%
337
 
1.5%
316
 
1.4%
313
 
1.4%
294
 
1.3%
287
 
1.3%
Other values (701) 15432
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21651
98.6%
Space Separator 190
 
0.9%
Lowercase Letter 62
 
0.3%
Uppercase Letter 29
 
0.1%
Decimal Number 16
 
0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2901
 
13.4%
793
 
3.7%
453
 
2.1%
453
 
2.1%
371
 
1.7%
337
 
1.6%
316
 
1.5%
313
 
1.4%
294
 
1.4%
287
 
1.3%
Other values (655) 15133
69.9%
Lowercase Letter
ValueCountFrequency (%)
e 9
14.5%
o 7
11.3%
a 7
11.3%
h 5
 
8.1%
l 4
 
6.5%
n 4
 
6.5%
c 3
 
4.8%
i 3
 
4.8%
u 3
 
4.8%
p 2
 
3.2%
Other values (10) 15
24.2%
Uppercase Letter
ValueCountFrequency (%)
B 4
13.8%
L 3
 
10.3%
M 2
 
6.9%
H 2
 
6.9%
E 2
 
6.9%
T 2
 
6.9%
R 2
 
6.9%
O 2
 
6.9%
J 1
 
3.4%
P 1
 
3.4%
Other values (8) 8
27.6%
Decimal Number
ValueCountFrequency (%)
2 7
43.8%
1 5
31.2%
3 1
 
6.2%
0 1
 
6.2%
9 1
 
6.2%
5 1
 
6.2%
Space Separator
ValueCountFrequency (%)
190
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21651
98.6%
Common 208
 
0.9%
Latin 91
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2901
 
13.4%
793
 
3.7%
453
 
2.1%
453
 
2.1%
371
 
1.7%
337
 
1.6%
316
 
1.5%
313
 
1.4%
294
 
1.4%
287
 
1.3%
Other values (655) 15133
69.9%
Latin
ValueCountFrequency (%)
e 9
 
9.9%
o 7
 
7.7%
a 7
 
7.7%
h 5
 
5.5%
l 4
 
4.4%
n 4
 
4.4%
B 4
 
4.4%
L 3
 
3.3%
c 3
 
3.3%
i 3
 
3.3%
Other values (28) 42
46.2%
Common
ValueCountFrequency (%)
190
91.3%
2 7
 
3.4%
1 5
 
2.4%
. 2
 
1.0%
3 1
 
0.5%
0 1
 
0.5%
9 1
 
0.5%
5 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21650
98.6%
ASCII 299
 
1.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2901
 
13.4%
793
 
3.7%
453
 
2.1%
453
 
2.1%
371
 
1.7%
337
 
1.6%
316
 
1.5%
313
 
1.4%
294
 
1.4%
287
 
1.3%
Other values (654) 15132
69.9%
ASCII
ValueCountFrequency (%)
190
63.5%
e 9
 
3.0%
o 7
 
2.3%
2 7
 
2.3%
a 7
 
2.3%
h 5
 
1.7%
1 5
 
1.7%
l 4
 
1.3%
n 4
 
1.3%
B 4
 
1.3%
Other values (36) 57
 
19.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

축종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
9943 
고양이
 
56
축종
 
1

Length

Max length3
Median length1
Mean length1.0113
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
9943
99.4%
고양이 56
 
0.6%
축종 1
 
< 0.1%

Length

2023-12-12T22:33:40.302909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:33:40.708170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9943
99.4%
고양이 56
 
0.6%
축종 1
 
< 0.1%

품종
Text

Distinct114
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:33:40.986356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length3.7341
Min length1

Characters and Unicode

Total characters37341
Distinct characters174
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)0.3%

Sample

1st row말티즈
2nd row비글
3rd row푸들
4th row진도견
5th row푸들
ValueCountFrequency (%)
말티즈 1954
16.3%
믹스견 1934
16.1%
푸들 1533
12.8%
포메라니안 836
 
7.0%
테리어 514
 
4.3%
시츄 471
 
3.9%
요크셔 443
 
3.7%
진도견 422
 
3.5%
프리제 419
 
3.5%
비숑 419
 
3.5%
Other values (125) 3039
25.4%
2023-12-12T22:33:41.513901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2420
 
6.5%
2418
 
6.5%
2045
 
5.5%
1984
 
5.3%
1969
 
5.3%
1965
 
5.3%
1938
 
5.2%
1553
 
4.2%
1537
 
4.1%
1380
 
3.7%
Other values (164) 18132
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35356
94.7%
Space Separator 1984
 
5.3%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2420
 
6.8%
2418
 
6.8%
2045
 
5.8%
1969
 
5.6%
1965
 
5.6%
1938
 
5.5%
1553
 
4.4%
1537
 
4.3%
1380
 
3.9%
1222
 
3.5%
Other values (162) 16909
47.8%
Space Separator
ValueCountFrequency (%)
1984
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35356
94.7%
Common 1985
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2420
 
6.8%
2418
 
6.8%
2045
 
5.8%
1969
 
5.6%
1965
 
5.6%
1938
 
5.5%
1553
 
4.4%
1537
 
4.3%
1380
 
3.9%
1222
 
3.5%
Other values (162) 16909
47.8%
Common
ValueCountFrequency (%)
1984
99.9%
- 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35356
94.7%
ASCII 1985
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2420
 
6.8%
2418
 
6.8%
2045
 
5.8%
1969
 
5.6%
1965
 
5.6%
1938
 
5.5%
1553
 
4.4%
1537
 
4.3%
1380
 
3.9%
1222
 
3.5%
Other values (162) 16909
47.8%
ASCII
ValueCountFrequency (%)
1984
99.9%
- 1
 
0.1%
Distinct82
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:33:41.758565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length2
Mean length2.8916
Min length2

Characters and Unicode

Total characters28916
Distinct characters105
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)0.1%

Sample

1st row흰색
2nd row얼룩무늬
3rd row갈색
4th row흰색
5th row갈색
ValueCountFrequency (%)
흰색 3872
38.4%
갈색 1655
16.4%
갈색&흰색 659
 
6.5%
검정색 627
 
6.2%
기타 454
 
4.5%
크림색 443
 
4.4%
검정&흰색 354
 
3.5%
황색 235
 
2.3%
갈검색 184
 
1.8%
회색 163
 
1.6%
Other values (77) 1426
 
14.2%
2023-12-12T22:33:42.205545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9848
34.1%
5026
17.4%
2856
 
9.9%
& 1599
 
5.5%
1418
 
4.9%
1125
 
3.9%
475
 
1.6%
454
 
1.6%
454
 
1.6%
443
 
1.5%
Other values (95) 5218
18.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24243
83.8%
Uppercase Letter 2988
 
10.3%
Other Punctuation 1613
 
5.6%
Space Separator 72
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9848
40.6%
5026
20.7%
2856
 
11.8%
1418
 
5.8%
1125
 
4.6%
475
 
2.0%
454
 
1.9%
454
 
1.9%
443
 
1.8%
443
 
1.8%
Other values (69) 1701
 
7.0%
Uppercase Letter
ValueCountFrequency (%)
T 329
11.0%
E 294
9.8%
L 274
9.2%
I 268
 
9.0%
O 248
 
8.3%
A 212
 
7.1%
W 202
 
6.8%
H 182
 
6.1%
G 171
 
5.7%
D 138
 
4.6%
Other values (12) 670
22.4%
Other Punctuation
ValueCountFrequency (%)
& 1599
99.1%
/ 10
 
0.6%
· 4
 
0.2%
Space Separator
ValueCountFrequency (%)
72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24243
83.8%
Latin 2988
 
10.3%
Common 1685
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9848
40.6%
5026
20.7%
2856
 
11.8%
1418
 
5.8%
1125
 
4.6%
475
 
2.0%
454
 
1.9%
454
 
1.9%
443
 
1.8%
443
 
1.8%
Other values (69) 1701
 
7.0%
Latin
ValueCountFrequency (%)
T 329
11.0%
E 294
9.8%
L 274
9.2%
I 268
 
9.0%
O 248
 
8.3%
A 212
 
7.1%
W 202
 
6.8%
H 182
 
6.1%
G 171
 
5.7%
D 138
 
4.6%
Other values (12) 670
22.4%
Common
ValueCountFrequency (%)
& 1599
94.9%
72
 
4.3%
/ 10
 
0.6%
· 4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24243
83.8%
ASCII 4669
 
16.1%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9848
40.6%
5026
20.7%
2856
 
11.8%
1418
 
5.8%
1125
 
4.6%
475
 
2.0%
454
 
1.9%
454
 
1.9%
443
 
1.8%
443
 
1.8%
Other values (69) 1701
 
7.0%
ASCII
ValueCountFrequency (%)
& 1599
34.2%
T 329
 
7.0%
E 294
 
6.3%
L 274
 
5.9%
I 268
 
5.7%
O 248
 
5.3%
A 212
 
4.5%
W 202
 
4.3%
H 182
 
3.9%
G 171
 
3.7%
Other values (15) 890
19.1%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct4609
Distinct (%)46.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:33:42.606307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9994
Min length4

Characters and Unicode

Total characters99994
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2318 ?
Unique (%)23.2%

Sample

1st row2005-09-11
2nd row2013-06-23
3rd row2012-02-06
4th row2013-09-01
5th row2010-06-12
ValueCountFrequency (%)
2010-01-01 28
 
0.3%
2013-01-01 27
 
0.3%
2011-01-01 27
 
0.3%
2015-01-01 27
 
0.3%
2010-05-01 25
 
0.2%
2012-05-01 22
 
0.2%
2012-01-01 22
 
0.2%
2017-01-01 21
 
0.2%
2014-01-01 21
 
0.2%
2018-01-01 21
 
0.2%
Other values (4599) 9759
97.6%
2023-12-12T22:33:43.189008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 27368
27.4%
- 19998
20.0%
2 17680
17.7%
1 17082
17.1%
5 3298
 
3.3%
3 2840
 
2.8%
6 2506
 
2.5%
7 2464
 
2.5%
8 2283
 
2.3%
9 2277
 
2.3%
Other values (5) 2198
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79992
80.0%
Dash Punctuation 19998
 
20.0%
Other Letter 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 27368
34.2%
2 17680
22.1%
1 17082
21.4%
5 3298
 
4.1%
3 2840
 
3.6%
6 2506
 
3.1%
7 2464
 
3.1%
8 2283
 
2.9%
9 2277
 
2.8%
4 2194
 
2.7%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 19998
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99990
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 27368
27.4%
- 19998
20.0%
2 17680
17.7%
1 17082
17.1%
5 3298
 
3.3%
3 2840
 
2.8%
6 2506
 
2.5%
7 2464
 
2.5%
8 2283
 
2.3%
9 2277
 
2.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99990
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 27368
27.4%
- 19998
20.0%
2 17680
17.7%
1 17082
17.1%
5 3298
 
3.3%
3 2840
 
2.8%
6 2506
 
2.5%
7 2464
 
2.5%
8 2283
 
2.3%
9 2277
 
2.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수컷
5098 
암컷
4901 
성별
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row수컷
2nd row수컷
3rd row수컷
4th row암컷
5th row수컷

Common Values

ValueCountFrequency (%)
수컷 5098
51.0%
암컷 4901
49.0%
성별 1
 
< 0.1%

Length

2023-12-12T22:33:43.351839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:33:43.466436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수컷 5098
51.0%
암컷 4901
49.0%
성별 1
 
< 0.1%

중성화여부
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미중성
5191 
중성
4808 
중성화여부
 
1

Length

Max length5
Median length3
Mean length2.5194
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row중성
2nd row미중성
3rd row중성
4th row미중성
5th row중성

Common Values

ValueCountFrequency (%)
미중성 5191
51.9%
중성 4808
48.1%
중성화여부 1
 
< 0.1%

Length

2023-12-12T22:33:43.596643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:33:43.727373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미중성 5191
51.9%
중성 4808
48.1%
중성화여부 1
 
< 0.1%

소유정보
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
소유
9251 
사망
 
604
<NA>
 
77
분실
 
40
칩훼손·분실
 
26
Other values (2)
 
2

Length

Max length6
Median length2
Mean length2.0262
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row소유
2nd row소유
3rd row소유
4th row소유
5th row소유

Common Values

ValueCountFrequency (%)
소유 9251
92.5%
사망 604
 
6.0%
<NA> 77
 
0.8%
분실 40
 
0.4%
칩훼손·분실 26
 
0.3%
소유정보 1
 
< 0.1%
다시찾음 1
 
< 0.1%

Length

2023-12-12T22:33:43.854846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:33:44.020649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소유 9251
92.5%
사망 604
 
6.0%
na 77
 
0.8%
분실 40
 
0.4%
칩훼손·분실 26
 
0.3%
소유정보 1
 
< 0.1%
다시찾음 1
 
< 0.1%

주소
Text

Distinct91
Distinct (%)0.9%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T22:33:44.247144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters49995
Distinct characters89
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)0.3%

Sample

1st row 마산합포
2nd row 진해구
3rd row 성산구
4th row 의창구
5th row 마산회원
ValueCountFrequency (%)
의창구 2394
23.2%
진해구 1973
19.1%
성산구 1918
18.6%
마산합포 1689
16.4%
마산회원 1564
15.2%
산구 107
 
1.0%
해구 99
 
1.0%
창구 97
 
0.9%
산합포구 89
 
0.9%
산회원구 67
 
0.7%
Other values (83) 307
 
3.0%
2023-12-12T22:33:44.607760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16284
32.6%
6745
13.5%
5446
 
10.9%
3259
 
6.5%
2509
 
5.0%
2401
 
4.8%
2083
 
4.2%
1982
 
4.0%
1921
 
3.8%
1778
 
3.6%
Other values (79) 5587
 
11.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33709
67.4%
Space Separator 16284
32.6%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6745
20.0%
5446
16.2%
3259
9.7%
2509
 
7.4%
2401
 
7.1%
2083
 
6.2%
1982
 
5.9%
1921
 
5.7%
1778
 
5.3%
1778
 
5.3%
Other values (76) 3807
11.3%
Decimal Number
ValueCountFrequency (%)
4 1
50.0%
3 1
50.0%
Space Separator
ValueCountFrequency (%)
16284
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33709
67.4%
Common 16286
32.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6745
20.0%
5446
16.2%
3259
9.7%
2509
 
7.4%
2401
 
7.1%
2083
 
6.2%
1982
 
5.9%
1921
 
5.7%
1778
 
5.3%
1778
 
5.3%
Other values (76) 3807
11.3%
Common
ValueCountFrequency (%)
16284
> 99.9%
4 1
 
< 0.1%
3 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33709
67.4%
ASCII 16286
32.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16284
> 99.9%
4 1
 
< 0.1%
3 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
6745
20.0%
5446
16.2%
3259
9.7%
2509
 
7.4%
2401
 
7.1%
2083
 
6.2%
1982
 
5.9%
1921
 
5.7%
1778
 
5.3%
1778
 
5.3%
Other values (76) 3807
11.3%

Correlations

2023-12-12T22:33:44.713004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자태그(RFID) 종류축종털색깔성별중성화여부소유정보주소
전자태그(RFID) 종류1.0000.6780.8410.6760.6870.7430.440
축종0.6781.0000.9630.9430.9430.9410.000
털색깔0.8410.9631.0000.8980.9010.7710.000
성별0.6760.9430.8981.0000.9510.9410.038
중성화여부0.6870.9430.9010.9511.0000.9420.140
소유정보0.7430.9410.7710.9410.9421.0000.000
주소0.4400.0000.0000.0380.1400.0001.000
2023-12-12T22:33:44.836249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자태그(RFID) 종류성별소유정보축종중성화여부
전자태그(RFID) 종류1.0000.7070.5790.7090.721
성별0.7071.0000.7070.7070.729
소유정보0.5790.7071.0000.7070.707
축종0.7090.7070.7071.0000.709
중성화여부0.7210.7290.7070.7091.000
2023-12-12T22:33:44.951734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자태그(RFID) 종류축종성별중성화여부소유정보
전자태그(RFID) 종류1.0000.7090.7070.7210.579
축종0.7091.0000.7070.7090.707
성별0.7070.7071.0000.7290.707
중성화여부0.7210.7090.7291.0000.707
소유정보0.5790.7070.7070.7071.000

Missing values

2023-12-12T22:33:38.799992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:33:38.978781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

전자태그(RFID) 종류동물이름축종품종털색깔생년월일성별중성화여부소유정보주소
35712인식표대박말티즈흰색2005-09-11수컷중성소유마산합포
48635삽입형루피비글얼룩무늬2013-06-23수컷미중성소유진해구
20455인식표토니푸들갈색2012-02-06수컷중성소유성산구
16577외장형빅토리진도견흰색2013-09-01암컷미중성소유의창구
37717외장형똘이푸들갈색2010-06-12수컷중성소유마산회원
41327외장형뚱이푸들흰색2013-01-11수컷미중성소유성산구
39169삽입형뽀미포메라니안갈색2010-06-23암컷미중성소유마산합포
44491삽입형도도믹스견검정색2016-12-17암컷미중성소유진해구
8718인식표쪼이닥스훈트검정색2016-03-15암컷미중성소유의창구
16807외장형이월벨지안 셰퍼드 독검정색2014-02-10수컷미중성소유의창구
전자태그(RFID) 종류동물이름축종품종털색깔생년월일성별중성화여부소유정보주소
94삽입형박봄시츄갈색&흰색2021-11-12암컷미중성소유의창구
40982삽입형또롱이슈나우져회색2008-05-06수컷중성소유마산합포
39314외장형단지믹스견얼룩무늬2009-06-05수컷중성소유마산회원
48901인식표박새벽말티즈흰색2013-08-09수컷중성소유진해구
42747삽입형감자포메라니안크림색2021-03-02암컷중성소유진해구
19920인식표깜디토이 푸들검정색2010-12-01암컷미중성소유의창구
36452삽입형레오닥스훈트검정색2012-07-01수컷중성사망마산회원
26387외장형콩이믹스견크림색2021-09-11암컷미중성소유산회원구
23759외장형뭉실이닥스훈트검정색2011-08-07암컷중성소유의창구
60삽입형오복이비숑 프리제흰색2023-03-14수컷미중성소유성산구

Duplicate rows

Most frequently occurring

전자태그(RFID) 종류동물이름축종품종털색깔생년월일성별중성화여부소유정보주소# duplicates
0삽입형콩이말티즈흰색2016-01-01수컷중성소유성산구2
1외장형다복이푸들갈색2013-04-19수컷중성소유성산구2