Overview

Dataset statistics

Number of variables3
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory27.9 B

Variable types

Categorical1
Text1
Numeric1

Dataset

Description영상 데이터 인식 기술을 개발하기 위한 과학기술정보통신부 소프트웨어 분야 R&D 과제인 딥뷰(DeepView) 과제에서는 영상 데이터 이해 및 예측을 위한 플랫폼을 개발하고 있습니다. 딥뷰 과제를 수행하면서 구축한 약 20만장(기존 약 10만장) 정도의 객체 검출용 이미지 학습데이터를 배포하여, 유사 분야 연구에 도움이 되고자 합니다.
Author한국전자통신연구원
URLhttps://www.data.go.kr/data/15100681/fileData.do

Alerts

세부 카테고리 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:03:12.563540
Analysis finished2023-12-12 18:03:12.889068
Duration0.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct5
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size492.0 B
인공물
21 
동물
12 
사람
식물
자연 구조물

Length

Max length6
Median length3
Mean length2.7333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사람
2nd row사람
3rd row사람
4th row사람
5th row사람

Common Values

ValueCountFrequency (%)
인공물 21
46.7%
동물 12
26.7%
사람 6
 
13.3%
식물 3
 
6.7%
자연 구조물 3
 
6.7%

Length

2023-12-13T03:03:12.953251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:03:13.066584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인공물 21
43.8%
동물 12
25.0%
사람 6
 
12.5%
식물 3
 
6.2%
자연 3
 
6.2%
구조물 3
 
6.2%

세부 카테고리
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T03:03:13.227262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length11.066667
Min length1

Characters and Unicode

Total characters498
Distinct characters57
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row사람
2nd row사람 2
3rd row사람 3
4th row얼굴
5th row얼굴 2
ValueCountFrequency (%)
3 15
 
9.4%
2 15
 
9.4%
건설 9
 
5.7%
구조 6
 
3.8%
6
 
3.8%
장난감 3
 
1.9%
자동차 3
 
1.9%
지형 3
 
1.9%
무기 3
 
1.9%
3
 
1.9%
Other values (31) 93
58.5%
2023-12-13T03:03:13.808126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
114
22.9%
, 63
 
12.7%
21
 
4.2%
18
 
3.6%
18
 
3.6%
2 15
 
3.0%
3 15
 
3.0%
15
 
3.0%
12
 
2.4%
12
 
2.4%
Other values (47) 195
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 285
57.2%
Space Separator 114
 
22.9%
Other Punctuation 63
 
12.7%
Decimal Number 30
 
6.0%
Dash Punctuation 6
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
7.4%
18
 
6.3%
18
 
6.3%
15
 
5.3%
12
 
4.2%
12
 
4.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
Other values (42) 153
53.7%
Decimal Number
ValueCountFrequency (%)
2 15
50.0%
3 15
50.0%
Space Separator
ValueCountFrequency (%)
114
100.0%
Other Punctuation
ValueCountFrequency (%)
, 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 285
57.2%
Common 213
42.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
7.4%
18
 
6.3%
18
 
6.3%
15
 
5.3%
12
 
4.2%
12
 
4.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
Other values (42) 153
53.7%
Common
ValueCountFrequency (%)
114
53.5%
, 63
29.6%
2 15
 
7.0%
3 15
 
7.0%
- 6
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 285
57.2%
ASCII 213
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114
53.5%
, 63
29.6%
2 15
 
7.0%
3 15
 
7.0%
- 6
 
2.8%
Hangul
ValueCountFrequency (%)
21
 
7.4%
18
 
6.3%
18
 
6.3%
15
 
5.3%
12
 
4.2%
12
 
4.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
9
 
3.2%
Other values (42) 153
53.7%

이미지 수
Real number (ℝ)

Distinct44
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9656.4667
Minimum301
Maximum25261
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T03:03:13.942935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum301
5-th percentile1722.4
Q15563
median8067
Q312660
95-th percentile19226.6
Maximum25261
Range24960
Interquartile range (IQR)7097

Descriptive statistics

Standard deviation5950.0538
Coefficient of variation (CV)0.61617298
Kurtosis0.022897259
Mean9656.4667
Median Absolute Deviation (MAD)3447
Skewness0.76441935
Sum434541
Variance35403140
MonotonicityNot monotonic
2023-12-13T03:03:14.062724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
16817 2
 
4.4%
18646 1
 
2.2%
6038 1
 
2.2%
8534 1
 
2.2%
23314 1
 
2.2%
12203 1
 
2.2%
11095 1
 
2.2%
15716 1
 
2.2%
4620 1
 
2.2%
3645 1
 
2.2%
Other values (34) 34
75.6%
ValueCountFrequency (%)
301 1
2.2%
548 1
2.2%
1322 1
2.2%
3324 1
2.2%
3645 1
2.2%
4443 1
2.2%
4502 1
2.2%
4620 1
2.2%
4653 1
2.2%
4996 1
2.2%
ValueCountFrequency (%)
25261 1
2.2%
23314 1
2.2%
19259 1
2.2%
19097 1
2.2%
18646 1
2.2%
17249 1
2.2%
16817 2
4.4%
15716 1
2.2%
15180 1
2.2%
14070 1
2.2%

Interactions

2023-12-13T03:03:12.650888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:03:14.143754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상위 카테고리세부 카테고리이미지 수
상위 카테고리1.0001.0000.000
세부 카테고리1.0001.0001.000
이미지 수0.0001.0001.000
2023-12-13T03:03:14.225653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
이미지 수상위 카테고리
이미지 수1.0000.000
상위 카테고리0.0001.000

Missing values

2023-12-13T03:03:12.750547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:03:12.857831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상위 카테고리세부 카테고리이미지 수
0사람사람18646
1사람사람 210069
2사람사람 38928
3사람얼굴5563
4사람얼굴 2301
5사람얼굴 3548
6동물17249
7동물새 26750
8동물새 38256
9동물양서류, 파충류, 절지동물, 무척추동물19259
상위 카테고리세부 카테고리이미지 수
35인공물자동차 류 37911
36인공물도구, 기계, 장비, 기구14070
37인공물도구, 기계, 장비, 기구 24996
38인공물도구, 기계, 장비, 기구 35358
39인공물가구, 가전, 악기, 장난감, 무기15180
40인공물가구, 가전, 악기, 장난감, 무기 27451
41인공물가구, 가전, 악기, 장난감, 무기 35937
42자연 구조물지형, 자연구조물5916
43자연 구조물지형, 자연구조물 21322
44자연 구조물지형, 자연구조물 33324