Overview

Dataset statistics

Number of variables1
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory156.2 KiB
Average record size in memory16.0 B

Variable types

Categorical1

Dataset

Description2011년 대구 지역 교량 지점교통량
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3074818&dataSetDetailId=30748181cf035c0f6ea6&provdMethod=FILE

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
2a가3나 is highly imbalanced (99.9%)Imbalance

Reproduction

Analysis started2024-04-19 05:19:00.511689
Analysis finished2024-04-19 05:19:00.623221
Duration0.11 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2a가3나
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
50
9999 
지점교통량DB_ID
 
1

Length

Max length10
Median length2
Mean length2.0008
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row50
2nd row50
3rd row50
4th row50
5th row50

Common Values

ValueCountFrequency (%)
50 9999
> 99.9%
지점교통량DB_ID 1
 
< 0.1%

Length

2024-04-19T14:19:00.686132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:19:00.771250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50 9999
> 99.9%
지점교통량db_id 1
 
< 0.1%

Missing values

2024-04-19T14:19:00.556295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:19:00.602490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

2a가3나
1512950
544850
1820950
1845250
597750
69850
766750
2577250
2728850
784450
2a가3나
542850
397150
2160750
857250
1524250
2220850
2294850
1993050
454450
2671450

Duplicate rows

Most frequently occurring

2a가3나# duplicates
0509999