Overview

Dataset statistics

Number of variables: 1
Number of observations: 3072
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 24.1 KiB
Average record size in memory: 8.0 B

Variable types

Categorical: 1

Dataset

Description: Point traffic volumes on arterial roads in the Daegu area, 2011
Author: Daegu Metropolitan City
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=3074818&dataSetDetailId=30748181fcf9387134af&provdMethod=FILE

Alerts

Duplicates: Dataset has 1 (< 0.1%) duplicate row
Imbalance: 2a가3나 is highly imbalanced (99.6%)
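
The 99.6% imbalance figure can be checked by hand. The sketch below assumes the score is one minus the Shannon entropy of the value counts, normalized by the log of the number of classes. That definition is an assumption about how the profiler derives the number, not something stated in this report, though it does reproduce the reported value from the counts listed under Common Values (3071 and 1). The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class counts and k is the number of classes.

    Assumed definition: 0.0 means perfectly balanced, values near 1.0
    mean one class dominates.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts taken from the Common Values table: "51" x 3071, header-like value x 1.
score = imbalance_score([3071, 1])
print(f"{score:.1%}")
```

With two classes the normalizer `log2(2)` is 1, so the score is simply one minus the entropy in bits.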

Reproduction

Analysis started: 2024-04-18 03:11:39.405892
Analysis finished: 2024-04-18 03:11:40.855377
Duration: 1.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

2a가3나
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 24.1 KiB

Value            Count
51               3071
지점교통량DB_ID  1

Length

Max length: 10
Median length: 2
Mean length: 2.0026042
Min length: 2
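
These length statistics follow directly from the two distinct values: one 10-character string and 3071 two-character strings. As a rough check, the sketch below rebuilds the column from the counts reported in this section (the list `values` is a reconstruction for illustration, not the original file) and recomputes the four figures.

```python
from statistics import mean, median

# Reconstructed column: one header-like string followed by 3071 copies of "51".
values = ["지점교통량DB_ID"] + ["51"] * 3071

# Character length of each value (Python counts Unicode code points).
lengths = [len(v) for v in values]

max_len = max(lengths)
median_len = median(lengths)
mean_len = round(mean(lengths), 7)  # (3071 * 2 + 10) / 3072
min_len = min(lengths)

print(max_len, median_len, mean_len, min_len)
```

The mean sits only slightly above 2 because a single long value is averaged over 3072 rows.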

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 지점교통량DB_ID
2nd row: 51
3rd row: 51
4th row: 51
5th row: 51

Common Values

Value            Count  Frequency (%)
51               3071   > 99.9%
지점교통량DB_ID  1      < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: value counts for 2a가3나. 51: 3071 (> 99.9%); 지점교통량DB_ID: 1 (< 0.1%)]

Missing values

The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows

Row   2a가3나
0     지점교통량DB_ID
1     51
2     51
3     51
4     51
5     51
6     51
7     51
8     51
9     51

Last rows

Row   2a가3나
3062  51
3063  51
3064  51
3065  51
3066  51
3067  51
3068  51
3069  51
3070  51
3071  51

Duplicate rows

Most frequently occurring

Row  2a가3나  # duplicates
0    51       3071
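
The two duplicate figures in this report measure different things: the overview counts 1 duplicate row (one distinct row that repeats), while the table above says that row occurs 3071 times. A minimal sketch of that distinction, using a column rebuilt from the reported counts (the list `rows` is a reconstruction for illustration, not the original file):

```python
from collections import Counter

# Reconstructed single-column dataset: one header-like value plus 3071 copies of "51".
rows = ["지점교통량DB_ID"] + ["51"] * 3071

# Count how often each distinct row occurs, then keep those seen more than once.
occurrences = Counter(rows)
duplicated = {value: n for value, n in occurrences.items() if n > 1}

distinct_duplicate_rows = len(duplicated)             # distinct rows that repeat
share_of_rows = distinct_duplicate_rows / len(rows)   # basis for the "< 0.1%" figure

print(duplicated, distinct_duplicate_rows, f"{share_of_rows:.3%}")
```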