Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_161

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
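The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: the function names, the one-letter input format, and the choice to count overlapping pairs without crossing protein boundaries are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein and
    never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the standard 20 to X
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"    # would be marked red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKSSLD", "SSDA"])  # toy sequences, not real data
    print(freqs["SS"], classify(freqs["SS"]))
```

Comparing against a single uniform value is the simplest reading of the text; a stricter analysis would compare each dipeptide with the product of its two single amino acid frequencies.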

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.702 ± 0.473
AlaAsp: 4.916 ± 1.639
AlaGlu: 2.107 ± 0.843
AlaPhe: 3.511 ± 1.668
AlaGly: 1.404 ± 1.306
AlaHis: 0.702 ± 0.789
AlaIle: 2.809 ± 1.931
AlaLys: 2.809 ± 1.372
AlaLeu: 4.916 ± 1.874
AlaMet: 0.702 ± 0.653
AlaAsn: 0.702 ± 0.746
AlaPro: 1.404 ± 0.946
AlaGln: 3.511 ± 1.205
AlaArg: 3.511 ± 1.18
AlaSer: 8.427 ± 1.914
AlaThr: 3.511 ± 1.618
AlaVal: 1.404 ± 1.011
AlaTrp: 0.702 ± 0.473
AlaTyr: 2.809 ± 0.783
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.809 ± 1.099
CysGlu: 0.702 ± 0.651
CysPhe: 0.702 ± 0.789
CysGly: 0.702 ± 0.651
CysHis: 0.0 ± 0.0
CysIle: 1.404 ± 1.302
CysLys: 2.107 ± 1.109
CysLeu: 1.404 ± 0.55
CysMet: 2.107 ± 0.843
CysAsn: 0.0 ± 0.0
CysPro: 0.702 ± 0.789
CysGln: 0.0 ± 0.0
CysArg: 0.702 ± 0.651
CysSer: 2.809 ± 1.099
CysThr: 0.0 ± 0.0
CysVal: 1.404 ± 1.039
CysTrp: 0.0 ± 0.0
CysTyr: 0.702 ± 0.651
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.809 ± 1.931
AspCys: 1.404 ± 1.302
AspAsp: 7.725 ± 3.149
AspGlu: 4.916 ± 1.596
AspPhe: 3.511 ± 2.273
AspGly: 0.702 ± 0.651
AspHis: 1.404 ± 0.55
AspIle: 6.32 ± 1.275
AspLys: 5.618 ± 2.754
AspLeu: 4.213 ± 0.552
AspMet: 3.511 ± 1.489
AspAsn: 2.809 ± 1.022
AspPro: 2.107 ± 1.418
AspGln: 3.511 ± 1.641
AspArg: 2.107 ± 0.804
AspSer: 6.32 ± 0.821
AspThr: 2.809 ± 1.279
AspVal: 5.618 ± 1.06
AspTrp: 2.107 ± 1.109
AspTyr: 5.618 ± 1.004
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.511 ± 1.651
GluCys: 1.404 ± 0.55
GluAsp: 2.107 ± 0.804
GluGlu: 0.0 ± 0.0
GluPhe: 2.809 ± 1.234
GluGly: 0.702 ± 0.653
GluHis: 1.404 ± 0.55
GluIle: 2.809 ± 1.322
GluLys: 5.618 ± 2.548
GluLeu: 5.618 ± 2.647
GluMet: 1.404 ± 0.587
GluAsn: 4.916 ± 0.838
GluPro: 0.702 ± 0.473
GluGln: 4.213 ± 1.596
GluArg: 2.107 ± 1.148
GluSer: 4.213 ± 1.169
GluThr: 2.107 ± 0.84
GluVal: 4.916 ± 1.564
GluTrp: 1.404 ± 0.946
GluTyr: 4.213 ± 2.217
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.809 ± 0.783
PheCys: 1.404 ± 0.55
PheAsp: 5.618 ± 1.289
PheGlu: 1.404 ± 0.821
PhePhe: 1.404 ± 0.55
PheGly: 3.511 ± 0.819
PheHis: 1.404 ± 0.946
PheIle: 2.809 ± 1.234
PheLys: 1.404 ± 0.851
PheLeu: 2.107 ± 0.667
PheMet: 1.404 ± 0.884
PheAsn: 4.213 ± 2.315
PhePro: 0.702 ± 0.473
PheGln: 4.213 ± 0.96
PheArg: 2.107 ± 0.497
PheSer: 4.213 ± 1.099
PheThr: 2.107 ± 0.792
PheVal: 2.809 ± 0.517
PheTrp: 1.404 ± 0.851
PheTyr: 2.107 ± 0.497
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.511 ± 1.703
GlyCys: 1.404 ± 0.55
GlyAsp: 2.107 ± 0.667
GlyGlu: 4.213 ± 1.69
GlyPhe: 2.107 ± 0.497
GlyGly: 6.32 ± 2.512
GlyHis: 1.404 ± 0.55
GlyIle: 2.107 ± 1.727
GlyLys: 2.809 ± 1.733
GlyLeu: 7.022 ± 2.756
GlyMet: 0.702 ± 0.473
GlyAsn: 2.809 ± 0.541
GlyPro: 0.0 ± 0.0
GlyGln: 0.702 ± 0.746
GlyArg: 1.404 ± 0.587
GlySer: 6.32 ± 2.249
GlyThr: 4.213 ± 2.315
GlyVal: 7.022 ± 0.918
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.809 ± 1.183
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.404 ± 0.821
HisCys: 0.702 ± 0.651
HisAsp: 1.404 ± 0.946
HisGlu: 0.0 ± 0.0
HisPhe: 1.404 ± 0.946
HisGly: 0.702 ± 0.473
HisHis: 0.702 ± 0.473
HisIle: 1.404 ± 1.039
HisLys: 2.107 ± 1.954
HisLeu: 0.702 ± 0.473
HisMet: 1.404 ± 0.55
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.702 ± 0.473
HisArg: 0.0 ± 0.0
HisSer: 3.511 ± 1.308
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.702 ± 0.651
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.107 ± 1.142
IleCys: 1.404 ± 1.302
IleAsp: 3.511 ± 1.641
IleGlu: 5.618 ± 1.74
IlePhe: 2.809 ± 1.733
IleGly: 5.618 ± 1.263
IleHis: 2.107 ± 1.109
IleIle: 3.511 ± 3.256
IleLys: 3.511 ± 1.195
IleLeu: 4.213 ± 1.505
IleMet: 0.702 ± 0.571
IleAsn: 1.404 ± 0.55
IlePro: 2.107 ± 0.497
IleGln: 2.809 ± 1.372
IleArg: 1.404 ± 1.302
IleSer: 2.809 ± 1.757
IleThr: 2.107 ± 0.83
IleVal: 2.107 ± 1.006
IleTrp: 1.404 ± 0.77
IleTyr: 1.404 ± 0.946
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.809 ± 0.783
LysCys: 2.107 ± 1.33
LysAsp: 5.618 ± 2.364
LysGlu: 4.213 ± 2.184
LysPhe: 1.404 ± 0.821
LysGly: 4.916 ± 0.896
LysHis: 2.107 ± 0.667
LysIle: 3.511 ± 2.123
LysLys: 8.427 ± 3.597
LysLeu: 4.916 ± 2.918
LysMet: 2.107 ± 0.937
LysAsn: 4.916 ± 1.933
LysPro: 1.404 ± 0.946
LysGln: 3.511 ± 2.115
LysArg: 4.213 ± 0.552
LysSer: 4.916 ± 1.678
LysThr: 0.702 ± 0.653
LysVal: 2.809 ± 0.783
LysTrp: 1.404 ± 0.587
LysTyr: 2.107 ± 0.667
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.809 ± 1.174
LeuCys: 0.0 ± 0.0
LeuAsp: 5.618 ± 1.004
LeuGlu: 4.916 ± 2.629
LeuPhe: 2.809 ± 0.783
LeuGly: 4.213 ± 1.616
LeuHis: 0.702 ± 0.473
LeuIle: 2.107 ± 1.109
LeuLys: 8.427 ± 1.653
LeuLeu: 3.511 ± 2.413
LeuMet: 1.404 ± 0.587
LeuAsn: 6.32 ± 2.213
LeuPro: 7.725 ± 1.494
LeuGln: 6.32 ± 1.12
LeuArg: 1.404 ± 0.821
LeuSer: 8.427 ± 2.667
LeuThr: 2.107 ± 1.462
LeuVal: 6.32 ± 1.388
LeuTrp: 1.404 ± 0.587
LeuTyr: 4.213 ± 2.195
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.404 ± 1.491
MetCys: 0.702 ± 0.473
MetAsp: 1.404 ± 1.306
MetGlu: 2.107 ± 0.937
MetPhe: 1.404 ± 0.946
MetGly: 1.404 ± 1.011
MetHis: 0.702 ± 0.473
MetIle: 2.107 ± 1.545
MetLys: 0.702 ± 0.651
MetLeu: 1.404 ± 0.789
MetMet: 0.0 ± 0.0
MetAsn: 0.702 ± 0.789
MetPro: 0.702 ± 0.473
MetGln: 0.702 ± 0.473
MetArg: 2.809 ± 1.183
MetSer: 1.404 ± 0.587
MetThr: 2.107 ± 0.843
MetVal: 0.702 ± 0.473
MetTrp: 0.702 ± 0.473
MetTyr: 1.404 ± 0.587
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.32 ± 1.283
AsnCys: 2.107 ± 1.109
AsnAsp: 2.809 ± 0.895
AsnGlu: 4.213 ± 1.335
AsnPhe: 4.213 ± 1.284
AsnGly: 3.511 ± 2.514
AsnHis: 0.0 ± 0.0
AsnIle: 2.107 ± 1.332
AsnLys: 2.809 ± 1.291
AsnLeu: 8.427 ± 2.574
AsnMet: 0.0 ± 0.0
AsnAsn: 4.213 ± 2.237
AsnPro: 2.107 ± 1.418
AsnGln: 1.404 ± 0.941
AsnArg: 2.809 ± 1.448
AsnSer: 1.404 ± 0.946
AsnThr: 3.511 ± 1.229
AsnVal: 3.511 ± 0.969
AsnTrp: 1.404 ± 0.55
AsnTyr: 2.107 ± 1.109
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 1.404 ± 1.039
ProAsp: 0.702 ± 0.473
ProGlu: 2.107 ± 1.418
ProPhe: 4.213 ± 0.891
ProGly: 1.404 ± 0.946
ProHis: 1.404 ± 0.821
ProIle: 2.107 ± 1.418
ProLys: 1.404 ± 1.302
ProLeu: 2.107 ± 1.11
ProMet: 1.404 ± 0.946
ProAsn: 1.404 ± 0.946
ProPro: 0.702 ± 0.473
ProGln: 2.107 ± 0.83
ProArg: 2.107 ± 0.792
ProSer: 4.916 ± 1.126
ProThr: 0.702 ± 0.473
ProVal: 2.107 ± 0.843
ProTrp: 0.0 ± 0.0
ProTyr: 1.404 ± 0.77
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.404 ± 1.491
GlnCys: 1.404 ± 1.039
GlnAsp: 1.404 ± 1.306
GlnGlu: 2.107 ± 0.804
GlnPhe: 3.511 ± 1.804
GlnGly: 0.702 ± 0.473
GlnHis: 0.0 ± 0.0
GlnIle: 1.404 ± 0.946
GlnLys: 7.022 ± 2.184
GlnLeu: 3.511 ± 1.804
GlnMet: 1.404 ± 0.851
GlnAsn: 4.213 ± 0.683
GlnPro: 1.404 ± 0.77
GlnGln: 4.213 ± 2.416
GlnArg: 2.107 ± 1.332
GlnSer: 4.916 ± 2.196
GlnThr: 3.511 ± 1.374
GlnVal: 2.107 ± 1.148
GlnTrp: 0.0 ± 0.0
GlnTyr: 4.213 ± 1.076
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.107 ± 1.418
ArgCys: 0.0 ± 0.0
ArgAsp: 2.107 ± 0.497
ArgGlu: 4.916 ± 1.4
ArgPhe: 1.404 ± 0.851
ArgGly: 3.511 ± 1.618
ArgHis: 0.0 ± 0.0
ArgIle: 2.809 ± 1.702
ArgLys: 0.702 ± 0.653
ArgLeu: 2.809 ± 1.183
ArgMet: 1.404 ± 0.789
ArgAsn: 3.511 ± 1.108
ArgPro: 0.702 ± 0.651
ArgGln: 0.702 ± 0.653
ArgArg: 2.107 ± 1.545
ArgSer: 2.107 ± 0.497
ArgThr: 2.107 ± 1.418
ArgVal: 0.702 ± 0.651
ArgTrp: 2.107 ± 1.545
ArgTyr: 4.916 ± 0.896
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.022 ± 2.073
SerCys: 0.702 ± 0.473
SerAsp: 9.129 ± 2.011
SerGlu: 4.213 ± 1.616
SerPhe: 4.213 ± 1.513
SerGly: 7.725 ± 1.437
SerHis: 0.702 ± 0.746
SerIle: 4.213 ± 0.993
SerLys: 4.916 ± 2.968
SerLeu: 8.427 ± 1.865
SerMet: 2.809 ± 0.882
SerAsn: 4.213 ± 1.743
SerPro: 2.107 ± 1.332
SerGln: 6.32 ± 3.197
SerArg: 0.0 ± 0.0
SerSer: 11.236 ± 1.483
SerThr: 2.809 ± 1.37
SerVal: 8.427 ± 2.774
SerTrp: 0.0 ± 0.0
SerTyr: 4.916 ± 1.304
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.809 ± 0.517
ThrCys: 0.0 ± 0.0
ThrAsp: 2.809 ± 1.931
ThrGlu: 1.404 ± 0.77
ThrPhe: 1.404 ± 0.789
ThrGly: 3.511 ± 1.089
ThrHis: 0.0 ± 0.0
ThrIle: 2.809 ± 1.37
ThrLys: 2.107 ± 0.843
ThrLeu: 2.107 ± 0.792
ThrMet: 0.0 ± 0.0
ThrAsn: 1.404 ± 0.946
ThrPro: 3.511 ± 1.139
ThrGln: 0.702 ± 0.746
ThrArg: 3.511 ± 1.618
ThrSer: 5.618 ± 1.841
ThrThr: 3.511 ± 0.871
ThrVal: 2.809 ± 1.891
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.404 ± 0.55
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.916 ± 0.76
ValCys: 0.702 ± 0.473
ValAsp: 6.32 ± 1.86
ValGlu: 4.916 ± 0.746
ValPhe: 2.107 ± 1.418
ValGly: 5.618 ± 1.541
ValHis: 0.0 ± 0.0
ValIle: 2.107 ± 0.83
ValLys: 2.107 ± 0.497
ValLeu: 6.32 ± 0.917
ValMet: 0.702 ± 0.789
ValAsn: 2.809 ± 1.448
ValPro: 3.511 ± 0.819
ValGln: 2.809 ± 1.222
ValArg: 0.702 ± 0.746
ValSer: 7.022 ± 0.867
ValThr: 0.702 ± 0.473
ValVal: 1.404 ± 1.306
ValTrp: 0.0 ± 0.0
ValTyr: 4.916 ± 2.006
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.702 ± 0.651
TrpCys: 0.0 ± 0.0
TrpAsp: 2.107 ± 0.497
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.702 ± 0.473
TrpGly: 0.0 ± 0.0
TrpHis: 1.404 ± 0.55
TrpIle: 0.702 ± 0.473
TrpLys: 1.404 ± 0.77
TrpLeu: 1.404 ± 0.946
TrpMet: 0.0 ± 0.0
TrpAsn: 2.107 ± 1.11
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.702 ± 0.651
TrpSer: 2.107 ± 1.109
TrpThr: 0.702 ± 0.653
TrpVal: 0.0 ± 0.0
TrpTrp: 0.702 ± 0.789
TrpTyr: 0.702 ± 0.473
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.107 ± 0.804
TyrCys: 0.702 ± 0.789
TyrAsp: 4.916 ± 3.104
TyrGlu: 2.107 ± 0.667
TyrPhe: 3.511 ± 1.624
TyrGly: 2.809 ± 1.099
TyrHis: 0.702 ± 0.651
TyrIle: 3.511 ± 0.997
TyrLys: 2.809 ± 1.022
TyrLeu: 5.618 ± 1.542
TyrMet: 0.702 ± 0.473
TyrAsn: 6.32 ± 1.611
TyrPro: 2.107 ± 1.006
TyrGln: 2.107 ± 0.843
TyrArg: 4.916 ± 1.949
TyrSer: 1.404 ± 1.302
TyrThr: 2.107 ± 1.109
TyrVal: 3.511 ± 0.954
TyrTrp: 0.702 ± 0.473
TyrTyr: 1.404 ± 0.55
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1425 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
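Protein-level bootstrapping can be illustrated as follows. This is a rough sketch rather than the database's own procedure: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the results is reported. The helper names, the fixed seed, and the use of the standard deviation as the error are assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "SS") in per mille."""
    hits = total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        hits += sum(seq[i:i + 2] == pair for i in range(n_pairs))
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKSSLD", "SSDA", "MLKKV", "ASSG", "DDEK"]  # toy sequences, not real data
    print(pair_permille(toy, "SS"), "±", bootstrap_error(toy, "SS"))
```

With only 5 proteins, each resample can differ a lot from the original set, which is consistent with the relatively large errors shown in the table.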

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski