Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_478

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
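A minimal sketch of how such a frequency could be computed is given below. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to `X` (Xaa), and the counting of overlapping pairs within each protein are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Normalise case and collapse non-standard residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count every adjacent pair: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKKXA"])
```

Here `freqs` maps each observed pair (for example `"KK"`) to its share of all counted pairs, expressed in per mille; pairs that never occur are simply absent from the dictionary.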

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
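The arithmetic above can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the rule that a value is compared directly against the uniform expectation of 1/400 is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
N_STANDARD = 20

# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000 / N_STANDARD ** 2


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values copied from the table: AspLeu = 9.95, AlaCys = 0.622.
labels = {name: classify(v) for name, v in {"AspLeu": 9.95, "AlaCys": 0.622}.items()}
```

With this rule AspLeu is labelled as overrepresented and AlaCys as underrepresented. Using 441 instead of 400 possibilities would lower the expectation to roughly 2.27‰.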

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.353 ± 2.827
AlaCys: 0.622 ± 0.43
AlaAsp: 3.731 ± 1.289
AlaGlu: 3.731 ± 2.082
AlaPhe: 1.244 ± 1.663
AlaGly: 3.109 ± 1.448
AlaHis: 0.0 ± 0.0
AlaIle: 1.244 ± 0.953
AlaLys: 3.109 ± 1.248
AlaLeu: 6.219 ± 1.864
AlaMet: 0.622 ± 0.43
AlaAsn: 8.085 ± 2.889
AlaPro: 3.109 ± 1.305
AlaGln: 3.731 ± 1.977
AlaArg: 4.353 ± 1.439
AlaSer: 4.975 ± 2.304
AlaThr: 3.109 ± 1.569
AlaVal: 3.731 ± 1.608
AlaTrp: 0.622 ± 0.43
AlaTyr: 6.219 ± 2.496
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.866 ± 0.939
CysCys: 0.0 ± 0.0
CysAsp: 0.622 ± 0.527
CysGlu: 1.866 ± 2.793
CysPhe: 0.622 ± 0.527
CysGly: 2.488 ± 1.08
CysHis: 0.0 ± 0.0
CysIle: 1.244 ± 0.966
CysLys: 0.622 ± 0.43
CysLeu: 3.109 ± 1.251
CysMet: 0.0 ± 0.0
CysAsn: 0.622 ± 0.43
CysPro: 1.244 ± 1.054
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.622 ± 0.931
CysThr: 0.0 ± 0.0
CysVal: 1.244 ± 0.884
CysTrp: 0.622 ± 0.527
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.975 ± 1.608
AspCys: 1.244 ± 0.966
AspAsp: 0.622 ± 0.43
AspGlu: 1.244 ± 0.68
AspPhe: 3.109 ± 1.54
AspGly: 1.244 ± 0.809
AspHis: 0.622 ± 0.43
AspIle: 4.975 ± 1.705
AspLys: 7.463 ± 2.811
AspLeu: 9.95 ± 2.1
AspMet: 1.866 ± 1.154
AspAsn: 3.731 ± 1.319
AspPro: 1.244 ± 0.68
AspGln: 0.0 ± 0.0
AspArg: 0.622 ± 0.43
AspSer: 6.841 ± 1.537
AspThr: 2.488 ± 1.439
AspVal: 1.244 ± 0.86
AspTrp: 1.866 ± 1.016
AspTyr: 5.597 ± 1.606
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.975 ± 1.622
GluCys: 1.866 ± 1.782
GluAsp: 4.975 ± 2.419
GluGlu: 4.353 ± 2.081
GluPhe: 3.731 ± 1.01
GluGly: 2.488 ± 1.101
GluHis: 1.866 ± 1.063
GluIle: 1.866 ± 1.149
GluLys: 4.975 ± 1.991
GluLeu: 4.975 ± 1.45
GluMet: 1.866 ± 1.07
GluAsn: 6.219 ± 1.211
GluPro: 0.0 ± 0.0
GluGln: 1.866 ± 0.747
GluArg: 0.622 ± 0.921
GluSer: 1.866 ± 1.15
GluThr: 1.866 ± 0.6
GluVal: 5.597 ± 3.451
GluTrp: 2.488 ± 0.987
GluTyr: 2.488 ± 1.293
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.244 ± 0.86
PheCys: 0.0 ± 0.0
PheAsp: 1.866 ± 0.969
PheGlu: 2.488 ± 1.103
PhePhe: 1.244 ± 0.54
PheGly: 3.109 ± 0.978
PheHis: 1.244 ± 0.54
PheIle: 1.866 ± 1.056
PheLys: 1.866 ± 1.613
PheLeu: 4.975 ± 1.188
PheMet: 3.731 ± 1.553
PheAsn: 5.597 ± 2.037
PhePro: 0.622 ± 0.527
PheGln: 0.622 ± 0.43
PheArg: 1.866 ± 1.056
PheSer: 1.866 ± 1.4
PheThr: 2.488 ± 1.618
PheVal: 3.731 ± 1.544
PheTrp: 0.622 ± 0.43
PheTyr: 1.866 ± 0.844
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.109 ± 1.448
GlyCys: 0.0 ± 0.0
GlyAsp: 5.597 ± 1.512
GlyGlu: 6.219 ± 2.494
GlyPhe: 1.866 ± 0.822
GlyGly: 4.353 ± 1.994
GlyHis: 0.0 ± 0.0
GlyIle: 3.731 ± 1.22
GlyLys: 6.219 ± 1.272
GlyLeu: 3.109 ± 1.367
GlyMet: 0.0 ± 0.0
GlyAsn: 3.731 ± 1.913
GlyPro: 0.622 ± 0.43
GlyGln: 1.866 ± 0.747
GlyArg: 1.244 ± 0.884
GlySer: 3.731 ± 1.345
GlyThr: 1.866 ± 1.29
GlyVal: 4.353 ± 1.754
GlyTrp: 0.622 ± 0.527
GlyTyr: 3.731 ± 1.032
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.622 ± 0.527
HisAsp: 0.622 ± 0.43
HisGlu: 0.622 ± 0.931
HisPhe: 1.244 ± 0.54
HisGly: 1.244 ± 0.54
HisHis: 0.622 ± 0.43
HisIle: 0.622 ± 0.931
HisLys: 2.488 ± 0.688
HisLeu: 1.244 ± 0.54
HisMet: 0.0 ± 0.0
HisAsn: 1.866 ± 0.844
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.244 ± 0.86
HisThr: 0.622 ± 0.43
HisVal: 0.622 ± 0.43
HisTrp: 0.622 ± 0.527
HisTyr: 0.622 ± 0.931
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.353 ± 1.953
IleCys: 1.244 ± 1.177
IleAsp: 1.244 ± 0.902
IleGlu: 3.731 ± 2.692
IlePhe: 1.866 ± 0.861
IleGly: 5.597 ± 1.678
IleHis: 0.622 ± 0.43
IleIle: 1.866 ± 0.977
IleLys: 5.597 ± 1.468
IleLeu: 4.975 ± 3.676
IleMet: 1.244 ± 0.753
IleAsn: 5.597 ± 1.346
IlePro: 3.731 ± 1.644
IleGln: 1.866 ± 1.15
IleArg: 1.866 ± 0.822
IleSer: 1.866 ± 0.747
IleThr: 1.866 ± 0.931
IleVal: 3.731 ± 0.976
IleTrp: 1.866 ± 0.844
IleTyr: 3.109 ± 1.086
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.353 ± 2.035
LysCys: 1.244 ± 1.054
LysAsp: 6.219 ± 2.185
LysGlu: 3.731 ± 1.753
LysPhe: 5.597 ± 2.039
LysGly: 4.975 ± 1.656
LysHis: 1.866 ± 1.131
LysIle: 8.085 ± 3.796
LysLys: 6.841 ± 3.708
LysLeu: 4.975 ± 3.203
LysMet: 1.244 ± 1.15
LysAsn: 3.109 ± 1.875
LysPro: 1.244 ± 0.54
LysGln: 1.244 ± 1.097
LysArg: 2.488 ± 0.92
LysSer: 1.866 ± 1.01
LysThr: 5.597 ± 1.639
LysVal: 4.353 ± 2.218
LysTrp: 0.0 ± 0.0
LysTyr: 4.975 ± 2.41
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.975 ± 1.263
LeuCys: 0.622 ± 0.527
LeuAsp: 6.841 ± 2.272
LeuGlu: 9.328 ± 1.589
LeuPhe: 1.244 ± 0.655
LeuGly: 5.597 ± 2.186
LeuHis: 1.244 ± 1.054
LeuIle: 6.219 ± 2.879
LeuLys: 5.597 ± 3.068
LeuLeu: 5.597 ± 1.157
LeuMet: 1.244 ± 0.966
LeuAsn: 4.975 ± 2.889
LeuPro: 3.731 ± 1.494
LeuGln: 3.109 ± 1.429
LeuArg: 3.109 ± 1.323
LeuSer: 3.731 ± 1.17
LeuThr: 4.353 ± 1.757
LeuVal: 4.975 ± 2.045
LeuTrp: 0.622 ± 0.527
LeuTyr: 7.463 ± 1.96
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.244 ± 1.026
MetCys: 0.622 ± 0.527
MetAsp: 1.866 ± 1.004
MetGlu: 1.244 ± 1.193
MetPhe: 1.244 ± 1.193
MetGly: 1.244 ± 0.86
MetHis: 0.0 ± 0.0
MetIle: 0.622 ± 0.831
MetLys: 1.244 ± 0.54
MetLeu: 1.244 ± 1.128
MetMet: 0.0 ± 0.0
MetAsn: 0.622 ± 0.527
MetPro: 1.244 ± 0.86
MetGln: 1.866 ± 0.6
MetArg: 0.622 ± 0.717
MetSer: 3.109 ± 1.139
MetThr: 1.866 ± 1.226
MetVal: 1.244 ± 1.128
MetTrp: 0.0 ± 0.0
MetTyr: 0.622 ± 0.43
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.353 ± 2.195
AsnCys: 1.866 ± 1.247
AsnAsp: 2.488 ± 1.285
AsnGlu: 6.841 ± 2.495
AsnPhe: 3.731 ± 1.097
AsnGly: 4.353 ± 3.136
AsnHis: 0.622 ± 0.717
AsnIle: 2.488 ± 1.71
AsnLys: 8.085 ± 2.326
AsnLeu: 6.219 ± 3.031
AsnMet: 0.622 ± 0.416
AsnAsn: 8.706 ± 2.038
AsnPro: 1.244 ± 0.785
AsnGln: 1.866 ± 1.129
AsnArg: 3.109 ± 1.566
AsnSer: 6.841 ± 4.423
AsnThr: 8.085 ± 1.874
AsnVal: 8.706 ± 2.666
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.488 ± 1.169
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.866 ± 0.844
ProCys: 1.244 ± 0.54
ProAsp: 3.731 ± 1.318
ProGlu: 1.244 ± 0.68
ProPhe: 2.488 ± 1.195
ProGly: 2.488 ± 1.08
ProHis: 0.622 ± 0.527
ProIle: 2.488 ± 1.106
ProLys: 0.0 ± 0.0
ProLeu: 4.353 ± 2.006
ProMet: 0.622 ± 0.831
ProAsn: 0.622 ± 0.43
ProPro: 0.0 ± 0.0
ProGln: 0.622 ± 0.43
ProArg: 0.622 ± 0.527
ProSer: 1.866 ± 0.822
ProThr: 1.244 ± 0.68
ProVal: 2.488 ± 1.72
ProTrp: 0.0 ± 0.0
ProTyr: 1.866 ± 0.703
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.244 ± 1.434
GlnCys: 0.0 ± 0.0
GlnAsp: 1.244 ± 0.655
GlnGlu: 3.109 ± 1.521
GlnPhe: 0.622 ± 0.43
GlnGly: 0.622 ± 0.43
GlnHis: 0.622 ± 0.43
GlnIle: 2.488 ± 1.404
GlnLys: 3.731 ± 1.386
GlnLeu: 1.244 ± 0.852
GlnMet: 0.622 ± 0.921
GlnAsn: 0.622 ± 0.527
GlnPro: 1.866 ± 0.822
GlnGln: 2.488 ± 0.688
GlnArg: 2.488 ± 1.309
GlnSer: 3.109 ± 1.366
GlnThr: 0.622 ± 0.43
GlnVal: 3.731 ± 1.988
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.731 ± 0.892
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.731 ± 1.244
ArgCys: 0.0 ± 0.0
ArgAsp: 2.488 ± 1.18
ArgGlu: 0.622 ± 0.527
ArgPhe: 1.866 ± 0.822
ArgGly: 0.622 ± 0.43
ArgHis: 0.0 ± 0.0
ArgIle: 1.866 ± 0.977
ArgLys: 1.244 ± 0.54
ArgLeu: 3.109 ± 1.185
ArgMet: 1.244 ± 0.809
ArgAsn: 1.866 ± 0.903
ArgPro: 3.109 ± 1.323
ArgGln: 1.866 ± 1.304
ArgArg: 0.622 ± 0.527
ArgSer: 3.731 ± 1.946
ArgThr: 1.244 ± 0.655
ArgVal: 1.866 ± 0.822
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.109 ± 1.205
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.597 ± 2.397
SerCys: 0.622 ± 0.43
SerAsp: 2.488 ± 1.71
SerGlu: 2.488 ± 1.57
SerPhe: 4.353 ± 1.241
SerGly: 5.597 ± 2.419
SerHis: 1.866 ± 1.29
SerIle: 6.841 ± 2.532
SerLys: 1.866 ± 0.837
SerLeu: 5.597 ± 1.253
SerMet: 1.244 ± 0.828
SerAsn: 6.219 ± 1.688
SerPro: 3.731 ± 1.1
SerGln: 2.488 ± 1.309
SerArg: 1.866 ± 0.844
SerSer: 5.597 ± 1.519
SerThr: 5.597 ± 2.567
SerVal: 3.109 ± 1.204
SerTrp: 0.622 ± 0.717
SerTyr: 2.488 ± 1.231
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.353 ± 2.142
ThrCys: 0.622 ± 0.65
ThrAsp: 4.353 ± 1.307
ThrGlu: 0.0 ± 0.0
ThrPhe: 0.0 ± 0.0
ThrGly: 4.353 ± 2.07
ThrHis: 0.622 ± 0.527
ThrIle: 3.731 ± 2.079
ThrLys: 2.488 ± 1.557
ThrLeu: 4.975 ± 2.2
ThrMet: 1.244 ± 1.012
ThrAsn: 4.353 ± 1.616
ThrPro: 0.0 ± 0.0
ThrGln: 1.244 ± 0.68
ThrArg: 3.109 ± 2.15
ThrSer: 6.219 ± 1.519
ThrThr: 2.488 ± 1.618
ThrVal: 3.109 ± 1.392
ThrTrp: 0.622 ± 0.43
ThrTyr: 3.731 ± 1.345
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.109 ± 1.139
ValCys: 1.244 ± 0.809
ValAsp: 1.866 ± 1.097
ValGlu: 4.353 ± 1.71
ValPhe: 3.731 ± 0.77
ValGly: 1.244 ± 0.68
ValHis: 0.622 ± 0.43
ValIle: 3.731 ± 2.242
ValLys: 5.597 ± 2.741
ValLeu: 4.353 ± 1.026
ValMet: 1.244 ± 0.655
ValAsn: 7.463 ± 1.671
ValPro: 3.109 ± 2.15
ValGln: 3.109 ± 0.618
ValArg: 3.731 ± 1.169
ValSer: 6.841 ± 1.632
ValThr: 0.0 ± 0.0
ValVal: 2.488 ± 1.106
ValTrp: 1.244 ± 1.128
ValTyr: 4.353 ± 1.598
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.488 ± 0.893
TrpCys: 0.622 ± 0.527
TrpAsp: 1.244 ± 1.128
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.622 ± 0.43
TrpGly: 0.0 ± 0.0
TrpHis: 0.622 ± 0.43
TrpIle: 0.0 ± 0.0
TrpLys: 1.244 ± 1.054
TrpLeu: 1.244 ± 1.128
TrpMet: 0.0 ± 0.0
TrpAsn: 1.866 ± 1.304
TrpPro: 0.0 ± 0.0
TrpGln: 1.866 ± 0.6
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.244 ± 0.86
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.731 ± 0.829
TyrCys: 2.488 ± 1.021
TyrAsp: 6.841 ± 1.893
TyrGlu: 3.731 ± 1.923
TyrPhe: 2.488 ± 1.312
TyrGly: 1.244 ± 0.884
TyrHis: 1.244 ± 0.902
TyrIle: 2.488 ± 1.195
TyrLys: 3.731 ± 1.181
TyrLeu: 3.109 ± 1.02
TyrMet: 2.488 ± 1.206
TyrAsn: 6.841 ± 1.316
TyrPro: 0.622 ± 0.43
TyrGln: 2.488 ± 1.309
TyrArg: 1.866 ± 0.6
TyrSer: 4.353 ± 1.985
TyrThr: 4.975 ± 1.005
TyrVal: 3.109 ± 1.184
TyrTrp: 0.622 ± 0.43
TyrTyr: 4.975 ± 1.591
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1609 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
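One way a protein-level bootstrap of this kind could look is sketched below. The details are assumptions rather than the database's documented procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of the 100 replicates, and the sample standard deviation of those values is reported as the error. The function names and the one-letter sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rep=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rep):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.stdev(replicates)


# Toy proteome (hypothetical sequences, not from this virus).
toy = ["AAAA", "KKKK", "AKAK"]
error_aa = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs in any protein has frequency 0 in every replicate, so its estimated error is also 0, which matches the `0.0 ± 0.0` entries in the table.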

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski