Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_263

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
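The counting and the red/blue marking described above can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical, residues use one-letter codes (the table uses three-letter codes), dipeptides are counted as overlapping pairs within each protein, and the uniform 1/400 expectation is used as the threshold.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard/unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (assumption).
        clean = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for first, second in zip(clean, clean[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_permille > expected_permille:
        return "over"      # shown in red in the table
    if freq_permille < expected_permille:
        return "under"     # shown in blue in the table
    return "expected"


# Hypothetical toy input, not the Cap1_SP_263 proteome.
toy_proteins = ["ALNLN", "LNK"]
freqs = dipeptide_permille(toy_proteins)
print(freqs["LN"], classify(freqs["LN"]))
print(freqs["AA"], classify(freqs["AA"]))
```

With real data the input would be the protein sequences of the proteome; how the database normalizes the counts exactly is not stated on this page, so small differences from the tabulated values would not be surprising.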

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
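As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation. The helper names are hypothetical; the LeuAsn value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


leu_asn = 12.938                    # LeuAsn frequency from the table, in per mille
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

print(permille_to_fraction(leu_asn))   # roughly 0.0129
print(permille_to_percent(leu_asn))    # roughly 1.29 %
print(leu_asn / uniform_expectation)   # roughly 5 times the uniform expectation
```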

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.522 ± 1.387
AlaCys: 0.0 ± 0.0
AlaAsp: 0.761 ± 0.529
AlaGlu: 6.088 ± 2.886
AlaPhe: 1.522 ± 0.673
AlaGly: 3.805 ± 2.596
AlaHis: 1.522 ± 0.673
AlaIle: 3.044 ± 1.346
AlaLys: 1.522 ± 1.387
AlaLeu: 4.566 ± 1.072
AlaMet: 0.0 ± 0.0
AlaAsn: 2.283 ± 0.993
AlaPro: 3.044 ± 1.346
AlaGln: 1.522 ± 0.673
AlaArg: 1.522 ± 1.387
AlaSer: 6.088 ± 2.08
AlaThr: 1.522 ± 0.673
AlaVal: 1.522 ± 0.673
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.283 ± 0.51
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.761 ± 0.807
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.761 ± 0.807
CysHis: 0.0 ± 0.0
CysIle: 3.044 ± 2.743
CysLys: 1.522 ± 1.5
CysLeu: 3.805 ± 1.643
CysMet: 0.0 ± 0.0
CysAsn: 0.761 ± 0.529
CysPro: 0.761 ± 0.807
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.522 ± 1.5
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.761 ± 0.529
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.283 ± 1.26
AspCys: 1.522 ± 0.721
AspAsp: 3.044 ± 1.443
AspGlu: 3.044 ± 1.239
AspPhe: 5.327 ± 3.646
AspGly: 3.044 ± 1.392
AspHis: 0.0 ± 0.0
AspIle: 3.805 ± 2.291
AspLys: 4.566 ± 1.019
AspLeu: 9.132 ± 1.736
AspMet: 0.761 ± 0.726
AspAsn: 3.044 ± 1.04
AspPro: 1.522 ± 1.058
AspGln: 3.805 ± 1.898
AspArg: 0.761 ± 0.529
AspSer: 3.805 ± 2.291
AspThr: 3.044 ± 1.441
AspVal: 0.761 ± 0.693
AspTrp: 0.761 ± 0.807
AspTyr: 3.805 ± 1.867
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.522 ± 1.387
GluCys: 0.0 ± 0.0
GluAsp: 3.805 ± 1.931
GluGlu: 3.044 ± 3.632
GluPhe: 3.044 ± 1.04
GluGly: 0.761 ± 0.529
GluHis: 2.283 ± 1.26
GluIle: 3.044 ± 0.946
GluLys: 3.044 ± 1.316
GluLeu: 6.849 ± 3.509
GluMet: 2.283 ± 1.027
GluAsn: 4.566 ± 2.019
GluPro: 0.761 ± 1.251
GluGln: 3.805 ± 1.031
GluArg: 2.283 ± 1.027
GluSer: 5.327 ± 3.484
GluThr: 3.044 ± 1.04
GluVal: 5.327 ± 2.401
GluTrp: 0.0 ± 0.0
GluTyr: 5.327 ± 1.4
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.805 ± 1.898
PheCys: 0.0 ± 0.0
PheAsp: 6.088 ± 3.47
PheGlu: 3.044 ± 1.239
PhePhe: 6.088 ± 1.915
PheGly: 6.088 ± 2.543
PheHis: 0.761 ± 1.251
PheIle: 2.283 ± 1.27
PheLys: 4.566 ± 1.019
PheLeu: 3.044 ± 2.415
PheMet: 0.0 ± 0.0
PheAsn: 3.805 ± 2.005
PhePro: 0.761 ± 0.807
PheGln: 0.0 ± 0.0
PheArg: 3.044 ± 1.785
PheSer: 5.327 ± 2.326
PheThr: 3.044 ± 2.216
PheVal: 5.327 ± 2.326
PheTrp: 1.522 ± 0.673
PheTyr: 3.805 ± 1.067
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.283 ± 0.993
GlyCys: 0.0 ± 0.0
GlyAsp: 0.761 ± 0.529
GlyGlu: 4.566 ± 0.828
GlyPhe: 1.522 ± 0.721
GlyGly: 4.566 ± 3.174
GlyHis: 3.044 ± 2.216
GlyIle: 3.044 ± 1.441
GlyLys: 3.805 ± 1.039
GlyLeu: 5.327 ± 1.618
GlyMet: 0.0 ± 0.0
GlyAsn: 3.805 ± 2.645
GlyPro: 0.761 ± 0.529
GlyGln: 3.044 ± 0.98
GlyArg: 2.283 ± 0.974
GlySer: 3.044 ± 1.441
GlyThr: 4.566 ± 1.518
GlyVal: 5.327 ± 1.192
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.805 ± 1.612
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.761 ± 0.693
HisCys: 0.0 ± 0.0
HisAsp: 1.522 ± 1.614
HisGlu: 3.044 ± 0.946
HisPhe: 0.761 ± 0.529
HisGly: 2.283 ± 0.974
HisHis: 0.761 ± 0.807
HisIle: 1.522 ± 0.721
HisLys: 2.283 ± 1.306
HisLeu: 3.044 ± 1.443
HisMet: 0.0 ± 0.0
HisAsn: 1.522 ± 1.387
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.761 ± 0.529
HisSer: 2.283 ± 1.489
HisThr: 0.0 ± 0.0
HisVal: 1.522 ± 1.614
HisTrp: 0.0 ± 0.0
HisTyr: 3.805 ± 2.463
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.522 ± 0.721
IleCys: 0.0 ± 0.0
IleAsp: 3.044 ± 1.842
IleGlu: 2.283 ± 1.027
IlePhe: 5.327 ± 1.845
IleGly: 5.327 ± 1.715
IleHis: 0.761 ± 0.807
IleIle: 1.522 ± 0.673
IleLys: 4.566 ± 3.889
IleLeu: 3.805 ± 1.031
IleMet: 3.805 ± 0.715
IleAsn: 5.327 ± 2.278
IlePro: 2.283 ± 0.993
IleGln: 1.522 ± 1.058
IleArg: 0.0 ± 0.0
IleSer: 6.088 ± 0.952
IleThr: 1.522 ± 0.673
IleVal: 1.522 ± 1.387
IleTrp: 0.761 ± 0.529
IleTyr: 3.044 ± 0.618
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.044 ± 1.919
LysCys: 1.522 ± 1.614
LysAsp: 6.849 ± 2.66
LysGlu: 2.283 ± 2.08
LysPhe: 5.327 ± 1.854
LysGly: 0.0 ± 0.0
LysHis: 4.566 ± 2.978
LysIle: 2.283 ± 1.356
LysLys: 6.088 ± 2.764
LysLeu: 5.327 ± 1.001
LysMet: 0.761 ± 0.529
LysAsn: 3.044 ± 2.216
LysPro: 2.283 ± 1.489
LysGln: 3.805 ± 2.491
LysArg: 4.566 ± 1.949
LysSer: 6.088 ± 4.692
LysThr: 4.566 ± 1.072
LysVal: 3.044 ± 1.239
LysTrp: 0.761 ± 0.693
LysTyr: 3.805 ± 1.427
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.805 ± 1.898
LeuCys: 0.761 ± 0.807
LeuAsp: 9.893 ± 1.869
LeuGlu: 4.566 ± 2.52
LeuPhe: 5.327 ± 1.001
LeuGly: 4.566 ± 1.665
LeuHis: 1.522 ± 1.26
LeuIle: 8.371 ± 1.939
LeuLys: 7.61 ± 4.105
LeuLeu: 7.61 ± 2.393
LeuMet: 0.761 ± 0.439
LeuAsn: 12.938 ± 1.804
LeuPro: 3.805 ± 1.291
LeuGln: 6.849 ± 3.458
LeuArg: 6.849 ± 2.084
LeuSer: 5.327 ± 1.192
LeuThr: 3.805 ± 0.88
LeuVal: 3.805 ± 2.238
LeuTrp: 1.522 ± 1.387
LeuTyr: 0.761 ± 0.693
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.761 ± 0.807
MetAsp: 0.761 ± 0.693
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 2.283 ± 1.027
MetHis: 0.761 ± 0.529
MetIle: 0.0 ± 0.0
MetLys: 2.283 ± 1.27
MetLeu: 1.522 ± 1.26
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.761 ± 0.529
MetGln: 0.761 ± 0.693
MetArg: 0.0 ± 0.0
MetSer: 2.283 ± 1.436
MetThr: 0.0 ± 0.0
MetVal: 0.761 ± 0.529
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.088 ± 3.15
AsnCys: 0.761 ± 1.251
AsnAsp: 5.327 ± 1.003
AsnGlu: 8.371 ± 2.842
AsnPhe: 6.849 ± 2.613
AsnGly: 3.805 ± 0.88
AsnHis: 0.0 ± 0.0
AsnIle: 3.044 ± 1.316
AsnLys: 6.849 ± 2.344
AsnLeu: 9.132 ± 2.918
AsnMet: 0.761 ± 0.529
AsnAsn: 4.566 ± 1.459
AsnPro: 1.522 ± 0.673
AsnGln: 3.044 ± 2.773
AsnArg: 2.283 ± 0.51
AsnSer: 5.327 ± 1.003
AsnThr: 4.566 ± 1.668
AsnVal: 2.283 ± 0.993
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.522 ± 0.835
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 2.283 ± 2.058
ProAsp: 1.522 ± 1.157
ProGlu: 1.522 ± 0.673
ProPhe: 3.805 ± 1.031
ProGly: 1.522 ± 1.058
ProHis: 1.522 ± 0.721
ProIle: 2.283 ± 0.993
ProLys: 3.805 ± 1.031
ProLeu: 4.566 ± 1.941
ProMet: 0.761 ± 0.807
ProAsn: 2.283 ± 0.993
ProPro: 0.0 ± 0.0
ProGln: 1.522 ± 1.058
ProArg: 3.044 ± 0.618
ProSer: 2.283 ± 1.436
ProThr: 1.522 ± 1.058
ProVal: 0.761 ± 0.529
ProTrp: 0.761 ± 0.529
ProTyr: 3.044 ± 0.98
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.522 ± 0.673
GlnCys: 0.0 ± 0.0
GlnAsp: 1.522 ± 1.387
GlnGlu: 3.044 ± 1.346
GlnPhe: 1.522 ± 0.835
GlnGly: 1.522 ± 1.058
GlnHis: 0.761 ± 0.529
GlnIle: 3.044 ± 1.346
GlnLys: 3.044 ± 1.785
GlnLeu: 3.805 ± 1.279
GlnMet: 0.761 ± 0.693
GlnAsn: 5.327 ± 3.967
GlnPro: 3.805 ± 2.396
GlnGln: 4.566 ± 3.28
GlnArg: 4.566 ± 1.985
GlnSer: 3.805 ± 1.619
GlnThr: 1.522 ± 0.673
GlnVal: 3.044 ± 1.17
GlnTrp: 0.761 ± 0.693
GlnTyr: 3.044 ± 1.346
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.522 ± 1.058
ArgCys: 1.522 ± 1.058
ArgAsp: 1.522 ± 1.058
ArgGlu: 0.761 ± 0.693
ArgPhe: 3.805 ± 1.205
ArgGly: 2.283 ± 1.587
ArgHis: 1.522 ± 0.673
ArgIle: 3.044 ± 0.618
ArgLys: 1.522 ± 1.614
ArgLeu: 4.566 ± 0.828
ArgMet: 0.761 ± 0.807
ArgAsn: 2.283 ± 1.436
ArgPro: 5.327 ± 2.344
ArgGln: 2.283 ± 1.26
ArgArg: 2.283 ± 0.51
ArgSer: 2.283 ± 1.293
ArgThr: 2.283 ± 0.51
ArgVal: 2.283 ± 0.974
ArgTrp: 0.761 ± 0.529
ArgTyr: 2.283 ± 0.974
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.327 ± 2.242
SerCys: 2.283 ± 1.27
SerAsp: 3.805 ± 0.715
SerGlu: 4.566 ± 2.095
SerPhe: 3.805 ± 1.031
SerGly: 3.044 ± 1.602
SerHis: 2.283 ± 2.421
SerIle: 4.566 ± 3.218
SerLys: 5.327 ± 2.883
SerLeu: 7.61 ± 0.262
SerMet: 0.761 ± 1.009
SerAsn: 6.849 ± 3.458
SerPro: 1.522 ± 0.835
SerGln: 3.044 ± 1.443
SerArg: 4.566 ± 2.456
SerSer: 6.849 ± 2.468
SerThr: 3.044 ± 1.842
SerVal: 6.849 ± 1.9
SerTrp: 0.761 ± 0.807
SerTyr: 3.805 ± 0.715
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.044 ± 1.346
ThrCys: 0.0 ± 0.0
ThrAsp: 3.044 ± 2.116
ThrGlu: 1.522 ± 1.387
ThrPhe: 4.566 ± 2.273
ThrGly: 2.283 ± 0.993
ThrHis: 0.761 ± 0.807
ThrIle: 3.805 ± 1.067
ThrLys: 1.522 ± 1.058
ThrLeu: 6.088 ± 1.759
ThrMet: 0.0 ± 0.0
ThrAsn: 3.805 ± 2.57
ThrPro: 2.283 ± 1.587
ThrGln: 2.283 ± 0.51
ThrArg: 0.0 ± 0.0
ThrSer: 6.088 ± 2.144
ThrThr: 0.0 ± 0.0
ThrVal: 0.761 ± 0.693
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.044 ± 0.618
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.805 ± 1.836
ValCys: 0.761 ± 1.251
ValAsp: 2.283 ± 1.587
ValGlu: 4.566 ± 2.286
ValPhe: 2.283 ± 1.587
ValGly: 2.283 ± 0.993
ValHis: 1.522 ± 0.721
ValIle: 1.522 ± 1.058
ValLys: 3.805 ± 0.715
ValLeu: 2.283 ± 1.587
ValMet: 0.0 ± 0.0
ValAsn: 2.283 ± 1.604
ValPro: 4.566 ± 2.164
ValGln: 3.805 ± 0.715
ValArg: 2.283 ± 0.974
ValSer: 3.805 ± 2.774
ValThr: 3.805 ± 1.944
ValVal: 2.283 ± 1.587
ValTrp: 0.0 ± 0.0
ValTyr: 0.761 ± 0.693
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.761 ± 0.693
TrpCys: 0.0 ± 0.0
TrpAsp: 0.761 ± 0.529
TrpGlu: 0.761 ± 0.807
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.761 ± 0.529
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.283 ± 1.26
TrpMet: 0.0 ± 0.0
TrpAsn: 0.761 ± 0.693
TrpPro: 0.761 ± 0.807
TrpGln: 0.0 ± 0.0
TrpArg: 0.761 ± 0.529
TrpSer: 1.522 ± 0.673
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.761 ± 0.529
TyrCys: 1.522 ± 1.614
TyrAsp: 1.522 ± 1.5
TyrGlu: 3.044 ± 0.946
TyrPhe: 1.522 ± 1.058
TyrGly: 5.327 ± 2.099
TyrHis: 0.761 ± 0.807
TyrIle: 0.761 ± 0.529
TyrLys: 2.283 ± 0.974
TyrLeu: 6.088 ± 1.759
TyrMet: 0.0 ± 0.636
TyrAsn: 6.849 ± 0.599
TyrPro: 2.283 ± 1.26
TyrGln: 5.327 ± 2.262
TyrArg: 3.044 ± 0.618
TyrSer: 1.522 ± 0.673
TyrThr: 3.044 ± 0.618
TyrVal: 1.522 ± 1.157
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.522 ± 1.614
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1315 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
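A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the code used by the database: the function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation across replicates as the reported error are all assumptions made for illustration. Whole proteins are resampled with replacement, and the dipeptide frequencies are recomputed for each replicate.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Return {dipeptide: (observed frequency, bootstrap error)}.

    Proteins (not residues) are the resampling unit; the error is taken
    as the standard deviation over the replicates (an assumption).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freqs(sample))
    observed = dipeptide_freqs(proteins)
    return {
        dp: (freq, statistics.stdev([rep.get(dp, 0.0) for rep in replicates]))
        for dp, freq in observed.items()
    }


# Hypothetical toy proteome, not the four Cap1_SP_263 proteins.
toy_proteins = ["ALNLN", "LNK", "AAKL"]
for dipeptide, (freq, err) in sorted(bootstrap_error(toy_proteins).items()):
    print(f"{dipeptide}: {freq:.3f} ± {err:.3f}")
```

Under this scheme a dipeptide that never occurs in any protein is absent from every replicate as well. With only 4 proteins the replicates differ strongly from one another, which fits the large errors relative to the values in the table.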

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski