Amino acid dipepetide frequency for Lepus americanus faeces associated microvirus SHP1 6472

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.077AlaAla: 3.077 ± 1.559
0.0AlaCys: 0.0 ± 0.0
6.154AlaAsp: 6.154 ± 0.688
3.692AlaGlu: 3.692 ± 2.191
2.462AlaPhe: 2.462 ± 0.791
3.077AlaGly: 3.077 ± 1.559
0.615AlaHis: 0.615 ± 0.407
3.077AlaIle: 3.077 ± 1.443
4.308AlaLys: 4.308 ± 1.996
6.154AlaLeu: 6.154 ± 3.119
0.0AlaMet: 0.0 ± 0.0
3.077AlaAsn: 3.077 ± 1.559
2.462AlaPro: 2.462 ± 1.627
3.077AlaGln: 3.077 ± 1.688
3.077AlaArg: 3.077 ± 1.875
3.077AlaSer: 3.077 ± 2.38
1.846AlaThr: 1.846 ± 1.14
4.308AlaVal: 4.308 ± 0.782
0.615AlaTrp: 0.615 ± 0.598
6.154AlaTyr: 6.154 ± 1.722
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.846CysAsp: 1.846 ± 0.95
1.231CysGlu: 1.231 ± 0.42
0.615CysPhe: 0.615 ± 0.407
1.846CysGly: 1.846 ± 1.793
0.0CysHis: 0.0 ± 0.0
0.615CysIle: 0.615 ± 0.598
0.615CysLys: 0.615 ± 0.598
1.846CysLeu: 1.846 ± 0.346
0.0CysMet: 0.0 ± 0.0
1.231CysAsn: 1.231 ± 0.736
1.846CysPro: 1.846 ± 0.572
1.231CysGln: 1.231 ± 0.42
1.846CysArg: 1.846 ± 1.181
0.615CysSer: 0.615 ± 0.598
0.0CysThr: 0.0 ± 0.0
1.846CysVal: 1.846 ± 1.793
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.308AspAla: 4.308 ± 1.812
0.615AspCys: 0.615 ± 0.598
3.077AspAsp: 3.077 ± 0.918
4.923AspGlu: 4.923 ± 0.928
1.846AspPhe: 1.846 ± 0.346
1.231AspGly: 1.231 ± 0.813
1.231AspHis: 1.231 ± 0.813
4.923AspIle: 4.923 ± 0.337
5.538AspLys: 5.538 ± 2.849
6.154AspLeu: 6.154 ± 2.887
0.0AspMet: 0.0 ± 0.0
4.308AspAsn: 4.308 ± 2.824
0.615AspPro: 0.615 ± 0.598
1.846AspGln: 1.846 ± 0.763
0.615AspArg: 0.615 ± 0.407
3.692AspSer: 3.692 ± 1.825
4.923AspThr: 4.923 ± 0.994
5.538AspVal: 5.538 ± 1.585
1.231AspTrp: 1.231 ± 0.813
6.769AspTyr: 6.769 ± 3.234
0.0AspXaa: 0.0 ± 0.0
Glu
3.077GluAla: 3.077 ± 1.688
0.615GluCys: 0.615 ± 0.598
0.615GluAsp: 0.615 ± 0.407
0.615GluGlu: 0.615 ± 0.407
2.462GluPhe: 2.462 ± 0.899
2.462GluGly: 2.462 ± 0.169
0.615GluHis: 0.615 ± 0.598
3.077GluIle: 3.077 ± 2.38
1.846GluLys: 1.846 ± 1.181
6.154GluLeu: 6.154 ± 1.373
0.0GluMet: 0.0 ± 0.358
4.923GluAsn: 4.923 ± 0.928
1.231GluPro: 1.231 ± 0.42
1.231GluGln: 1.231 ± 0.736
3.692GluArg: 3.692 ± 2.209
1.846GluSer: 1.846 ± 0.763
3.077GluThr: 3.077 ± 2.12
0.615GluVal: 0.615 ± 0.598
0.0GluTrp: 0.0 ± 0.0
5.538GluTyr: 5.538 ± 3.649
0.0GluXaa: 0.0 ± 0.0
Phe
2.462PheAla: 2.462 ± 0.899
2.462PheCys: 2.462 ± 0.841
6.769PheAsp: 6.769 ± 1.202
0.615PheGlu: 0.615 ± 0.636
1.231PhePhe: 1.231 ± 0.572
5.538PheGly: 5.538 ± 1.585
0.0PheHis: 0.0 ± 0.0
1.846PheIle: 1.846 ± 1.181
0.615PheLys: 0.615 ± 0.407
3.077PheLeu: 3.077 ± 1.342
1.846PheMet: 1.846 ± 0.95
2.462PheAsn: 2.462 ± 0.169
2.462PhePro: 2.462 ± 1.53
1.231PheGln: 1.231 ± 0.813
3.692PheArg: 3.692 ± 1.261
4.923PheSer: 4.923 ± 2.611
3.692PheThr: 3.692 ± 1.144
1.846PheVal: 1.846 ± 0.95
0.615PheTrp: 0.615 ± 0.407
1.231PheTyr: 1.231 ± 0.813
0.0PheXaa: 0.0 ± 0.0
Gly
3.077GlyAla: 3.077 ± 1.286
0.0GlyCys: 0.0 ± 0.0
1.231GlyAsp: 1.231 ± 0.42
2.462GlyGlu: 2.462 ± 0.791
1.846GlyPhe: 1.846 ± 0.572
3.077GlyGly: 3.077 ± 0.918
0.615GlyHis: 0.615 ± 0.407
6.154GlyIle: 6.154 ± 1.958
2.462GlyLys: 2.462 ± 1.721
4.308GlyLeu: 4.308 ± 1.27
0.615GlyMet: 0.615 ± 0.407
3.692GlyAsn: 3.692 ± 1.717
0.0GlyPro: 0.0 ± 0.0
1.231GlyGln: 1.231 ± 0.813
4.923GlyArg: 4.923 ± 0.664
5.538GlySer: 5.538 ± 3.265
3.692GlyThr: 3.692 ± 1.526
3.692GlyVal: 3.692 ± 0.287
0.0GlyTrp: 0.0 ± 0.0
3.692GlyTyr: 3.692 ± 1.144
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.615HisCys: 0.615 ± 0.598
0.0HisAsp: 0.0 ± 0.0
0.615HisGlu: 0.615 ± 0.636
1.231HisPhe: 1.231 ± 0.42
0.615HisGly: 0.615 ± 0.407
1.231HisHis: 1.231 ± 0.572
0.615HisIle: 0.615 ± 0.407
0.0HisLys: 0.0 ± 0.0
0.615HisLeu: 0.615 ± 0.407
0.615HisMet: 0.615 ± 0.598
0.615HisAsn: 0.615 ± 0.598
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.462HisSer: 2.462 ± 0.899
0.0HisThr: 0.0 ± 0.0
1.231HisVal: 1.231 ± 0.42
0.0HisTrp: 0.0 ± 0.0
0.615HisTyr: 0.615 ± 0.598
0.0HisXaa: 0.0 ± 0.0
Ile
2.462IleAla: 2.462 ± 1.754
1.231IleCys: 1.231 ± 0.42
1.846IleAsp: 1.846 ± 0.763
3.077IleGlu: 3.077 ± 1.076
0.615IlePhe: 0.615 ± 0.407
4.923IleGly: 4.923 ± 1.867
0.0IleHis: 0.0 ± 0.0
1.846IleIle: 1.846 ± 1.14
6.769IleLys: 6.769 ± 1.296
4.923IleLeu: 4.923 ± 0.664
1.846IleMet: 1.846 ± 1.264
4.308IleAsn: 4.308 ± 2.847
3.077IlePro: 3.077 ± 1.273
1.846IleGln: 1.846 ± 1.907
2.462IleArg: 2.462 ± 0.841
7.385IleSer: 7.385 ± 2.204
4.923IleThr: 4.923 ± 1.47
1.846IleVal: 1.846 ± 0.572
1.231IleTrp: 1.231 ± 0.42
0.615IleTyr: 0.615 ± 0.598
0.0IleXaa: 0.0 ± 0.0
Lys
2.462LysAla: 2.462 ± 0.934
1.846LysCys: 1.846 ± 1.181
7.385LysAsp: 7.385 ± 1.193
4.308LysGlu: 4.308 ± 1.629
3.077LysPhe: 3.077 ± 1.076
1.231LysGly: 1.231 ± 1.272
0.615LysHis: 0.615 ± 0.407
4.923LysIle: 4.923 ± 3.129
3.077LysLys: 3.077 ± 0.734
5.538LysLeu: 5.538 ± 2.158
1.231LysMet: 1.231 ± 1.081
6.154LysAsn: 6.154 ± 1.373
1.846LysPro: 1.846 ± 0.572
0.615LysGln: 0.615 ± 0.636
2.462LysArg: 2.462 ± 1.721
1.846LysSer: 1.846 ± 1.181
2.462LysThr: 2.462 ± 0.899
2.462LysVal: 2.462 ± 1.754
0.615LysTrp: 0.615 ± 0.407
3.692LysTyr: 3.692 ± 1.144
0.0LysXaa: 0.0 ± 0.0
Leu
6.154LeuAla: 6.154 ± 3.119
3.692LeuCys: 3.692 ± 1.945
7.385LeuAsp: 7.385 ± 1.385
4.308LeuGlu: 4.308 ± 0.554
5.538LeuPhe: 5.538 ± 1.178
4.923LeuGly: 4.923 ± 1.47
0.615LeuHis: 0.615 ± 0.407
1.846LeuIle: 1.846 ± 1.14
8.615LeuLys: 8.615 ± 3.185
5.538LeuLeu: 5.538 ± 2.644
3.077LeuMet: 3.077 ± 0.435
7.385LeuAsn: 7.385 ± 1.391
3.692LeuPro: 3.692 ± 1.261
1.846LeuGln: 1.846 ± 1.239
5.538LeuArg: 5.538 ± 1.21
11.077LeuSer: 11.077 ± 0.558
5.538LeuThr: 5.538 ± 1.952
4.308LeuVal: 4.308 ± 1.451
1.231LeuTrp: 1.231 ± 0.42
1.846LeuTyr: 1.846 ± 0.95
0.0LeuXaa: 0.0 ± 0.0
Met
1.231MetAla: 1.231 ± 0.736
0.0MetCys: 0.0 ± 0.0
1.231MetAsp: 1.231 ± 0.42
0.0MetGlu: 0.0 ± 0.0
1.231MetPhe: 1.231 ± 0.42
0.0MetGly: 0.0 ± 0.0
0.615MetHis: 0.615 ± 0.598
0.615MetIle: 0.615 ± 0.636
1.846MetLys: 1.846 ± 1.181
2.462MetLeu: 2.462 ± 0.841
1.846MetMet: 1.846 ± 0.346
1.231MetAsn: 1.231 ± 0.42
2.462MetPro: 2.462 ± 0.899
1.231MetGln: 1.231 ± 1.272
1.231MetArg: 1.231 ± 0.813
0.615MetSer: 0.615 ± 0.407
1.231MetThr: 1.231 ± 0.42
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.231MetTyr: 1.231 ± 1.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.692AsnAla: 3.692 ± 0.287
0.0AsnCys: 0.0 ± 0.0
6.154AsnAsp: 6.154 ± 3.409
4.308AsnGlu: 4.308 ± 2.593
3.077AsnPhe: 3.077 ± 0.517
4.923AsnGly: 4.923 ± 1.723
0.615AsnHis: 0.615 ± 0.407
4.923AsnIle: 4.923 ± 0.994
2.462AsnLys: 2.462 ± 1.627
8.0AsnLeu: 8.0 ± 2.827
1.846AsnMet: 1.846 ± 0.346
3.692AsnAsn: 3.692 ± 1.662
4.923AsnPro: 4.923 ± 0.7
3.077AsnGln: 3.077 ± 0.734
2.462AsnArg: 2.462 ± 0.169
4.923AsnSer: 4.923 ± 1.234
6.769AsnThr: 6.769 ± 2.901
3.692AsnVal: 3.692 ± 1.717
0.615AsnTrp: 0.615 ± 0.407
1.846AsnTyr: 1.846 ± 0.572
0.0AsnXaa: 0.0 ± 0.0
Pro
1.846ProAla: 1.846 ± 0.346
1.846ProCys: 1.846 ± 1.793
2.462ProAsp: 2.462 ± 0.791
1.846ProGlu: 1.846 ± 0.572
1.846ProPhe: 1.846 ± 0.572
0.615ProGly: 0.615 ± 0.407
1.231ProHis: 1.231 ± 1.195
5.538ProIle: 5.538 ± 2.166
2.462ProLys: 2.462 ± 0.841
5.538ProLeu: 5.538 ± 2.824
1.231ProMet: 1.231 ± 0.813
3.077ProAsn: 3.077 ± 1.273
1.846ProPro: 1.846 ± 0.763
1.231ProGln: 1.231 ± 0.813
0.615ProArg: 0.615 ± 0.407
4.308ProSer: 4.308 ± 0.554
1.231ProThr: 1.231 ± 0.813
4.923ProVal: 4.923 ± 1.798
0.615ProTrp: 0.615 ± 0.407
1.846ProTyr: 1.846 ± 0.572
0.0ProXaa: 0.0 ± 0.0
Gln
4.308GlnAla: 4.308 ± 1.839
0.0GlnCys: 0.0 ± 0.0
0.615GlnAsp: 0.615 ± 0.636
1.231GlnGlu: 1.231 ± 0.572
2.462GlnPhe: 2.462 ± 0.841
2.462GlnGly: 2.462 ± 1.754
0.615GlnHis: 0.615 ± 0.636
1.846GlnIle: 1.846 ± 1.907
1.231GlnLys: 1.231 ± 0.42
3.692GlnLeu: 3.692 ± 0.732
0.615GlnMet: 0.615 ± 0.407
1.846GlnAsn: 1.846 ± 1.907
1.846GlnPro: 1.846 ± 1.22
2.462GlnGln: 2.462 ± 1.145
3.077GlnArg: 3.077 ± 0.918
2.462GlnSer: 2.462 ± 1.145
3.077GlnThr: 3.077 ± 1.286
1.846GlnVal: 1.846 ± 1.14
0.615GlnTrp: 0.615 ± 0.636
1.846GlnTyr: 1.846 ± 0.572
0.0GlnXaa: 0.0 ± 0.0
Arg
4.923ArgAla: 4.923 ± 2.186
1.231ArgCys: 1.231 ± 0.736
3.077ArgAsp: 3.077 ± 0.517
1.846ArgGlu: 1.846 ± 1.793
3.692ArgPhe: 3.692 ± 1.9
1.846ArgGly: 1.846 ± 0.572
0.0ArgHis: 0.0 ± 0.0
1.846ArgIle: 1.846 ± 0.763
3.077ArgLys: 3.077 ± 2.988
4.923ArgLeu: 4.923 ± 1.362
0.0ArgMet: 0.0 ± 0.0
3.692ArgAsn: 3.692 ± 2.44
3.692ArgPro: 3.692 ± 1.261
3.077ArgGln: 3.077 ± 2.439
1.846ArgArg: 1.846 ± 1.793
1.231ArgSer: 1.231 ± 0.42
1.231ArgThr: 1.231 ± 0.572
4.308ArgVal: 4.308 ± 0.782
0.0ArgTrp: 0.0 ± 0.0
2.462ArgTyr: 2.462 ± 0.841
0.0ArgXaa: 0.0 ± 0.0
Ser
8.615SerAla: 8.615 ± 1.583
0.615SerCys: 0.615 ± 0.407
6.154SerAsp: 6.154 ± 2.572
2.462SerGlu: 2.462 ± 0.791
4.308SerPhe: 4.308 ± 1.319
7.385SerGly: 7.385 ± 1.464
0.615SerHis: 0.615 ± 0.407
6.154SerIle: 6.154 ± 1.692
4.308SerLys: 4.308 ± 2.156
8.615SerLeu: 8.615 ± 1.615
1.231SerMet: 1.231 ± 0.42
5.538SerAsn: 5.538 ± 3.009
3.077SerPro: 3.077 ± 0.435
4.308SerGln: 4.308 ± 1.996
4.308SerArg: 4.308 ± 2.536
11.077SerSer: 11.077 ± 3.656
4.308SerThr: 4.308 ± 2.058
4.308SerVal: 4.308 ± 0.554
1.846SerTrp: 1.846 ± 0.572
3.692SerTyr: 3.692 ± 2.44
0.0SerXaa: 0.0 ± 0.0
Thr
3.077ThrAla: 3.077 ± 1.443
0.0ThrCys: 0.0 ± 0.0
3.077ThrAsp: 3.077 ± 0.517
0.615ThrGlu: 0.615 ± 0.598
4.308ThrPhe: 4.308 ± 2.058
1.846ThrGly: 1.846 ± 0.572
0.0ThrHis: 0.0 ± 0.0
1.846ThrIle: 1.846 ± 0.763
3.692ThrLys: 3.692 ± 1.9
6.154ThrLeu: 6.154 ± 1.118
1.846ThrMet: 1.846 ± 0.346
3.077ThrAsn: 3.077 ± 1.273
3.692ThrPro: 3.692 ± 0.287
4.308ThrGln: 4.308 ± 2.215
1.231ThrArg: 1.231 ± 0.813
7.385ThrSer: 7.385 ± 2.204
4.308ThrThr: 4.308 ± 2.058
4.308ThrVal: 4.308 ± 0.929
1.231ThrTrp: 1.231 ± 0.42
3.077ThrTyr: 3.077 ± 0.517
0.0ThrXaa: 0.0 ± 0.0
Val
4.308ValAla: 4.308 ± 2.156
1.231ValCys: 1.231 ± 0.42
3.077ValAsp: 3.077 ± 1.359
1.231ValGlu: 1.231 ± 0.572
2.462ValPhe: 2.462 ± 1.627
2.462ValGly: 2.462 ± 1.145
1.231ValHis: 1.231 ± 1.195
1.231ValIle: 1.231 ± 0.42
2.462ValLys: 2.462 ± 0.841
4.923ValLeu: 4.923 ± 0.7
0.0ValMet: 0.0 ± 0.0
3.692ValAsn: 3.692 ± 1.526
6.154ValPro: 6.154 ± 1.118
1.231ValGln: 1.231 ± 0.42
1.846ValArg: 1.846 ± 0.572
11.692ValSer: 11.692 ± 0.766
2.462ValThr: 2.462 ± 1.627
1.231ValVal: 1.231 ± 1.195
0.0ValTrp: 0.0 ± 0.0
2.462ValTyr: 2.462 ± 0.841
0.0ValXaa: 0.0 ± 0.0
Trp
0.615TrpAla: 0.615 ± 0.407
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.615TrpPhe: 0.615 ± 0.407
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.231TrpIle: 1.231 ± 0.42
0.615TrpLys: 0.615 ± 0.407
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.846TrpAsn: 1.846 ± 1.181
0.0TrpPro: 0.0 ± 0.0
0.615TrpGln: 0.615 ± 0.407
0.615TrpArg: 0.615 ± 0.407
1.846TrpSer: 1.846 ± 0.572
1.231TrpThr: 1.231 ± 0.42
0.615TrpVal: 0.615 ± 0.407
0.0TrpTrp: 0.0 ± 0.0
0.615TrpTyr: 0.615 ± 0.407
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.615TyrAla: 0.615 ± 0.407
1.231TyrCys: 1.231 ± 0.42
1.231TyrAsp: 1.231 ± 0.813
4.923TyrGlu: 4.923 ± 1.681
3.692TyrPhe: 3.692 ± 1.261
1.846TyrGly: 1.846 ± 0.346
0.615TyrHis: 0.615 ± 0.598
3.077TyrIle: 3.077 ± 1.342
1.846TyrLys: 1.846 ± 0.572
5.538TyrLeu: 5.538 ± 0.836
1.846TyrMet: 1.846 ± 0.95
6.154TyrAsn: 6.154 ± 0.869
1.231TyrPro: 1.231 ± 0.813
2.462TyrGln: 2.462 ± 1.627
2.462TyrArg: 2.462 ± 0.899
4.923TyrSer: 4.923 ± 2.287
3.077TyrThr: 3.077 ± 2.033
2.462TyrVal: 2.462 ± 1.53
0.0TyrTrp: 0.0 ± 0.0
2.462TyrTyr: 2.462 ± 0.899
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1626 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski