Amino acid dipeptide frequency for Spiroplasma virus 4 (SpV4)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
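The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function name `dipeptide_permille` is hypothetical, overlapping pairs are counted within each protein separately (an assumption about how pairs at protein boundaries are handled), and any character outside the 20 standard one-letter codes is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(sequences):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each sequence only,
    and every non-standard or unknown residue is treated as Xaa.
    """
    counts = Counter()
    for seq in sequences:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # zip(codes, codes[1:]) yields every overlapping pair of neighbours.
        counts.update(first + second for first, second in zip(codes, codes[1:]))

    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        first + second: (1000.0 * counts[first + second] / total if total else 0.0)
        for first, second in product(names, repeat=2)
    }
```

For example, `dipeptide_permille(["ACDA", "AC"])["AlaCys"]` is 500.0, because two of the four counted pairs are Ala followed by Cys.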

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
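As a small worked example, the unit conversion and the comparison against the uniform expectation might look as follows. The names `permille_to_fraction` and `classify` are hypothetical, and the rule used here (a plain comparison with 1/400 = 2.5‰) is an assumption; the page does not state the exact threshold behind its red and blue marking.

```python
N_STANDARD = 20
# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD**2


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented (assumed simple rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"
```

With the values from the table, GluLys (9.845‰) would be labelled "over" and AlaCys (0.703‰) "under" by this rule.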

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.813 ± 1.713
AlaCys: 0.703 ± 0.654
AlaAsp: 4.923 ± 2.553
AlaGlu: 2.11 ± 0.836
AlaPhe: 2.11 ± 1.414
AlaGly: 4.219 ± 2.534
AlaHis: 0.0 ± 0.0
AlaIle: 2.11 ± 1.038
AlaLys: 1.406 ± 0.607
AlaLeu: 3.516 ± 1.234
AlaMet: 0.0 ± 0.0
AlaAsn: 5.626 ± 1.809
AlaPro: 3.516 ± 2.356
AlaGln: 2.813 ± 1.366
AlaArg: 6.329 ± 1.485
AlaSer: 2.813 ± 1.233
AlaThr: 1.406 ± 0.842
AlaVal: 2.11 ± 1.187
AlaTrp: 0.703 ± 0.471
AlaTyr: 1.406 ± 1.086
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.703 ± 0.471
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.703 ± 0.754
CysPhe: 0.703 ± 0.654
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.406 ± 1.354
CysLys: 1.406 ± 1.508
CysLeu: 2.11 ± 1.049
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.406 ± 1.307
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.406 ± 1.307
CysTrp: 0.0 ± 0.0
CysTyr: 0.703 ± 1.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.219 ± 1.737
AspCys: 0.0 ± 0.0
AspAsp: 2.813 ± 1.613
AspGlu: 4.923 ± 2.525
AspPhe: 2.11 ± 1.038
AspGly: 0.0 ± 0.0
AspHis: 1.406 ± 0.942
AspIle: 1.406 ± 0.942
AspLys: 4.923 ± 1.851
AspLeu: 7.032 ± 2.427
AspMet: 2.11 ± 0.836
AspAsn: 4.923 ± 1.592
AspPro: 0.703 ± 0.471
AspGln: 0.703 ± 0.471
AspArg: 1.406 ± 0.607
AspSer: 4.219 ± 1.546
AspThr: 2.11 ± 1.72
AspVal: 2.813 ± 1.033
AspTrp: 1.406 ± 0.942
AspTyr: 5.626 ± 1.608
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.813 ± 1.063
GluCys: 0.703 ± 0.471
GluAsp: 2.813 ± 0.945
GluGlu: 0.0 ± 0.0
GluPhe: 2.813 ± 0.773
GluGly: 1.406 ± 1.292
GluHis: 2.813 ± 1.214
GluIle: 6.329 ± 2.03
GluLys: 9.845 ± 2.96
GluLeu: 2.11 ± 1.857
GluMet: 1.406 ± 0.758
GluAsn: 5.626 ± 4.375
GluPro: 0.0 ± 0.0
GluGln: 2.11 ± 0.823
GluArg: 4.219 ± 2.198
GluSer: 0.703 ± 0.938
GluThr: 1.406 ± 0.842
GluVal: 0.703 ± 1.122
GluTrp: 2.813 ± 1.225
GluTyr: 2.813 ± 2.757
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.406 ± 0.942
PheCys: 0.0 ± 0.0
PheAsp: 1.406 ± 0.607
PheGlu: 1.406 ± 0.904
PhePhe: 1.406 ± 0.607
PheGly: 6.329 ± 3.179
PheHis: 0.703 ± 0.654
PheIle: 3.516 ± 1.884
PheLys: 4.219 ± 1.446
PheLeu: 1.406 ± 0.942
PheMet: 2.11 ± 0.942
PheAsn: 2.813 ± 1.017
PhePro: 0.0 ± 0.0
PheGln: 2.11 ± 0.836
PheArg: 3.516 ± 2.356
PheSer: 2.11 ± 0.991
PheThr: 2.11 ± 1.414
PheVal: 1.406 ± 1.142
PheTrp: 0.703 ± 0.654
PheTyr: 0.703 ± 0.654
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.11 ± 1.706
GlyCys: 0.703 ± 0.654
GlyAsp: 3.516 ± 2.851
GlyGlu: 5.626 ± 0.983
GlyPhe: 2.813 ± 1.017
GlyGly: 5.626 ± 1.996
GlyHis: 1.406 ± 0.607
GlyIle: 7.032 ± 1.774
GlyLys: 2.11 ± 0.778
GlyLeu: 5.626 ± 3.943
GlyMet: 2.11 ± 1.706
GlyAsn: 2.11 ± 1.414
GlyPro: 1.406 ± 0.856
GlyGln: 2.11 ± 1.038
GlyArg: 2.11 ± 0.951
GlySer: 7.032 ± 5.3
GlyThr: 3.516 ± 1.017
GlyVal: 3.516 ± 1.142
GlyTrp: 0.703 ± 0.471
GlyTyr: 2.11 ± 1.532
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.406 ± 0.942
HisGlu: 0.0 ± 0.0
HisPhe: 2.11 ± 1.414
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 4.219 ± 1.822
HisLys: 0.703 ± 0.471
HisLeu: 1.406 ± 0.904
HisMet: 0.0 ± 0.0
HisAsn: 1.406 ± 1.307
HisPro: 0.703 ± 0.471
HisGln: 0.703 ± 0.902
HisArg: 1.406 ± 2.443
HisSer: 2.813 ± 1.066
HisThr: 2.11 ± 0.868
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.813 ± 1.214
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.11 ± 1.187
IleCys: 0.703 ± 0.654
IleAsp: 2.813 ± 1.622
IleGlu: 2.11 ± 1.111
IlePhe: 2.11 ± 1.187
IleGly: 3.516 ± 1.787
IleHis: 0.703 ± 0.471
IleIle: 5.626 ± 3.894
IleLys: 8.439 ± 2.565
IleLeu: 3.516 ± 3.83
IleMet: 2.11 ± 1.139
IleAsn: 3.516 ± 1.913
IlePro: 3.516 ± 1.889
IleGln: 2.11 ± 0.991
IleArg: 2.11 ± 0.991
IleSer: 4.219 ± 1.369
IleThr: 3.516 ± 1.562
IleVal: 9.845 ± 2.623
IleTrp: 0.703 ± 1.122
IleTyr: 3.516 ± 1.884
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.406 ± 0.607
LysCys: 1.406 ± 0.904
LysAsp: 4.219 ± 2.204
LysGlu: 8.439 ± 2.768
LysPhe: 3.516 ± 1.282
LysGly: 6.329 ± 2.63
LysHis: 2.813 ± 1.429
LysIle: 3.516 ± 2.157
LysLys: 8.439 ± 2.829
LysLeu: 8.439 ± 2.319
LysMet: 4.923 ± 1.304
LysAsn: 4.219 ± 1.248
LysPro: 2.11 ± 1.414
LysGln: 0.703 ± 0.938
LysArg: 6.329 ± 3.558
LysSer: 3.516 ± 1.234
LysThr: 5.626 ± 1.631
LysVal: 4.219 ± 1.881
LysTrp: 1.406 ± 1.414
LysTyr: 4.219 ± 2.328
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.329 ± 2.295
LeuCys: 0.703 ± 1.013
LeuAsp: 5.626 ± 2.521
LeuGlu: 6.329 ± 3.685
LeuPhe: 2.11 ± 1.369
LeuGly: 8.439 ± 2.349
LeuHis: 0.0 ± 0.0
LeuIle: 4.219 ± 2.801
LeuLys: 4.923 ± 1.653
LeuLeu: 7.032 ± 6.097
LeuMet: 2.11 ± 2.269
LeuAsn: 4.219 ± 1.418
LeuPro: 4.219 ± 1.509
LeuGln: 2.813 ± 1.164
LeuArg: 7.032 ± 1.411
LeuSer: 7.736 ± 2.938
LeuThr: 4.923 ± 1.812
LeuVal: 5.626 ± 2.076
LeuTrp: 1.406 ± 1.155
LeuTyr: 1.406 ± 0.942
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.11 ± 1.088
MetCys: 0.0 ± 0.0
MetAsp: 2.813 ± 1.505
MetGlu: 2.11 ± 1.477
MetPhe: 1.406 ± 0.942
MetGly: 0.703 ± 0.913
MetHis: 0.0 ± 0.0
MetIle: 2.11 ± 2.227
MetLys: 2.11 ± 1.111
MetLeu: 2.813 ± 2.872
MetMet: 0.0 ± 0.0
MetAsn: 0.703 ± 0.654
MetPro: 1.406 ± 0.939
MetGln: 1.406 ± 1.146
MetArg: 2.813 ± 1.248
MetSer: 2.11 ± 0.836
MetThr: 0.703 ± 0.471
MetVal: 2.813 ± 1.991
MetTrp: 0.0 ± 0.0
MetTyr: 0.703 ± 0.471
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.813 ± 1.259
AsnCys: 0.703 ± 0.654
AsnAsp: 2.11 ± 0.836
AsnGlu: 3.516 ± 1.728
AsnPhe: 2.11 ± 1.291
AsnGly: 2.813 ± 0.99
AsnHis: 2.11 ± 1.145
AsnIle: 5.626 ± 2.469
AsnLys: 5.626 ± 1.095
AsnLeu: 4.219 ± 1.737
AsnMet: 1.406 ± 1.826
AsnAsn: 2.813 ± 0.945
AsnPro: 2.11 ± 1.328
AsnGln: 1.406 ± 0.842
AsnArg: 2.11 ± 1.085
AsnSer: 4.219 ± 1.384
AsnThr: 5.626 ± 2.102
AsnVal: 2.11 ± 1.414
AsnTrp: 2.11 ± 1.219
AsnTyr: 2.11 ± 1.17
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.406 ± 0.842
ProCys: 1.406 ± 1.116
ProAsp: 0.703 ± 0.471
ProGlu: 1.406 ± 1.307
ProPhe: 0.0 ± 0.0
ProGly: 2.813 ± 1.505
ProHis: 1.406 ± 0.607
ProIle: 0.703 ± 0.471
ProLys: 3.516 ± 1.033
ProLeu: 5.626 ± 1.907
ProMet: 1.406 ± 0.942
ProAsn: 2.11 ± 0.868
ProPro: 2.11 ± 0.991
ProGln: 3.516 ± 2.356
ProArg: 1.406 ± 0.939
ProSer: 3.516 ± 1.489
ProThr: 0.703 ± 0.471
ProVal: 3.516 ± 2.356
ProTrp: 1.406 ± 0.998
ProTyr: 0.703 ± 0.471
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.406 ± 0.856
GlnCys: 0.0 ± 0.0
GlnAsp: 2.11 ± 0.868
GlnGlu: 2.813 ± 1.839
GlnPhe: 0.703 ± 0.913
GlnGly: 4.219 ± 1.771
GlnHis: 0.703 ± 0.471
GlnIle: 1.406 ± 1.142
GlnLys: 2.813 ± 1.164
GlnLeu: 4.923 ± 1.482
GlnMet: 0.0 ± 0.0
GlnAsn: 2.813 ± 1.304
GlnPro: 0.703 ± 0.471
GlnGln: 1.406 ± 0.607
GlnArg: 4.219 ± 1.773
GlnSer: 2.11 ± 1.414
GlnThr: 2.813 ± 1.318
GlnVal: 1.406 ± 0.607
GlnTrp: 1.406 ± 1.056
GlnTyr: 0.703 ± 0.913
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.923 ± 1.687
ArgCys: 0.703 ± 0.754
ArgAsp: 3.516 ± 1.844
ArgGlu: 2.11 ± 1.17
ArgPhe: 4.219 ± 1.387
ArgGly: 3.516 ± 1.203
ArgHis: 0.703 ± 0.902
ArgIle: 2.813 ± 0.99
ArgLys: 3.516 ± 1.891
ArgLeu: 4.923 ± 1.806
ArgMet: 3.516 ± 2.416
ArgAsn: 1.406 ± 1.292
ArgPro: 3.516 ± 2.436
ArgGln: 0.703 ± 0.654
ArgArg: 6.329 ± 4.966
ArgSer: 4.923 ± 1.094
ArgThr: 2.11 ± 0.868
ArgVal: 4.923 ± 2.571
ArgTrp: 0.703 ± 1.222
ArgTyr: 4.219 ± 1.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.516 ± 2.208
SerCys: 0.703 ± 0.471
SerAsp: 3.516 ± 1.017
SerGlu: 5.626 ± 2.553
SerPhe: 1.406 ± 1.414
SerGly: 4.219 ± 2.534
SerHis: 1.406 ± 0.842
SerIle: 1.406 ± 0.939
SerLys: 7.032 ± 2.426
SerLeu: 6.329 ± 1.684
SerMet: 2.11 ± 1.085
SerAsn: 7.736 ± 1.971
SerPro: 0.703 ± 0.913
SerGln: 4.219 ± 1.378
SerArg: 2.11 ± 1.707
SerSer: 7.032 ± 2.289
SerThr: 2.813 ± 1.259
SerVal: 6.329 ± 1.335
SerTrp: 0.703 ± 0.471
SerTyr: 1.406 ± 0.607
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.11 ± 0.868
ThrCys: 0.703 ± 0.754
ThrAsp: 2.813 ± 1.164
ThrGlu: 1.406 ± 0.842
ThrPhe: 1.406 ± 0.942
ThrGly: 2.11 ± 0.868
ThrHis: 0.703 ± 0.902
ThrIle: 6.329 ± 1.102
ThrLys: 4.219 ± 1.881
ThrLeu: 6.329 ± 1.741
ThrMet: 0.703 ± 0.471
ThrAsn: 0.703 ± 0.754
ThrPro: 4.219 ± 2.827
ThrGln: 1.406 ± 1.292
ThrArg: 1.406 ± 0.607
ThrSer: 4.219 ± 1.255
ThrThr: 2.11 ± 0.868
ThrVal: 1.406 ± 0.856
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.923 ± 1.478
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.923 ± 0.978
ValCys: 1.406 ± 1.293
ValAsp: 2.813 ± 1.885
ValGlu: 2.11 ± 0.991
ValPhe: 0.703 ± 1.122
ValGly: 4.219 ± 1.928
ValHis: 2.11 ± 0.868
ValIle: 2.813 ± 1.864
ValLys: 5.626 ± 1.35
ValLeu: 3.516 ± 2.331
ValMet: 2.11 ± 1.954
ValAsn: 2.813 ± 1.225
ValPro: 7.032 ± 2.845
ValGln: 2.813 ± 1.318
ValArg: 2.813 ± 1.439
ValSer: 2.813 ± 1.735
ValThr: 2.11 ± 0.868
ValVal: 3.516 ± 3.852
ValTrp: 1.406 ± 1.155
ValTyr: 1.406 ± 0.607
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.406 ± 0.942
TrpCys: 0.0 ± 0.0
TrpAsp: 2.11 ± 0.868
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.813 ± 1.017
TrpGly: 0.703 ± 0.654
TrpHis: 0.703 ± 0.471
TrpIle: 1.406 ± 2.243
TrpLys: 1.406 ± 1.155
TrpLeu: 2.11 ± 1.312
TrpMet: 0.0 ± 0.0
TrpAsn: 0.703 ± 0.913
TrpPro: 0.703 ± 1.013
TrpGln: 1.406 ± 1.028
TrpArg: 1.406 ± 0.939
TrpSer: 0.703 ± 0.754
TrpThr: 0.703 ± 0.654
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.11 ± 1.414
TyrCys: 0.0 ± 0.0
TyrAsp: 2.813 ± 0.975
TyrGlu: 0.703 ± 0.938
TyrPhe: 2.813 ± 1.796
TyrGly: 2.11 ± 0.778
TyrHis: 2.11 ± 1.17
TyrIle: 2.813 ± 1.214
TyrLys: 3.516 ± 1.423
TyrLeu: 4.219 ± 2.111
TyrMet: 0.0 ± 0.0
TyrAsn: 1.406 ± 1.028
TyrPro: 0.703 ± 0.471
TyrGln: 4.219 ± 0.824
TyrArg: 3.516 ± 2.804
TyrSer: 3.516 ± 1.262
TyrThr: 2.813 ± 1.993
TyrVal: 1.406 ± 0.998
TyrTrp: 0.703 ± 0.471
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1423 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
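Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. This is only an illustration under stated assumptions, not the database's actual code: the function names are hypothetical, sequences use one-letter codes, pairs are counted within each protein, and the error is taken to be the standard deviation over 100 rounds (the page does not say which spread measure is used).

```python
import random
from collections import Counter
from statistics import pstdev


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide, e.g. "KK", in a set of proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the frequency
    across n_rounds resamples, each drawing len(sequences) proteins
    with replacement.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)
```

A call such as `bootstrap_error(proteins, "KK")`, where `proteins` is a list of one-letter sequences, returns the estimated error in per mille. Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a dipeptide concentrated in a single protein gets a comparatively large error.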

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski