Amino acid dipepetide frequency for Simian immunodeficiency virus - agm.tan-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.505AlaAla: 6.505 ± 3.034
2.396AlaCys: 2.396 ± 0.702
1.027AlaAsp: 1.027 ± 0.898
5.135AlaGlu: 5.135 ± 1.768
2.396AlaPhe: 2.396 ± 0.779
5.82AlaGly: 5.82 ± 2.042
0.685AlaHis: 0.685 ± 0.366
1.369AlaIle: 1.369 ± 0.528
2.739AlaLys: 2.739 ± 1.464
5.478AlaLeu: 5.478 ± 0.596
1.712AlaMet: 1.712 ± 0.957
2.054AlaAsn: 2.054 ± 0.918
3.081AlaPro: 3.081 ± 1.085
3.766AlaGln: 3.766 ± 0.874
3.423AlaArg: 3.423 ± 0.759
2.054AlaSer: 2.054 ± 0.848
2.739AlaThr: 2.739 ± 0.824
3.766AlaVal: 3.766 ± 1.243
2.396AlaTrp: 2.396 ± 1.367
2.739AlaTyr: 2.739 ± 0.519
0.0AlaXaa: 0.0 ± 0.0
Cys
1.369CysAla: 1.369 ± 0.528
0.685CysCys: 0.685 ± 0.903
0.685CysAsp: 0.685 ± 0.468
0.685CysGlu: 0.685 ± 0.572
1.712CysPhe: 1.712 ± 2.101
1.027CysGly: 1.027 ± 0.682
1.712CysHis: 1.712 ± 0.674
1.712CysIle: 1.712 ± 0.674
1.712CysLys: 1.712 ± 0.705
1.712CysLeu: 1.712 ± 0.635
0.342CysMet: 0.342 ± 0.286
1.027CysAsn: 1.027 ± 0.492
1.027CysPro: 1.027 ± 0.278
2.396CysGln: 2.396 ± 0.668
1.369CysArg: 1.369 ± 0.818
0.342CysSer: 0.342 ± 0.286
2.396CysThr: 2.396 ± 1.328
1.027CysVal: 1.027 ± 0.683
0.342CysTrp: 0.342 ± 0.247
1.027CysTyr: 1.027 ± 0.919
0.0CysXaa: 0.0 ± 0.0
Asp
1.712AspAla: 1.712 ± 1.236
1.712AspCys: 1.712 ± 1.033
2.396AspAsp: 2.396 ± 1.301
1.712AspGlu: 1.712 ± 0.909
1.027AspPhe: 1.027 ± 0.589
1.712AspGly: 1.712 ± 1.177
0.685AspHis: 0.685 ± 0.381
2.396AspIle: 2.396 ± 1.393
3.081AspLys: 3.081 ± 0.978
2.054AspLeu: 2.054 ± 1.85
0.685AspMet: 0.685 ± 0.468
1.712AspAsn: 1.712 ± 0.655
3.766AspPro: 3.766 ± 1.284
1.712AspGln: 1.712 ± 0.977
1.027AspArg: 1.027 ± 0.278
2.396AspSer: 2.396 ± 1.241
3.081AspThr: 3.081 ± 0.517
2.054AspVal: 2.054 ± 0.939
2.739AspTrp: 2.739 ± 1.416
2.054AspTyr: 2.054 ± 0.767
0.0AspXaa: 0.0 ± 0.0
Glu
4.108GluAla: 4.108 ± 0.737
1.027GluCys: 1.027 ± 0.667
2.396GluAsp: 2.396 ± 0.555
5.478GluGlu: 5.478 ± 1.992
2.054GluPhe: 2.054 ± 1.115
7.532GluGly: 7.532 ± 1.464
1.369GluHis: 1.369 ± 0.642
4.793GluIle: 4.793 ± 1.567
7.189GluLys: 7.189 ± 1.036
3.081GluLeu: 3.081 ± 0.714
1.027GluMet: 1.027 ± 0.646
2.054GluAsn: 2.054 ± 0.983
2.739GluPro: 2.739 ± 1.237
5.135GluGln: 5.135 ± 1.284
3.423GluArg: 3.423 ± 1.044
4.451GluSer: 4.451 ± 2.023
4.108GluThr: 4.108 ± 1.685
4.108GluVal: 4.108 ± 1.201
1.369GluTrp: 1.369 ± 0.926
0.342GluTyr: 0.342 ± 0.247
0.0GluXaa: 0.0 ± 0.0
Phe
1.369PheAla: 1.369 ± 0.76
1.369PheCys: 1.369 ± 0.934
0.685PheAsp: 0.685 ± 0.718
2.396PheGlu: 2.396 ± 0.544
1.369PhePhe: 1.369 ± 0.642
2.396PheGly: 2.396 ± 0.912
0.342PheHis: 0.342 ± 0.286
1.027PheIle: 1.027 ± 0.719
2.739PheLys: 2.739 ± 0.507
4.108PheLeu: 4.108 ± 0.927
0.0PheMet: 0.0 ± 0.0
2.054PheAsn: 2.054 ± 0.63
1.369PhePro: 1.369 ± 0.404
1.369PheGln: 1.369 ± 0.632
2.396PheArg: 2.396 ± 1.358
0.342PheSer: 0.342 ± 0.463
0.342PheThr: 0.342 ± 0.247
1.027PheVal: 1.027 ± 0.492
0.685PheTrp: 0.685 ± 0.264
1.369PheTyr: 1.369 ± 0.818
0.0PheXaa: 0.0 ± 0.0
Gly
6.505GlyAla: 6.505 ± 1.939
1.712GlyCys: 1.712 ± 0.927
3.081GlyAsp: 3.081 ± 0.828
3.766GlyGlu: 3.766 ± 1.574
4.108GlyPhe: 4.108 ± 1.281
6.847GlyGly: 6.847 ± 1.746
1.712GlyHis: 1.712 ± 1.373
6.505GlyIle: 6.505 ± 1.822
5.135GlyLys: 5.135 ± 1.659
5.135GlyLeu: 5.135 ± 1.548
2.054GlyMet: 2.054 ± 1.194
2.739GlyAsn: 2.739 ± 0.632
4.108GlyPro: 4.108 ± 1.132
3.766GlyGln: 3.766 ± 1.664
5.478GlyArg: 5.478 ± 1.881
4.108GlySer: 4.108 ± 1.138
2.739GlyThr: 2.739 ± 0.868
2.739GlyVal: 2.739 ± 0.858
1.369GlyTrp: 1.369 ± 0.716
1.369GlyTyr: 1.369 ± 0.528
0.0GlyXaa: 0.0 ± 0.0
His
0.342HisAla: 0.342 ± 0.359
1.027HisCys: 1.027 ± 1.017
1.369HisAsp: 1.369 ± 0.626
0.342HisGlu: 0.342 ± 0.359
0.685HisPhe: 0.685 ± 0.498
1.712HisGly: 1.712 ± 0.635
0.0HisHis: 0.0 ± 0.0
1.369HisIle: 1.369 ± 0.362
0.685HisLys: 0.685 ± 0.366
2.739HisLeu: 2.739 ± 0.882
0.0HisMet: 0.0 ± 0.0
0.342HisAsn: 0.342 ± 0.247
1.712HisPro: 1.712 ± 0.635
0.685HisGln: 0.685 ± 0.903
0.685HisArg: 0.685 ± 0.866
1.712HisSer: 1.712 ± 1.19
2.396HisThr: 2.396 ± 1.013
0.685HisVal: 0.685 ± 0.866
0.685HisTrp: 0.685 ± 0.494
0.342HisTyr: 0.342 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
1.369IleAla: 1.369 ± 0.632
0.685IleCys: 0.685 ± 0.264
1.369IleAsp: 1.369 ± 0.772
4.108IleGlu: 4.108 ± 0.818
1.027IlePhe: 1.027 ± 0.626
4.451IleGly: 4.451 ± 1.077
2.396IleHis: 2.396 ± 0.596
4.451IleIle: 4.451 ± 1.479
3.766IleLys: 3.766 ± 1.133
4.451IleLeu: 4.451 ± 1.02
1.027IleMet: 1.027 ± 0.424
2.396IleAsn: 2.396 ± 0.596
4.108IlePro: 4.108 ± 1.906
3.766IleGln: 3.766 ± 0.649
4.793IleArg: 4.793 ± 1.077
3.081IleSer: 3.081 ± 0.89
3.766IleThr: 3.766 ± 1.291
3.081IleVal: 3.081 ± 0.871
3.081IleTrp: 3.081 ± 1.707
1.369IleTyr: 1.369 ± 0.582
0.0IleXaa: 0.0 ± 0.0
Lys
3.766LysAla: 3.766 ± 1.648
2.396LysCys: 2.396 ± 1.348
4.108LysAsp: 4.108 ± 1.04
8.901LysGlu: 8.901 ± 1.835
1.027LysPhe: 1.027 ± 0.424
4.108LysGly: 4.108 ± 0.927
1.369LysHis: 1.369 ± 0.429
4.793LysIle: 4.793 ± 2.06
4.108LysLys: 4.108 ± 0.917
8.559LysLeu: 8.559 ± 1.534
1.369LysMet: 1.369 ± 0.631
2.739LysAsn: 2.739 ± 1.014
1.712LysPro: 1.712 ± 0.635
2.739LysGln: 2.739 ± 1.047
4.451LysArg: 4.451 ± 1.276
2.396LysSer: 2.396 ± 0.723
4.108LysThr: 4.108 ± 0.628
4.108LysVal: 4.108 ± 2.628
0.342LysTrp: 0.342 ± 0.286
2.054LysTyr: 2.054 ± 0.602
0.0LysXaa: 0.0 ± 0.0
Leu
4.108LeuAla: 4.108 ± 1.564
0.685LeuCys: 0.685 ± 0.366
3.081LeuAsp: 3.081 ± 0.973
8.216LeuGlu: 8.216 ± 1.164
2.054LeuPhe: 2.054 ± 0.556
5.478LeuGly: 5.478 ± 1.193
1.369LeuHis: 1.369 ± 0.998
4.108LeuIle: 4.108 ± 0.706
6.505LeuLys: 6.505 ± 1.621
8.559LeuLeu: 8.559 ± 1.136
1.027LeuMet: 1.027 ± 0.278
4.793LeuAsn: 4.793 ± 1.088
3.423LeuPro: 3.423 ± 0.759
2.396LeuGln: 2.396 ± 1.482
6.162LeuArg: 6.162 ± 1.561
5.135LeuSer: 5.135 ± 1.757
5.135LeuThr: 5.135 ± 1.522
7.189LeuVal: 7.189 ± 1.372
2.396LeuTrp: 2.396 ± 0.728
2.054LeuTyr: 2.054 ± 0.659
0.0LeuXaa: 0.0 ± 0.0
Met
2.054MetAla: 2.054 ± 1.451
0.342MetCys: 0.342 ± 0.286
1.369MetAsp: 1.369 ± 0.528
1.712MetGlu: 1.712 ± 0.957
0.342MetPhe: 0.342 ± 0.359
2.396MetGly: 2.396 ± 0.44
0.342MetHis: 0.342 ± 0.463
1.027MetIle: 1.027 ± 0.424
0.0MetLys: 0.0 ± 0.0
1.712MetLeu: 1.712 ± 0.635
0.342MetMet: 0.342 ± 0.286
1.027MetAsn: 1.027 ± 0.492
1.027MetPro: 1.027 ± 0.682
1.027MetGln: 1.027 ± 1.077
0.342MetArg: 0.342 ± 0.286
1.027MetSer: 1.027 ± 0.719
1.712MetThr: 1.712 ± 0.471
1.369MetVal: 1.369 ± 0.585
1.027MetTrp: 1.027 ± 0.742
0.342MetTyr: 0.342 ± 0.359
0.0MetXaa: 0.0 ± 0.0
Asn
2.739AsnAla: 2.739 ± 1.134
1.712AsnCys: 1.712 ± 1.037
1.369AsnAsp: 1.369 ± 0.429
1.369AsnGlu: 1.369 ± 0.484
1.712AsnPhe: 1.712 ± 0.854
1.369AsnGly: 1.369 ± 0.404
0.685AsnHis: 0.685 ± 1.769
4.108AsnIle: 4.108 ± 1.387
2.739AsnLys: 2.739 ± 0.734
3.766AsnLeu: 3.766 ± 0.94
2.054AsnMet: 2.054 ± 0.583
2.739AsnAsn: 2.739 ± 1.499
2.739AsnPro: 2.739 ± 1.094
2.739AsnGln: 2.739 ± 1.454
1.369AsnArg: 1.369 ± 0.528
3.423AsnSer: 3.423 ± 1.809
3.766AsnThr: 3.766 ± 1.588
2.396AsnVal: 2.396 ± 0.85
2.396AsnTrp: 2.396 ± 0.44
1.027AsnTyr: 1.027 ± 0.683
0.0AsnXaa: 0.0 ± 0.0
Pro
4.451ProAla: 4.451 ± 1.528
1.369ProCys: 1.369 ± 1.058
3.081ProAsp: 3.081 ± 0.797
2.396ProGlu: 2.396 ± 1.039
0.685ProPhe: 0.685 ± 0.381
3.423ProGly: 3.423 ± 1.4
1.027ProHis: 1.027 ± 0.719
2.739ProIle: 2.739 ± 1.253
2.739ProLys: 2.739 ± 1.203
5.478ProLeu: 5.478 ± 1.178
1.369ProMet: 1.369 ± 0.585
2.054ProAsn: 2.054 ± 0.767
3.423ProPro: 3.423 ± 1.631
2.739ProGln: 2.739 ± 1.154
4.108ProArg: 4.108 ± 2.337
2.396ProSer: 2.396 ± 0.749
4.451ProThr: 4.451 ± 1.382
3.423ProVal: 3.423 ± 1.178
1.027ProTrp: 1.027 ± 1.063
1.369ProTyr: 1.369 ± 0.989
0.0ProXaa: 0.0 ± 0.0
Gln
4.451GlnAla: 4.451 ± 0.862
0.0GlnCys: 0.0 ± 0.0
2.054GlnAsp: 2.054 ± 1.035
5.478GlnGlu: 5.478 ± 1.332
2.054GlnPhe: 2.054 ± 1.115
5.135GlnGly: 5.135 ± 1.61
1.027GlnHis: 1.027 ± 0.424
4.451GlnIle: 4.451 ± 0.87
4.793GlnLys: 4.793 ± 1.142
4.793GlnLeu: 4.793 ± 1.209
2.739GlnMet: 2.739 ± 1.01
1.712GlnAsn: 1.712 ± 1.205
1.027GlnPro: 1.027 ± 0.424
8.216GlnGln: 8.216 ± 3.423
3.081GlnArg: 3.081 ± 2.648
2.054GlnSer: 2.054 ± 0.866
1.712GlnThr: 1.712 ± 0.705
3.423GlnVal: 3.423 ± 1.665
3.081GlnTrp: 3.081 ± 1.247
3.423GlnTyr: 3.423 ± 0.652
0.0GlnXaa: 0.0 ± 0.0
Arg
2.739ArgAla: 2.739 ± 0.507
1.712ArgCys: 1.712 ± 1.085
2.054ArgAsp: 2.054 ± 1.167
4.793ArgGlu: 4.793 ± 1.832
1.712ArgPhe: 1.712 ± 1.199
5.135ArgGly: 5.135 ± 1.579
0.685ArgHis: 0.685 ± 0.564
2.054ArgIle: 2.054 ± 2.552
4.451ArgLys: 4.451 ± 1.802
3.423ArgLeu: 3.423 ± 0.73
1.712ArgMet: 1.712 ± 0.406
2.396ArgAsn: 2.396 ± 0.664
4.451ArgPro: 4.451 ± 2.983
5.478ArgGln: 5.478 ± 1.961
6.162ArgArg: 6.162 ± 5.344
2.739ArgSer: 2.739 ± 0.824
2.396ArgThr: 2.396 ± 0.85
2.739ArgVal: 2.739 ± 0.519
1.027ArgTrp: 1.027 ± 0.424
1.712ArgTyr: 1.712 ± 0.769
0.0ArgXaa: 0.0 ± 0.0
Ser
2.054SerAla: 2.054 ± 1.154
1.027SerCys: 1.027 ± 0.492
2.054SerAsp: 2.054 ± 0.921
2.396SerGlu: 2.396 ± 1.859
1.027SerPhe: 1.027 ± 0.667
2.739SerGly: 2.739 ± 1.321
1.027SerHis: 1.027 ± 0.589
2.054SerIle: 2.054 ± 0.821
4.108SerLys: 4.108 ± 1.891
4.108SerLeu: 4.108 ± 1.716
1.027SerMet: 1.027 ± 0.492
2.739SerAsn: 2.739 ± 1.533
3.081SerPro: 3.081 ± 1.002
4.793SerGln: 4.793 ± 1.707
3.423SerArg: 3.423 ± 2.716
5.82SerSer: 5.82 ± 3.29
3.766SerThr: 3.766 ± 1.893
3.766SerVal: 3.766 ± 0.986
1.027SerTrp: 1.027 ± 0.858
0.342SerTyr: 0.342 ± 0.359
0.0SerXaa: 0.0 ± 0.0
Thr
6.505ThrAla: 6.505 ± 1.218
0.0ThrCys: 0.0 ± 0.0
1.369ThrAsp: 1.369 ± 0.528
2.396ThrGlu: 2.396 ± 0.664
1.369ThrPhe: 1.369 ± 0.585
4.451ThrGly: 4.451 ± 1.209
1.027ThrHis: 1.027 ± 0.511
3.423ThrIle: 3.423 ± 1.324
4.108ThrLys: 4.108 ± 1.197
5.135ThrLeu: 5.135 ± 1.73
0.0ThrMet: 0.0 ± 0.0
4.793ThrAsn: 4.793 ± 1.557
5.478ThrPro: 5.478 ± 0.433
2.739ThrGln: 2.739 ± 1.389
2.396ThrArg: 2.396 ± 1.199
2.739ThrSer: 2.739 ± 1.603
4.451ThrThr: 4.451 ± 1.676
3.423ThrVal: 3.423 ± 1.321
2.054ThrTrp: 2.054 ± 0.383
2.054ThrTyr: 2.054 ± 1.311
0.0ThrXaa: 0.0 ± 0.0
Val
2.396ValAla: 2.396 ± 0.772
2.396ValCys: 2.396 ± 1.016
2.054ValAsp: 2.054 ± 0.945
3.766ValGlu: 3.766 ± 0.947
1.027ValPhe: 1.027 ± 0.492
3.766ValGly: 3.766 ± 1.374
0.685ValHis: 0.685 ± 0.366
4.108ValIle: 4.108 ± 1.115
4.451ValLys: 4.451 ± 1.525
6.505ValLeu: 6.505 ± 1.902
0.342ValMet: 0.342 ± 0.247
2.739ValAsn: 2.739 ± 0.917
3.423ValPro: 3.423 ± 0.741
4.793ValGln: 4.793 ± 1.3
1.712ValArg: 1.712 ± 0.892
4.108ValSer: 4.108 ± 2.04
2.396ValThr: 2.396 ± 1.248
2.739ValVal: 2.739 ± 0.772
1.712ValTrp: 1.712 ± 0.641
0.685ValTyr: 0.685 ± 0.494
0.0ValXaa: 0.0 ± 0.0
Trp
1.712TrpAla: 1.712 ± 0.406
1.027TrpCys: 1.027 ± 0.57
2.054TrpAsp: 2.054 ± 1.819
1.712TrpGlu: 1.712 ± 0.847
1.027TrpPhe: 1.027 ± 0.858
3.766TrpGly: 3.766 ± 0.889
0.0TrpHis: 0.0 ± 0.0
0.685TrpIle: 0.685 ± 0.264
2.739TrpLys: 2.739 ± 0.928
2.396TrpLeu: 2.396 ± 1.441
0.685TrpMet: 0.685 ± 0.366
1.027TrpAsn: 1.027 ± 0.667
0.685TrpPro: 0.685 ± 0.494
2.396TrpGln: 2.396 ± 0.728
2.054TrpArg: 2.054 ± 0.623
1.027TrpSer: 1.027 ± 0.589
2.396TrpThr: 2.396 ± 0.702
1.369TrpVal: 1.369 ± 1.026
1.369TrpTrp: 1.369 ± 0.528
1.027TrpTyr: 1.027 ± 0.492
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.027TyrAla: 1.027 ± 0.536
1.027TyrCys: 1.027 ± 0.492
2.054TyrAsp: 2.054 ± 0.679
1.027TyrGlu: 1.027 ± 0.858
0.342TyrPhe: 0.342 ± 0.286
1.712TyrGly: 1.712 ± 0.769
1.369TyrHis: 1.369 ± 2.122
1.369TyrIle: 1.369 ± 0.76
1.712TyrLys: 1.712 ± 0.769
0.685TyrLeu: 0.685 ± 0.264
0.685TyrMet: 0.685 ± 0.264
3.081TyrAsn: 3.081 ± 0.517
1.369TyrPro: 1.369 ± 0.73
2.054TyrGln: 2.054 ± 0.556
1.712TyrArg: 1.712 ± 0.674
1.027TyrSer: 1.027 ± 0.517
2.054TyrThr: 2.054 ± 0.792
1.369TyrVal: 1.369 ± 0.429
1.027TyrTrp: 1.027 ± 0.424
1.369TyrTyr: 1.369 ± 0.989
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2922 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski