Amino acid dipepetide frequency for Simian immunodeficiency virus - agm.sab-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.522AlaAla: 4.522 ± 1.355
2.261AlaCys: 2.261 ± 0.766
1.938AlaAsp: 1.938 ± 0.59
5.168AlaGlu: 5.168 ± 1.697
1.615AlaPhe: 1.615 ± 0.575
4.522AlaGly: 4.522 ± 1.487
1.292AlaHis: 1.292 ± 0.635
3.23AlaIle: 3.23 ± 0.706
4.199AlaLys: 4.199 ± 1.041
6.46AlaLeu: 6.46 ± 0.772
1.938AlaMet: 1.938 ± 0.629
2.907AlaAsn: 2.907 ± 0.595
3.553AlaPro: 3.553 ± 1.015
3.23AlaGln: 3.23 ± 1.332
4.199AlaArg: 4.199 ± 1.012
3.876AlaSer: 3.876 ± 1.455
3.553AlaThr: 3.553 ± 0.81
5.168AlaVal: 5.168 ± 1.347
2.261AlaTrp: 2.261 ± 0.76
1.615AlaTyr: 1.615 ± 0.635
0.0AlaXaa: 0.0 ± 0.0
Cys
1.615CysAla: 1.615 ± 1.112
1.292CysCys: 1.292 ± 0.749
0.646CysAsp: 0.646 ± 0.591
0.646CysGlu: 0.646 ± 0.3
1.938CysPhe: 1.938 ± 2.239
0.969CysGly: 0.969 ± 0.794
0.969CysHis: 0.969 ± 0.684
1.615CysIle: 1.615 ± 0.563
2.584CysLys: 2.584 ± 0.818
0.646CysLeu: 0.646 ± 0.725
0.0CysMet: 0.0 ± 0.0
0.646CysAsn: 0.646 ± 0.562
0.646CysPro: 0.646 ± 0.44
1.292CysGln: 1.292 ± 0.532
1.615CysArg: 1.615 ± 0.637
0.646CysSer: 0.646 ± 0.3
0.646CysThr: 0.646 ± 0.3
1.292CysVal: 1.292 ± 0.717
1.615CysTrp: 1.615 ± 0.69
0.969CysTyr: 0.969 ± 0.529
0.0CysXaa: 0.0 ± 0.0
Asp
1.938AspAla: 1.938 ± 0.42
0.969AspCys: 0.969 ± 0.628
2.261AspAsp: 2.261 ± 0.431
1.292AspGlu: 1.292 ± 0.929
1.615AspPhe: 1.615 ± 0.631
2.261AspGly: 2.261 ± 1.176
1.615AspHis: 1.615 ± 0.789
1.938AspIle: 1.938 ± 0.973
1.615AspLys: 1.615 ± 0.84
2.584AspLeu: 2.584 ± 1.124
1.292AspMet: 1.292 ± 0.661
1.615AspAsn: 1.615 ± 0.772
4.845AspPro: 4.845 ± 0.903
2.261AspGln: 2.261 ± 0.522
2.261AspArg: 2.261 ± 0.962
2.584AspSer: 2.584 ± 0.936
1.938AspThr: 1.938 ± 0.682
0.969AspVal: 0.969 ± 0.266
1.292AspTrp: 1.292 ± 0.773
2.584AspTyr: 2.584 ± 0.393
0.0AspXaa: 0.0 ± 0.0
Glu
5.168GluAla: 5.168 ± 0.992
0.646GluCys: 0.646 ± 0.413
1.938GluAsp: 1.938 ± 0.54
8.075GluGlu: 8.075 ± 2.765
1.938GluPhe: 1.938 ± 0.647
4.845GluGly: 4.845 ± 0.803
1.292GluHis: 1.292 ± 0.612
3.23GluIle: 3.23 ± 1.032
4.199GluLys: 4.199 ± 1.341
6.137GluLeu: 6.137 ± 1.717
1.292GluMet: 1.292 ± 0.532
2.907GluAsn: 2.907 ± 0.656
2.907GluPro: 2.907 ± 1.019
4.199GluGln: 4.199 ± 1.187
3.553GluArg: 3.553 ± 0.622
3.876GluSer: 3.876 ± 0.847
4.522GluThr: 4.522 ± 1.282
3.553GluVal: 3.553 ± 0.723
2.584GluTrp: 2.584 ± 0.779
0.323GluTyr: 0.323 ± 0.433
0.0GluXaa: 0.0 ± 0.0
Phe
1.292PheAla: 1.292 ± 0.577
0.969PheCys: 0.969 ± 0.599
1.615PheAsp: 1.615 ± 0.9
0.646PheGlu: 0.646 ± 0.345
1.615PhePhe: 1.615 ± 1.079
2.584PheGly: 2.584 ± 0.528
0.969PheHis: 0.969 ± 1.115
0.323PheIle: 0.323 ± 0.22
1.292PheLys: 1.292 ± 0.431
2.907PheLeu: 2.907 ± 0.778
0.0PheMet: 0.0 ± 0.0
1.292PheAsn: 1.292 ± 0.773
1.938PhePro: 1.938 ± 0.989
1.615PheGln: 1.615 ± 0.809
3.23PheArg: 3.23 ± 0.775
1.292PheSer: 1.292 ± 0.957
1.292PheThr: 1.292 ± 0.84
0.969PheVal: 0.969 ± 0.571
0.323PheTrp: 0.323 ± 0.281
0.969PheTyr: 0.969 ± 0.529
0.0PheXaa: 0.0 ± 0.0
Gly
5.814GlyAla: 5.814 ± 1.513
1.615GlyCys: 1.615 ± 0.69
2.584GlyAsp: 2.584 ± 0.824
4.199GlyGlu: 4.199 ± 1.386
2.907GlyPhe: 2.907 ± 0.786
6.46GlyGly: 6.46 ± 2.231
2.261GlyHis: 2.261 ± 0.57
6.137GlyIle: 6.137 ± 1.425
4.522GlyLys: 4.522 ± 1.785
4.199GlyLeu: 4.199 ± 1.043
0.323GlyMet: 0.323 ± 0.444
2.261GlyAsn: 2.261 ± 0.671
4.522GlyPro: 4.522 ± 1.601
5.168GlyGln: 5.168 ± 1.395
3.553GlyArg: 3.553 ± 1.029
2.907GlySer: 2.907 ± 1.272
2.261GlyThr: 2.261 ± 0.411
2.907GlyVal: 2.907 ± 0.773
2.907GlyTrp: 2.907 ± 1.623
2.584GlyTyr: 2.584 ± 0.929
0.0GlyXaa: 0.0 ± 0.0
His
0.646HisAla: 0.646 ± 0.324
0.969HisCys: 0.969 ± 0.781
0.323HisAsp: 0.323 ± 0.22
0.323HisGlu: 0.323 ± 0.281
0.969HisPhe: 0.969 ± 0.628
0.646HisGly: 0.646 ± 0.488
0.0HisHis: 0.0 ± 0.0
1.292HisIle: 1.292 ± 0.524
1.292HisLys: 1.292 ± 0.484
3.876HisLeu: 3.876 ± 0.737
1.292HisMet: 1.292 ± 0.888
1.615HisAsn: 1.615 ± 1.028
1.938HisPro: 1.938 ± 1.103
2.261HisGln: 2.261 ± 1.234
0.323HisArg: 0.323 ± 0.558
0.969HisSer: 0.969 ± 0.402
2.261HisThr: 2.261 ± 0.812
0.969HisVal: 0.969 ± 0.824
0.969HisTrp: 0.969 ± 1.298
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.584IleAla: 2.584 ± 0.68
0.646IleCys: 0.646 ± 0.44
1.292IleAsp: 1.292 ± 0.841
4.522IleGlu: 4.522 ± 1.604
0.969IlePhe: 0.969 ± 0.266
4.199IleGly: 4.199 ± 1.382
2.261IleHis: 2.261 ± 0.379
5.168IleIle: 5.168 ± 1.669
4.522IleLys: 4.522 ± 0.749
5.168IleLeu: 5.168 ± 0.71
1.292IleMet: 1.292 ± 0.601
1.938IleAsn: 1.938 ± 0.532
4.199IlePro: 4.199 ± 1.04
2.584IleGln: 2.584 ± 0.862
4.845IleArg: 4.845 ± 1.107
0.969IleSer: 0.969 ± 0.533
1.615IleThr: 1.615 ± 0.84
4.199IleVal: 4.199 ± 0.615
1.938IleTrp: 1.938 ± 1.077
1.615IleTyr: 1.615 ± 0.57
0.0IleXaa: 0.0 ± 0.0
Lys
5.491LysAla: 5.491 ± 1.694
1.615LysCys: 1.615 ± 0.925
3.23LysAsp: 3.23 ± 1.26
5.814LysGlu: 5.814 ± 0.783
1.938LysPhe: 1.938 ± 0.49
4.199LysGly: 4.199 ± 0.875
1.292LysHis: 1.292 ± 0.931
5.491LysIle: 5.491 ± 1.827
4.845LysLys: 4.845 ± 1.273
5.814LysLeu: 5.814 ± 1.257
1.292LysMet: 1.292 ± 0.532
4.522LysAsn: 4.522 ± 2.141
2.907LysPro: 2.907 ± 0.654
3.553LysGln: 3.553 ± 1.077
0.969LysArg: 0.969 ± 0.266
1.615LysSer: 1.615 ± 0.755
4.522LysThr: 4.522 ± 1.468
4.522LysVal: 4.522 ± 1.198
0.646LysTrp: 0.646 ± 0.3
3.876LysTyr: 3.876 ± 1.037
0.0LysXaa: 0.0 ± 0.0
Leu
5.168LeuAla: 5.168 ± 1.113
0.969LeuCys: 0.969 ± 0.556
3.876LeuAsp: 3.876 ± 0.83
7.429LeuGlu: 7.429 ± 1.44
1.938LeuPhe: 1.938 ± 1.055
5.491LeuGly: 5.491 ± 0.77
1.938LeuHis: 1.938 ± 1.468
3.553LeuIle: 3.553 ± 0.866
4.522LeuLys: 4.522 ± 0.873
6.137LeuLeu: 6.137 ± 1.415
0.969LeuMet: 0.969 ± 0.517
4.522LeuAsn: 4.522 ± 1.127
4.845LeuPro: 4.845 ± 0.962
5.491LeuGln: 5.491 ± 1.268
7.106LeuArg: 7.106 ± 1.335
3.553LeuSer: 3.553 ± 1.275
3.553LeuThr: 3.553 ± 0.987
6.783LeuVal: 6.783 ± 1.135
3.876LeuTrp: 3.876 ± 1.043
1.292LeuTyr: 1.292 ± 0.506
0.0LeuXaa: 0.0 ± 0.0
Met
2.261MetAla: 2.261 ± 0.772
0.323MetCys: 0.323 ± 0.281
1.615MetAsp: 1.615 ± 0.667
1.292MetGlu: 1.292 ± 0.382
0.323MetPhe: 0.323 ± 0.341
1.938MetGly: 1.938 ± 0.375
0.323MetHis: 0.323 ± 0.433
0.323MetIle: 0.323 ± 0.281
1.938MetLys: 1.938 ± 0.576
2.261MetLeu: 2.261 ± 0.6
0.0MetMet: 0.0 ± 0.0
1.292MetAsn: 1.292 ± 0.441
0.0MetPro: 0.0 ± 0.0
0.323MetGln: 0.323 ± 0.22
0.0MetArg: 0.0 ± 0.0
0.323MetSer: 0.323 ± 0.368
1.615MetThr: 1.615 ± 0.917
0.969MetVal: 0.969 ± 0.446
0.0MetTrp: 0.0 ± 0.0
0.323MetTyr: 0.323 ± 0.341
0.0MetXaa: 0.0 ± 0.0
Asn
1.938AsnAla: 1.938 ± 1.25
1.938AsnCys: 1.938 ± 1.308
1.615AsnAsp: 1.615 ± 1.079
1.938AsnGlu: 1.938 ± 0.423
1.615AsnPhe: 1.615 ± 0.75
2.261AsnGly: 2.261 ± 0.478
0.646AsnHis: 0.646 ± 0.446
4.199AsnIle: 4.199 ± 1.008
2.261AsnLys: 2.261 ± 1.02
3.553AsnLeu: 3.553 ± 1.245
0.969AsnMet: 0.969 ± 0.72
2.261AsnAsn: 2.261 ± 0.923
3.23AsnPro: 3.23 ± 1.241
1.938AsnGln: 1.938 ± 0.838
2.261AsnArg: 2.261 ± 1.12
3.553AsnSer: 3.553 ± 0.817
2.907AsnThr: 2.907 ± 1.583
1.615AsnVal: 1.615 ± 0.454
2.261AsnTrp: 2.261 ± 0.552
1.938AsnTyr: 1.938 ± 0.814
0.0AsnXaa: 0.0 ± 0.0
Pro
4.199ProAla: 4.199 ± 1.31
1.615ProCys: 1.615 ± 0.913
2.584ProAsp: 2.584 ± 0.827
3.876ProGlu: 3.876 ± 0.844
1.292ProPhe: 1.292 ± 0.622
4.522ProGly: 4.522 ± 2.241
0.646ProHis: 0.646 ± 0.42
2.261ProIle: 2.261 ± 0.721
3.23ProLys: 3.23 ± 1.334
7.106ProLeu: 7.106 ± 1.55
1.292ProMet: 1.292 ± 0.532
1.615ProAsn: 1.615 ± 0.575
5.168ProPro: 5.168 ± 3.3
3.876ProGln: 3.876 ± 1.045
4.522ProArg: 4.522 ± 1.915
3.23ProSer: 3.23 ± 0.98
3.553ProThr: 3.553 ± 1.616
4.522ProVal: 4.522 ± 1.085
0.646ProTrp: 0.646 ± 0.541
1.938ProTyr: 1.938 ± 0.795
0.0ProXaa: 0.0 ± 0.0
Gln
3.876GlnAla: 3.876 ± 1.386
0.646GlnCys: 0.646 ± 0.598
0.969GlnAsp: 0.969 ± 0.575
7.429GlnGlu: 7.429 ± 2.026
0.646GlnPhe: 0.646 ± 0.427
6.783GlnGly: 6.783 ± 2.187
0.323GlnHis: 0.323 ± 0.341
4.199GlnIle: 4.199 ± 1.34
6.46GlnLys: 6.46 ± 1.678
3.553GlnLeu: 3.553 ± 1.842
1.292GlnMet: 1.292 ± 0.819
1.615GlnAsn: 1.615 ± 0.925
1.938GlnPro: 1.938 ± 0.891
8.398GlnGln: 8.398 ± 0.878
3.876GlnArg: 3.876 ± 1.231
1.615GlnSer: 1.615 ± 0.483
2.584GlnThr: 2.584 ± 1.145
4.845GlnVal: 4.845 ± 1.55
2.584GlnTrp: 2.584 ± 0.907
2.584GlnTyr: 2.584 ± 0.568
0.0GlnXaa: 0.0 ± 0.0
Arg
2.907ArgAla: 2.907 ± 1.317
0.969ArgCys: 0.969 ± 0.684
3.23ArgAsp: 3.23 ± 0.548
3.553ArgGlu: 3.553 ± 1.488
1.292ArgPhe: 1.292 ± 1.132
3.553ArgGly: 3.553 ± 1.463
1.292ArgHis: 1.292 ± 0.845
2.907ArgIle: 2.907 ± 1.149
2.261ArgLys: 2.261 ± 0.431
5.168ArgLeu: 5.168 ± 1.48
0.969ArgMet: 0.969 ± 0.824
2.584ArgAsn: 2.584 ± 0.487
5.168ArgPro: 5.168 ± 2.115
6.46ArgGln: 6.46 ± 1.302
5.168ArgArg: 5.168 ± 1.938
0.969ArgSer: 0.969 ± 0.437
2.584ArgThr: 2.584 ± 0.554
2.261ArgVal: 2.261 ± 0.969
1.292ArgTrp: 1.292 ± 0.884
2.261ArgTyr: 2.261 ± 1.038
0.0ArgXaa: 0.0 ± 0.0
Ser
2.584SerAla: 2.584 ± 1.23
0.323SerCys: 0.323 ± 0.22
2.261SerAsp: 2.261 ± 0.921
1.938SerGlu: 1.938 ± 1.206
0.323SerPhe: 0.323 ± 0.283
3.553SerGly: 3.553 ± 1.036
1.292SerHis: 1.292 ± 0.967
0.969SerIle: 0.969 ± 0.266
3.876SerLys: 3.876 ± 1.1
4.845SerLeu: 4.845 ± 2.078
0.969SerMet: 0.969 ± 0.463
1.615SerAsn: 1.615 ± 0.69
4.199SerPro: 4.199 ± 0.983
3.553SerGln: 3.553 ± 1.124
1.938SerArg: 1.938 ± 0.59
4.199SerSer: 4.199 ± 1.201
3.23SerThr: 3.23 ± 1.734
4.199SerVal: 4.199 ± 1.387
0.969SerTrp: 0.969 ± 0.607
1.292SerTyr: 1.292 ± 0.946
0.0SerXaa: 0.0 ± 0.0
Thr
7.106ThrAla: 7.106 ± 1.216
0.646ThrCys: 0.646 ± 0.532
2.261ThrAsp: 2.261 ± 0.428
3.876ThrGlu: 3.876 ± 0.509
0.646ThrPhe: 0.646 ± 0.3
3.553ThrGly: 3.553 ± 1.479
0.969ThrHis: 0.969 ± 0.484
3.23ThrIle: 3.23 ± 1.226
2.584ThrLys: 2.584 ± 0.609
2.907ThrLeu: 2.907 ± 0.496
0.0ThrMet: 0.0 ± 0.0
3.876ThrAsn: 3.876 ± 1.233
3.23ThrPro: 3.23 ± 0.703
1.938ThrGln: 1.938 ± 1.016
2.584ThrArg: 2.584 ± 0.655
3.553ThrSer: 3.553 ± 1.239
3.23ThrThr: 3.23 ± 0.76
4.845ThrVal: 4.845 ± 2.069
1.292ThrTrp: 1.292 ± 0.538
1.615ThrTyr: 1.615 ± 0.913
0.0ThrXaa: 0.0 ± 0.0
Val
3.876ValAla: 3.876 ± 1.308
1.615ValCys: 1.615 ± 0.661
2.907ValAsp: 2.907 ± 0.947
2.261ValGlu: 2.261 ± 0.741
0.969ValPhe: 0.969 ± 0.539
4.199ValGly: 4.199 ± 0.371
2.261ValHis: 2.261 ± 0.75
4.845ValIle: 4.845 ± 0.679
5.491ValLys: 5.491 ± 1.595
5.168ValLeu: 5.168 ± 1.246
0.323ValMet: 0.323 ± 0.22
2.907ValAsn: 2.907 ± 1.144
4.199ValPro: 4.199 ± 1.198
3.553ValGln: 3.553 ± 0.721
1.938ValArg: 1.938 ± 0.97
3.553ValSer: 3.553 ± 0.766
3.876ValThr: 3.876 ± 1.376
2.584ValVal: 2.584 ± 0.68
2.907ValTrp: 2.907 ± 0.67
1.615ValTyr: 1.615 ± 0.852
0.0ValXaa: 0.0 ± 0.0
Trp
1.615TrpAla: 1.615 ± 0.441
0.969TrpCys: 0.969 ± 0.605
1.292TrpAsp: 1.292 ± 0.664
1.615TrpGlu: 1.615 ± 0.661
1.615TrpPhe: 1.615 ± 1.132
2.907TrpGly: 2.907 ± 0.691
0.323TrpHis: 0.323 ± 0.433
0.969TrpIle: 0.969 ± 0.472
3.876TrpLys: 3.876 ± 0.805
2.261TrpLeu: 2.261 ± 1.558
0.969TrpMet: 0.969 ± 0.266
0.646TrpAsn: 0.646 ± 0.598
0.969TrpPro: 0.969 ± 0.661
2.584TrpGln: 2.584 ± 0.594
1.938TrpArg: 1.938 ± 0.617
1.938TrpSer: 1.938 ± 0.817
1.615TrpThr: 1.615 ± 0.826
1.292TrpVal: 1.292 ± 0.708
0.969TrpTrp: 0.969 ± 0.575
1.938TrpTyr: 1.938 ± 0.589
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.907TyrAla: 2.907 ± 0.498
1.292TyrCys: 1.292 ± 0.705
1.292TyrAsp: 1.292 ± 0.667
1.292TyrGlu: 1.292 ± 0.264
0.969TyrPhe: 0.969 ± 0.419
0.969TyrGly: 0.969 ± 0.484
0.969TyrHis: 0.969 ± 0.513
0.646TyrIle: 0.646 ± 0.44
2.584TyrLys: 2.584 ± 0.841
2.261TyrLeu: 2.261 ± 0.868
0.646TyrMet: 0.646 ± 0.3
2.261TyrAsn: 2.261 ± 0.671
1.292TyrPro: 1.292 ± 0.6
1.938TyrGln: 1.938 ± 0.784
0.969TyrArg: 0.969 ± 0.539
2.907TyrSer: 2.907 ± 0.917
2.584TyrThr: 2.584 ± 0.888
2.584TyrVal: 2.584 ± 1.002
0.969TyrTrp: 0.969 ± 0.419
1.615TyrTyr: 1.615 ± 0.828
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (3097 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski