Amino acid dipepetide frequency for Human immunodeficiency virus type 2 subtype B (isolate UC1) (HIV-2)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.386AlaAla: 4.386 ± 1.386
1.29AlaCys: 1.29 ± 0.559
2.064AlaAsp: 2.064 ± 0.96
6.966AlaGlu: 6.966 ± 1.926
1.806AlaPhe: 1.806 ± 0.515
5.676AlaGly: 5.676 ± 1.178
1.806AlaHis: 1.806 ± 0.548
2.838AlaIle: 2.838 ± 1.059
2.838AlaLys: 2.838 ± 1.014
5.934AlaLeu: 5.934 ± 1.123
2.58AlaMet: 2.58 ± 0.673
2.838AlaAsn: 2.838 ± 0.668
4.644AlaPro: 4.644 ± 1.41
3.612AlaGln: 3.612 ± 0.651
2.064AlaArg: 2.064 ± 0.447
2.838AlaSer: 2.838 ± 1.204
2.58AlaThr: 2.58 ± 0.701
3.354AlaVal: 3.354 ± 0.875
2.064AlaTrp: 2.064 ± 0.523
1.806AlaTyr: 1.806 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
1.032CysAla: 1.032 ± 0.694
0.516CysCys: 0.516 ± 0.578
1.032CysAsp: 1.032 ± 0.507
1.032CysGlu: 1.032 ± 0.716
0.774CysPhe: 0.774 ± 0.715
1.29CysGly: 1.29 ± 0.659
1.032CysHis: 1.032 ± 0.406
0.774CysIle: 0.774 ± 0.361
1.29CysLys: 1.29 ± 0.559
1.806CysLeu: 1.806 ± 0.577
0.258CysMet: 0.258 ± 0.173
2.064CysAsn: 2.064 ± 1.436
0.774CysPro: 0.774 ± 0.257
1.548CysGln: 1.548 ± 0.689
1.806CysArg: 1.806 ± 0.551
0.516CysSer: 0.516 ± 0.411
1.29CysThr: 1.29 ± 0.382
2.064CysVal: 2.064 ± 0.742
1.548CysTrp: 1.548 ± 0.529
1.548CysTyr: 1.548 ± 1.191
0.0CysXaa: 0.0 ± 0.0
Asp
1.032AspAla: 1.032 ± 0.471
1.29AspCys: 1.29 ± 0.382
1.29AspAsp: 1.29 ± 0.643
2.064AspGlu: 2.064 ± 0.838
1.032AspPhe: 1.032 ± 0.47
1.548AspGly: 1.548 ± 0.623
1.29AspHis: 1.29 ± 0.559
2.58AspIle: 2.58 ± 1.095
1.29AspLys: 1.29 ± 0.473
2.064AspLeu: 2.064 ± 1.193
1.032AspMet: 1.032 ± 0.562
2.064AspAsn: 2.064 ± 1.205
4.128AspPro: 4.128 ± 1.147
2.58AspGln: 2.58 ± 0.823
2.58AspArg: 2.58 ± 1.144
2.322AspSer: 2.322 ± 1.09
4.902AspThr: 4.902 ± 0.92
2.838AspVal: 2.838 ± 0.977
1.032AspTrp: 1.032 ± 0.349
1.032AspTyr: 1.032 ± 0.464
0.0AspXaa: 0.0 ± 0.0
Glu
7.74GluAla: 7.74 ± 1.066
0.258GluCys: 0.258 ± 0.25
2.322GluAsp: 2.322 ± 0.922
8.256GluGlu: 8.256 ± 2.286
0.516GluPhe: 0.516 ± 0.377
6.708GluGly: 6.708 ± 1.652
0.774GluHis: 0.774 ± 0.447
3.87GluIle: 3.87 ± 0.898
7.224GluLys: 7.224 ± 1.354
6.45GluLeu: 6.45 ± 1.207
1.548GluMet: 1.548 ± 0.715
2.58GluAsn: 2.58 ± 0.983
3.354GluPro: 3.354 ± 1.326
3.354GluGln: 3.354 ± 0.835
3.87GluArg: 3.87 ± 1.101
3.354GluSer: 3.354 ± 1.313
4.644GluThr: 4.644 ± 1.21
4.386GluVal: 4.386 ± 0.988
0.516GluTrp: 0.516 ± 0.303
0.516GluTyr: 0.516 ± 0.236
0.0GluXaa: 0.0 ± 0.0
Phe
1.548PheAla: 1.548 ± 0.525
0.258PheCys: 0.258 ± 0.25
1.29PheAsp: 1.29 ± 0.781
0.258PheGlu: 0.258 ± 0.25
0.516PhePhe: 0.516 ± 0.236
3.096PheGly: 3.096 ± 0.767
0.516PheHis: 0.516 ± 0.236
1.29PheIle: 1.29 ± 0.464
0.774PheLys: 0.774 ± 0.329
3.612PheLeu: 3.612 ± 1.094
0.258PheMet: 0.258 ± 0.25
1.29PheAsn: 1.29 ± 0.397
1.29PhePro: 1.29 ± 0.67
2.838PheGln: 2.838 ± 0.921
1.548PheArg: 1.548 ± 0.506
1.29PheSer: 1.29 ± 0.493
1.548PheThr: 1.548 ± 0.515
0.516PheVal: 0.516 ± 0.325
0.0PheTrp: 0.0 ± 0.0
1.29PheTyr: 1.29 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
3.096GlyAla: 3.096 ± 0.665
2.838GlyCys: 2.838 ± 0.84
3.096GlyAsp: 3.096 ± 0.756
4.128GlyGlu: 4.128 ± 1.163
3.612GlyPhe: 3.612 ± 0.858
5.676GlyGly: 5.676 ± 1.787
1.806GlyHis: 1.806 ± 0.523
5.16GlyIle: 5.16 ± 1.068
6.45GlyLys: 6.45 ± 2.361
6.966GlyLeu: 6.966 ± 1.324
1.29GlyMet: 1.29 ± 0.645
4.644GlyAsn: 4.644 ± 0.98
4.128GlyPro: 4.128 ± 1.228
3.612GlyGln: 3.612 ± 0.919
2.064GlyArg: 2.064 ± 0.504
5.16GlySer: 5.16 ± 1.3
3.87GlyThr: 3.87 ± 1.283
2.064GlyVal: 2.064 ± 0.632
1.548GlyTrp: 1.548 ± 0.913
1.548GlyTyr: 1.548 ± 0.527
0.0GlyXaa: 0.0 ± 0.0
His
0.516HisAla: 0.516 ± 0.304
1.032HisCys: 1.032 ± 0.507
1.032HisAsp: 1.032 ± 0.562
0.774HisGlu: 0.774 ± 0.429
1.29HisPhe: 1.29 ± 0.934
1.548HisGly: 1.548 ± 0.745
0.516HisHis: 0.516 ± 0.35
1.548HisIle: 1.548 ± 0.531
1.548HisLys: 1.548 ± 0.377
2.838HisLeu: 2.838 ± 0.808
0.774HisMet: 0.774 ± 0.257
0.0HisAsn: 0.0 ± 0.0
1.29HisPro: 1.29 ± 0.297
1.032HisGln: 1.032 ± 0.5
1.548HisArg: 1.548 ± 0.736
1.548HisSer: 1.548 ± 0.945
1.548HisThr: 1.548 ± 0.413
0.516HisVal: 0.516 ± 0.346
0.0HisTrp: 0.0 ± 0.0
0.774HisTyr: 0.774 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
2.322IleAla: 2.322 ± 0.746
1.29IleCys: 1.29 ± 0.628
1.032IleAsp: 1.032 ± 0.369
3.096IleGlu: 3.096 ± 1.411
1.032IlePhe: 1.032 ± 0.499
3.096IleGly: 3.096 ± 0.576
0.516IleHis: 0.516 ± 0.359
4.128IleIle: 4.128 ± 1.281
3.87IleLys: 3.87 ± 0.907
4.644IleLeu: 4.644 ± 0.755
1.29IleMet: 1.29 ± 0.371
1.806IleAsn: 1.806 ± 0.86
5.16IlePro: 5.16 ± 0.381
3.612IleGln: 3.612 ± 0.788
4.386IleArg: 4.386 ± 0.996
1.29IleSer: 1.29 ± 0.971
2.838IleThr: 2.838 ± 1.554
3.096IleVal: 3.096 ± 0.956
1.548IleTrp: 1.548 ± 0.455
2.322IleTyr: 2.322 ± 0.564
0.0IleXaa: 0.0 ± 0.0
Lys
2.322LysAla: 2.322 ± 0.664
3.096LysCys: 3.096 ± 1.148
3.87LysAsp: 3.87 ± 0.685
6.45LysGlu: 6.45 ± 2.185
1.29LysPhe: 1.29 ± 0.642
3.87LysGly: 3.87 ± 0.889
1.548LysHis: 1.548 ± 0.637
4.128LysIle: 4.128 ± 1.734
4.902LysLys: 4.902 ± 1.679
3.87LysLeu: 3.87 ± 0.876
1.806LysMet: 1.806 ± 0.739
3.354LysAsn: 3.354 ± 0.893
1.29LysPro: 1.29 ± 0.623
3.87LysGln: 3.87 ± 1.15
5.16LysArg: 5.16 ± 0.766
2.064LysSer: 2.064 ± 1.124
1.806LysThr: 1.806 ± 0.588
6.192LysVal: 6.192 ± 1.738
0.774LysTrp: 0.774 ± 0.429
1.806LysTyr: 1.806 ± 0.693
0.0LysXaa: 0.0 ± 0.0
Leu
8.256LeuAla: 8.256 ± 1.408
0.774LeuCys: 0.774 ± 0.595
3.354LeuAsp: 3.354 ± 0.747
6.966LeuGlu: 6.966 ± 0.791
2.58LeuPhe: 2.58 ± 0.536
5.16LeuGly: 5.16 ± 0.756
2.322LeuHis: 2.322 ± 0.997
3.354LeuIle: 3.354 ± 0.867
6.192LeuLys: 6.192 ± 1.384
8.514LeuLeu: 8.514 ± 2.605
1.29LeuMet: 1.29 ± 0.333
4.902LeuAsn: 4.902 ± 1.024
3.354LeuPro: 3.354 ± 0.74
5.16LeuGln: 5.16 ± 1.321
5.676LeuArg: 5.676 ± 1.579
4.644LeuSer: 4.644 ± 1.219
4.386LeuThr: 4.386 ± 0.971
5.418LeuVal: 5.418 ± 1.376
1.548LeuTrp: 1.548 ± 0.608
1.29LeuTyr: 1.29 ± 0.878
0.0LeuXaa: 0.0 ± 0.0
Met
2.58MetAla: 2.58 ± 0.48
0.0MetCys: 0.0 ± 0.0
1.032MetAsp: 1.032 ± 0.429
2.064MetGlu: 2.064 ± 0.807
0.258MetPhe: 0.258 ± 0.329
2.064MetGly: 2.064 ± 0.559
0.258MetHis: 0.258 ± 0.25
0.516MetIle: 0.516 ± 0.346
0.258MetLys: 0.258 ± 0.325
1.806MetLeu: 1.806 ± 0.793
0.258MetMet: 0.258 ± 0.25
1.032MetAsn: 1.032 ± 0.496
1.29MetPro: 1.29 ± 0.505
1.032MetGln: 1.032 ± 0.349
0.774MetArg: 0.774 ± 0.454
1.548MetSer: 1.548 ± 0.498
3.354MetThr: 3.354 ± 0.617
0.258MetVal: 0.258 ± 0.25
0.258MetTrp: 0.258 ± 0.25
1.548MetTyr: 1.548 ± 0.514
0.0MetXaa: 0.0 ± 0.0
Asn
2.064AsnAla: 2.064 ± 0.559
1.29AsnCys: 1.29 ± 0.505
1.548AsnAsp: 1.548 ± 0.908
3.096AsnGlu: 3.096 ± 0.826
2.322AsnPhe: 2.322 ± 0.539
1.548AsnGly: 1.548 ± 0.908
1.032AsnHis: 1.032 ± 0.454
2.838AsnIle: 2.838 ± 0.766
2.064AsnLys: 2.064 ± 0.662
2.838AsnLeu: 2.838 ± 0.938
1.29AsnMet: 1.29 ± 0.939
1.806AsnAsn: 1.806 ± 1.163
3.612AsnPro: 3.612 ± 1.687
2.322AsnGln: 2.322 ± 0.858
3.096AsnArg: 3.096 ± 1.128
2.322AsnSer: 2.322 ± 0.726
4.644AsnThr: 4.644 ± 1.789
0.258AsnVal: 0.258 ± 0.173
1.806AsnTrp: 1.806 ± 0.56
2.58AsnTyr: 2.58 ± 0.449
0.0AsnXaa: 0.0 ± 0.0
Pro
5.16ProAla: 5.16 ± 1.736
0.774ProCys: 0.774 ± 0.454
1.548ProAsp: 1.548 ± 0.377
3.354ProGlu: 3.354 ± 1.219
1.548ProPhe: 1.548 ± 0.746
6.45ProGly: 6.45 ± 1.275
0.774ProHis: 0.774 ± 0.429
2.838ProIle: 2.838 ± 0.928
1.548ProLys: 1.548 ± 0.525
6.708ProLeu: 6.708 ± 1.002
0.774ProMet: 0.774 ± 0.783
1.032ProAsn: 1.032 ± 0.379
5.418ProPro: 5.418 ± 2.448
2.58ProGln: 2.58 ± 1.074
5.676ProArg: 5.676 ± 1.199
4.386ProSer: 4.386 ± 0.938
4.386ProThr: 4.386 ± 1.526
4.386ProVal: 4.386 ± 1.12
1.032ProTrp: 1.032 ± 0.596
1.806ProTyr: 1.806 ± 0.726
0.0ProXaa: 0.0 ± 0.0
Gln
3.87GlnAla: 3.87 ± 0.89
0.774GlnCys: 0.774 ± 0.257
1.032GlnAsp: 1.032 ± 0.562
5.418GlnGlu: 5.418 ± 1.341
0.774GlnPhe: 0.774 ± 0.429
5.934GlnGly: 5.934 ± 1.657
0.774GlnHis: 0.774 ± 0.404
4.644GlnIle: 4.644 ± 0.858
4.644GlnLys: 4.644 ± 1.174
4.644GlnLeu: 4.644 ± 1.069
1.29GlnMet: 1.29 ± 0.493
1.548GlnAsn: 1.548 ± 0.67
2.322GlnPro: 2.322 ± 0.965
5.418GlnGln: 5.418 ± 1.305
2.838GlnArg: 2.838 ± 1.579
2.064GlnSer: 2.064 ± 0.923
3.096GlnThr: 3.096 ± 1.103
2.838GlnVal: 2.838 ± 0.84
1.806GlnTrp: 1.806 ± 0.682
1.548GlnTyr: 1.548 ± 0.659
0.0GlnXaa: 0.0 ± 0.0
Arg
4.644ArgAla: 4.644 ± 0.97
1.032ArgCys: 1.032 ± 0.772
2.322ArgAsp: 2.322 ± 0.727
4.902ArgGlu: 4.902 ± 0.946
1.548ArgPhe: 1.548 ± 0.501
4.902ArgGly: 4.902 ± 0.726
2.064ArgHis: 2.064 ± 0.657
2.58ArgIle: 2.58 ± 0.671
2.838ArgLys: 2.838 ± 0.905
5.16ArgLeu: 5.16 ± 1.135
1.29ArgMet: 1.29 ± 0.368
2.58ArgAsn: 2.58 ± 1.018
3.354ArgPro: 3.354 ± 0.906
3.87ArgGln: 3.87 ± 1.003
5.676ArgArg: 5.676 ± 1.813
3.612ArgSer: 3.612 ± 1.347
4.128ArgThr: 4.128 ± 1.481
2.322ArgVal: 2.322 ± 0.889
1.29ArgTrp: 1.29 ± 0.577
2.322ArgTyr: 2.322 ± 0.715
0.0ArgXaa: 0.0 ± 0.0
Ser
3.612SerAla: 3.612 ± 1.5
2.322SerCys: 2.322 ± 0.768
3.87SerAsp: 3.87 ± 1.377
2.838SerGlu: 2.838 ± 1.402
0.258SerPhe: 0.258 ± 0.348
4.128SerGly: 4.128 ± 0.863
0.258SerHis: 0.258 ± 0.353
1.548SerIle: 1.548 ± 0.623
3.096SerLys: 3.096 ± 0.729
5.16SerLeu: 5.16 ± 1.335
0.774SerMet: 0.774 ± 0.499
1.29SerAsn: 1.29 ± 1.024
2.838SerPro: 2.838 ± 0.813
3.87SerGln: 3.87 ± 1.2
3.096SerArg: 3.096 ± 1.054
7.74SerSer: 7.74 ± 4.507
2.838SerThr: 2.838 ± 0.74
1.548SerVal: 1.548 ± 0.858
1.548SerTrp: 1.548 ± 1.209
2.58SerTyr: 2.58 ± 1.331
0.0SerXaa: 0.0 ± 0.0
Thr
4.644ThrAla: 4.644 ± 0.856
1.806ThrCys: 1.806 ± 0.669
3.096ThrAsp: 3.096 ± 1.014
5.418ThrGlu: 5.418 ± 1.162
0.774ThrPhe: 0.774 ± 0.439
2.58ThrGly: 2.58 ± 0.709
1.032ThrHis: 1.032 ± 0.553
2.322ThrIle: 2.322 ± 0.944
2.838ThrLys: 2.838 ± 1.737
4.902ThrLeu: 4.902 ± 0.845
0.774ThrMet: 0.774 ± 0.543
3.096ThrAsn: 3.096 ± 1.162
6.192ThrPro: 6.192 ± 1.08
2.58ThrGln: 2.58 ± 0.705
2.58ThrArg: 2.58 ± 1.19
5.16ThrSer: 5.16 ± 1.837
3.612ThrThr: 3.612 ± 1.914
5.418ThrVal: 5.418 ± 1.424
2.838ThrTrp: 2.838 ± 1.411
1.548ThrTyr: 1.548 ± 0.969
0.0ThrXaa: 0.0 ± 0.0
Val
2.838ValAla: 2.838 ± 0.82
0.774ValCys: 0.774 ± 0.257
1.806ValAsp: 1.806 ± 0.335
3.87ValGlu: 3.87 ± 0.87
1.032ValPhe: 1.032 ± 0.694
4.644ValGly: 4.644 ± 1.032
1.806ValHis: 1.806 ± 0.674
3.096ValIle: 3.096 ± 0.815
4.386ValLys: 4.386 ± 1.159
4.644ValLeu: 4.644 ± 1.557
0.774ValMet: 0.774 ± 0.333
3.612ValAsn: 3.612 ± 0.683
4.902ValPro: 4.902 ± 1.083
2.58ValGln: 2.58 ± 0.559
3.354ValArg: 3.354 ± 0.931
1.806ValSer: 1.806 ± 0.634
3.87ValThr: 3.87 ± 0.823
3.87ValVal: 3.87 ± 0.69
1.29ValTrp: 1.29 ± 0.553
1.29ValTyr: 1.29 ± 0.667
0.0ValXaa: 0.0 ± 0.0
Trp
1.29TrpAla: 1.29 ± 0.489
0.258TrpCys: 0.258 ± 0.25
2.064TrpAsp: 2.064 ± 0.317
0.774TrpGlu: 0.774 ± 0.587
1.29TrpPhe: 1.29 ± 1.249
1.29TrpGly: 1.29 ± 0.623
0.774TrpHis: 0.774 ± 0.584
1.548TrpIle: 1.548 ± 0.37
2.322TrpLys: 2.322 ± 0.608
0.774TrpLeu: 0.774 ± 0.534
1.548TrpMet: 1.548 ± 0.531
1.032TrpAsn: 1.032 ± 0.337
1.29TrpPro: 1.29 ± 0.546
1.548TrpGln: 1.548 ± 0.511
1.806TrpArg: 1.806 ± 0.8
0.258TrpSer: 0.258 ± 0.25
1.548TrpThr: 1.548 ± 0.623
1.29TrpVal: 1.29 ± 0.307
0.774TrpTrp: 0.774 ± 0.429
1.032TrpTyr: 1.032 ± 0.719
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.032TyrAla: 1.032 ± 0.519
2.322TyrCys: 2.322 ± 0.634
1.29TyrAsp: 1.29 ± 0.536
0.774TyrGlu: 0.774 ± 0.454
0.774TyrPhe: 0.774 ± 0.365
1.548TyrGly: 1.548 ± 0.811
0.774TyrHis: 0.774 ± 0.523
0.774TyrIle: 0.774 ± 0.454
3.096TyrLys: 3.096 ± 0.423
1.806TyrLeu: 1.806 ± 1.079
1.032TyrMet: 1.032 ± 0.406
1.806TyrAsn: 1.806 ± 0.513
1.548TyrPro: 1.548 ± 0.834
0.258TyrGln: 0.258 ± 0.353
2.838TyrArg: 2.838 ± 0.653
1.29TyrSer: 1.29 ± 0.578
2.322TyrThr: 2.322 ± 0.779
3.612TyrVal: 3.612 ± 0.583
1.29TyrTrp: 1.29 ± 0.568
1.548TyrTyr: 1.548 ± 0.505
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3877 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski