Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group M subtype C (isolate 92BR025) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.569AlaAla: 3.569 ± 0.616
1.922AlaCys: 1.922 ± 0.529
1.922AlaAsp: 1.922 ± 0.745
6.041AlaGlu: 6.041 ± 1.004
1.373AlaPhe: 1.373 ± 0.377
4.393AlaGly: 4.393 ± 1.127
0.824AlaHis: 0.824 ± 0.312
4.393AlaIle: 4.393 ± 1.432
2.471AlaLys: 2.471 ± 0.863
7.414AlaLeu: 7.414 ± 1.005
1.647AlaMet: 1.647 ± 0.486
2.197AlaAsn: 2.197 ± 0.781
3.295AlaPro: 3.295 ± 0.959
1.373AlaGln: 1.373 ± 0.471
4.393AlaArg: 4.393 ± 1.076
4.393AlaSer: 4.393 ± 0.759
3.02AlaThr: 3.02 ± 0.818
3.844AlaVal: 3.844 ± 1.131
2.197AlaTrp: 2.197 ± 0.419
1.098AlaTyr: 1.098 ± 0.4
0.0AlaXaa: 0.0 ± 0.0
Cys
0.824CysAla: 0.824 ± 0.527
0.0CysCys: 0.0 ± 0.0
0.824CysAsp: 0.824 ± 0.399
0.0CysGlu: 0.0 ± 0.0
1.647CysPhe: 1.647 ± 0.9
1.373CysGly: 1.373 ± 0.613
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.471CysLys: 2.471 ± 0.944
0.824CysLeu: 0.824 ± 0.648
0.549CysMet: 0.549 ± 0.297
2.197CysAsn: 2.197 ± 1.361
0.275CysPro: 0.275 ± 0.208
1.098CysGln: 1.098 ± 0.384
1.373CysArg: 1.373 ± 0.495
2.197CysSer: 2.197 ± 0.969
1.922CysThr: 1.922 ± 0.436
1.647CysVal: 1.647 ± 0.603
0.824CysTrp: 0.824 ± 0.359
0.549CysTyr: 0.549 ± 0.626
0.0CysXaa: 0.0 ± 0.0
Asp
1.373AspAla: 1.373 ± 0.284
2.746AspCys: 2.746 ± 0.804
1.098AspAsp: 1.098 ± 0.471
1.098AspGlu: 1.098 ± 0.51
0.549AspPhe: 0.549 ± 0.36
1.373AspGly: 1.373 ± 0.516
0.549AspHis: 0.549 ± 0.626
4.393AspIle: 4.393 ± 0.834
3.569AspLys: 3.569 ± 0.858
3.295AspLeu: 3.295 ± 0.708
1.373AspMet: 1.373 ± 0.604
1.647AspAsn: 1.647 ± 0.583
3.295AspPro: 3.295 ± 1.092
1.922AspGln: 1.922 ± 0.619
3.844AspArg: 3.844 ± 1.028
3.295AspSer: 3.295 ± 0.953
3.295AspThr: 3.295 ± 0.638
1.647AspVal: 1.647 ± 0.718
0.824AspTrp: 0.824 ± 0.659
2.197AspTyr: 2.197 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
6.864GluAla: 6.864 ± 1.046
0.0GluCys: 0.0 ± 0.0
2.471GluAsp: 2.471 ± 1.119
7.414GluGlu: 7.414 ± 1.557
1.098GluPhe: 1.098 ± 0.471
5.491GluGly: 5.491 ± 0.773
1.098GluHis: 1.098 ± 0.471
4.668GluIle: 4.668 ± 1.687
3.295GluLys: 3.295 ± 0.62
6.315GluLeu: 6.315 ± 1.419
1.098GluMet: 1.098 ± 0.248
2.746GluAsn: 2.746 ± 0.821
4.393GluPro: 4.393 ± 1.331
3.569GluGln: 3.569 ± 0.657
4.668GluArg: 4.668 ± 0.886
3.02GluSer: 3.02 ± 1.069
3.844GluThr: 3.844 ± 1.204
3.569GluVal: 3.569 ± 0.847
1.373GluTrp: 1.373 ± 0.51
0.549GluTyr: 0.549 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
1.647PheAla: 1.647 ± 0.324
0.275PheCys: 0.275 ± 0.208
1.373PheAsp: 1.373 ± 0.814
0.275PheGlu: 0.275 ± 0.208
1.373PhePhe: 1.373 ± 0.299
1.373PheGly: 1.373 ± 0.533
0.0PheHis: 0.0 ± 0.0
1.373PheIle: 1.373 ± 0.641
1.373PheLys: 1.373 ± 0.487
1.922PheLeu: 1.922 ± 0.472
0.0PheMet: 0.0 ± 0.0
2.197PheAsn: 2.197 ± 0.809
2.197PhePro: 2.197 ± 0.975
1.373PheGln: 1.373 ± 0.733
3.02PheArg: 3.02 ± 0.939
1.098PheSer: 1.098 ± 0.248
1.098PheThr: 1.098 ± 0.497
0.549PheVal: 0.549 ± 0.33
0.549PheTrp: 0.549 ± 0.194
1.098PheTyr: 1.098 ± 0.554
0.0PheXaa: 0.0 ± 0.0
Gly
4.942GlyAla: 4.942 ± 0.899
1.922GlyCys: 1.922 ± 0.457
3.295GlyAsp: 3.295 ± 0.771
3.295GlyGlu: 3.295 ± 0.606
2.197GlyPhe: 2.197 ± 0.736
6.864GlyGly: 6.864 ± 0.894
3.569GlyHis: 3.569 ± 1.875
7.414GlyIle: 7.414 ± 2.067
5.766GlyLys: 5.766 ± 1.989
4.393GlyLeu: 4.393 ± 1.223
0.824GlyMet: 0.824 ± 0.324
2.746GlyAsn: 2.746 ± 0.993
5.491GlyPro: 5.491 ± 1.112
3.295GlyGln: 3.295 ± 1.101
3.569GlyArg: 3.569 ± 0.879
3.569GlySer: 3.569 ± 0.94
3.844GlyThr: 3.844 ± 1.086
3.02GlyVal: 3.02 ± 0.742
1.098GlyTrp: 1.098 ± 0.42
1.922GlyTyr: 1.922 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.098HisAla: 1.098 ± 0.32
1.098HisCys: 1.098 ± 0.731
0.275HisAsp: 0.275 ± 0.208
0.824HisGlu: 0.824 ± 0.359
0.824HisPhe: 0.824 ± 0.704
1.647HisGly: 1.647 ± 0.53
0.549HisHis: 0.549 ± 0.804
0.549HisIle: 0.549 ± 0.561
0.824HisLys: 0.824 ± 0.359
2.746HisLeu: 2.746 ± 0.641
1.373HisMet: 1.373 ± 1.174
1.373HisAsn: 1.373 ± 0.327
2.471HisPro: 2.471 ± 0.888
2.746HisGln: 2.746 ± 0.815
1.373HisArg: 1.373 ± 0.465
1.373HisSer: 1.373 ± 0.597
0.824HisThr: 0.824 ± 0.399
0.275HisVal: 0.275 ± 0.18
0.0HisTrp: 0.0 ± 0.0
1.373HisTyr: 1.373 ± 0.769
0.0HisXaa: 0.0 ± 0.0
Ile
3.02IleAla: 3.02 ± 0.647
1.098IleCys: 1.098 ± 0.389
1.922IleAsp: 1.922 ± 0.688
3.569IleGlu: 3.569 ± 0.852
1.098IlePhe: 1.098 ± 0.554
4.393IleGly: 4.393 ± 1.391
1.373IleHis: 1.373 ± 0.464
7.688IleIle: 7.688 ± 1.556
7.963IleLys: 7.963 ± 1.014
6.59IleLeu: 6.59 ± 0.689
1.373IleMet: 1.373 ± 0.613
2.197IleAsn: 2.197 ± 0.464
3.569IlePro: 3.569 ± 0.51
3.569IleGln: 3.569 ± 1.399
3.02IleArg: 3.02 ± 1.133
3.844IleSer: 3.844 ± 0.774
3.569IleThr: 3.569 ± 1.013
4.942IleVal: 4.942 ± 0.935
2.197IleTrp: 2.197 ± 0.586
2.471IleTyr: 2.471 ± 0.468
0.0IleXaa: 0.0 ± 0.0
Lys
4.119LysAla: 4.119 ± 0.682
1.922LysCys: 1.922 ± 0.474
2.471LysAsp: 2.471 ± 0.934
6.59LysGlu: 6.59 ± 2.419
0.824LysPhe: 0.824 ± 0.324
4.119LysGly: 4.119 ± 1.275
1.922LysHis: 1.922 ± 0.503
6.59LysIle: 6.59 ± 1.68
6.041LysLys: 6.041 ± 1.436
5.491LysLeu: 5.491 ± 1.316
0.824LysMet: 0.824 ± 0.541
1.922LysAsn: 1.922 ± 0.614
3.02LysPro: 3.02 ± 1.103
4.393LysGln: 4.393 ± 0.854
4.393LysArg: 4.393 ± 1.207
2.197LysSer: 2.197 ± 0.496
4.942LysThr: 4.942 ± 0.862
5.217LysVal: 5.217 ± 1.138
1.373LysTrp: 1.373 ± 0.471
1.647LysTyr: 1.647 ± 0.492
0.0LysXaa: 0.0 ± 0.0
Leu
4.668LeuAla: 4.668 ± 0.644
0.824LeuCys: 0.824 ± 0.359
4.668LeuAsp: 4.668 ± 1.13
7.139LeuGlu: 7.139 ± 1.148
2.197LeuPhe: 2.197 ± 0.949
6.315LeuGly: 6.315 ± 1.542
2.746LeuHis: 2.746 ± 0.938
4.668LeuIle: 4.668 ± 1.44
5.766LeuLys: 5.766 ± 1.371
7.139LeuLeu: 7.139 ± 1.493
0.275LeuMet: 0.275 ± 0.314
4.119LeuAsn: 4.119 ± 1.049
2.471LeuPro: 2.471 ± 0.768
6.315LeuGln: 6.315 ± 0.83
4.942LeuArg: 4.942 ± 0.716
2.746LeuSer: 2.746 ± 0.484
4.668LeuThr: 4.668 ± 0.977
6.041LeuVal: 6.041 ± 1.347
2.471LeuTrp: 2.471 ± 0.764
1.373LeuTyr: 1.373 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
1.647MetAla: 1.647 ± 0.637
0.0MetCys: 0.0 ± 0.0
0.549MetAsp: 0.549 ± 0.36
1.647MetGlu: 1.647 ± 0.935
0.549MetPhe: 0.549 ± 0.274
2.197MetGly: 2.197 ± 0.377
0.824MetHis: 0.824 ± 0.54
0.549MetIle: 0.549 ± 0.194
1.647MetLys: 1.647 ± 0.644
1.922MetLeu: 1.922 ± 0.49
1.647MetMet: 1.647 ± 0.823
0.549MetAsn: 0.549 ± 0.358
0.0MetPro: 0.0 ± 0.0
1.373MetGln: 1.373 ± 0.447
1.647MetArg: 1.647 ± 0.336
0.824MetSer: 0.824 ± 0.364
2.471MetThr: 2.471 ± 0.635
1.647MetVal: 1.647 ± 0.529
0.549MetTrp: 0.549 ± 0.415
1.098MetTyr: 1.098 ± 0.397
0.0MetXaa: 0.0 ± 0.0
Asn
1.373AsnAla: 1.373 ± 0.565
3.569AsnCys: 3.569 ± 0.917
1.098AsnAsp: 1.098 ± 0.4
2.471AsnGlu: 2.471 ± 0.519
2.197AsnPhe: 2.197 ± 0.879
1.373AsnGly: 1.373 ± 0.573
0.824AsnHis: 0.824 ± 0.851
2.746AsnIle: 2.746 ± 0.612
3.569AsnLys: 3.569 ± 0.514
3.295AsnLeu: 3.295 ± 0.573
1.647AsnMet: 1.647 ± 0.96
4.393AsnAsn: 4.393 ± 0.973
3.569AsnPro: 3.569 ± 0.789
1.098AsnGln: 1.098 ± 0.471
2.746AsnArg: 2.746 ± 0.616
2.471AsnSer: 2.471 ± 0.646
5.217AsnThr: 5.217 ± 0.768
1.647AsnVal: 1.647 ± 0.96
2.197AsnTrp: 2.197 ± 0.51
1.098AsnTyr: 1.098 ± 0.328
0.0AsnXaa: 0.0 ± 0.0
Pro
3.295ProAla: 3.295 ± 1.125
0.824ProCys: 0.824 ± 0.623
1.922ProAsp: 1.922 ± 0.498
3.02ProGlu: 3.02 ± 0.537
1.922ProPhe: 1.922 ± 0.631
6.041ProGly: 6.041 ± 1.553
0.275ProHis: 0.275 ± 0.18
4.668ProIle: 4.668 ± 0.846
4.119ProLys: 4.119 ± 1.509
4.119ProLeu: 4.119 ± 0.862
1.373ProMet: 1.373 ± 0.651
1.922ProAsn: 1.922 ± 1.18
3.02ProPro: 3.02 ± 0.813
3.295ProGln: 3.295 ± 0.591
2.471ProArg: 2.471 ± 0.45
2.197ProSer: 2.197 ± 0.871
1.373ProThr: 1.373 ± 0.284
6.041ProVal: 6.041 ± 1.182
1.098ProTrp: 1.098 ± 0.752
1.098ProTyr: 1.098 ± 0.463
0.0ProXaa: 0.0 ± 0.0
Gln
5.217GlnAla: 5.217 ± 0.796
0.275GlnCys: 0.275 ± 0.208
3.295GlnAsp: 3.295 ± 0.621
3.295GlnGlu: 3.295 ± 0.557
0.0GlnPhe: 0.0 ± 0.0
4.942GlnGly: 4.942 ± 0.626
1.098GlnHis: 1.098 ± 0.654
3.295GlnIle: 3.295 ± 1.065
3.02GlnLys: 3.02 ± 1.388
5.491GlnLeu: 5.491 ± 1.375
2.746GlnMet: 2.746 ± 0.634
4.942GlnAsn: 4.942 ± 1.15
2.746GlnPro: 2.746 ± 1.107
3.295GlnGln: 3.295 ± 0.78
3.295GlnArg: 3.295 ± 1.314
1.373GlnSer: 1.373 ± 0.456
2.746GlnThr: 2.746 ± 1.49
2.746GlnVal: 2.746 ± 0.661
1.373GlnTrp: 1.373 ± 0.435
1.647GlnTyr: 1.647 ± 0.536
0.0GlnXaa: 0.0 ± 0.0
Arg
6.041ArgAla: 6.041 ± 0.427
0.824ArgCys: 0.824 ± 0.851
4.393ArgAsp: 4.393 ± 0.714
5.491ArgGlu: 5.491 ± 1.363
1.647ArgPhe: 1.647 ± 0.813
3.844ArgGly: 3.844 ± 0.793
1.098ArgHis: 1.098 ± 0.988
4.393ArgIle: 4.393 ± 1.806
3.844ArgLys: 3.844 ± 0.893
4.119ArgLeu: 4.119 ± 1.489
1.098ArgMet: 1.098 ± 0.512
1.647ArgAsn: 1.647 ± 0.381
2.471ArgPro: 2.471 ± 1.099
5.491ArgGln: 5.491 ± 1.109
5.217ArgArg: 5.217 ± 2.861
3.569ArgSer: 3.569 ± 1.563
3.02ArgThr: 3.02 ± 0.584
3.02ArgVal: 3.02 ± 0.832
1.647ArgTrp: 1.647 ± 0.861
0.824ArgTyr: 0.824 ± 0.335
0.0ArgXaa: 0.0 ± 0.0
Ser
2.197SerAla: 2.197 ± 0.689
0.275SerCys: 0.275 ± 0.18
2.197SerAsp: 2.197 ± 0.452
4.668SerGlu: 4.668 ± 1.226
1.647SerPhe: 1.647 ± 0.864
3.844SerGly: 3.844 ± 1.429
0.549SerHis: 0.549 ± 0.421
4.119SerIle: 4.119 ± 0.558
2.197SerLys: 2.197 ± 0.94
5.491SerLeu: 5.491 ± 1.64
0.549SerMet: 0.549 ± 0.315
3.295SerAsn: 3.295 ± 0.952
2.471SerPro: 2.471 ± 0.857
3.295SerGln: 3.295 ± 0.52
3.02SerArg: 3.02 ± 1.301
3.844SerSer: 3.844 ± 1.009
3.844SerThr: 3.844 ± 0.901
1.098SerVal: 1.098 ± 0.325
0.824SerTrp: 0.824 ± 0.312
1.373SerTyr: 1.373 ± 1.257
0.0SerXaa: 0.0 ± 0.0
Thr
4.393ThrAla: 4.393 ± 0.835
0.549ThrCys: 0.549 ± 0.415
3.569ThrAsp: 3.569 ± 0.773
4.393ThrGlu: 4.393 ± 0.605
0.824ThrPhe: 0.824 ± 0.324
3.844ThrGly: 3.844 ± 0.704
1.373ThrHis: 1.373 ± 0.539
4.119ThrIle: 4.119 ± 0.945
3.295ThrLys: 3.295 ± 0.625
4.942ThrLeu: 4.942 ± 0.963
1.647ThrMet: 1.647 ± 0.685
2.197ThrAsn: 2.197 ± 0.496
3.844ThrPro: 3.844 ± 0.792
2.471ThrGln: 2.471 ± 0.42
2.471ThrArg: 2.471 ± 1.249
2.746ThrSer: 2.746 ± 0.764
3.02ThrThr: 3.02 ± 0.664
4.119ThrVal: 4.119 ± 1.068
2.471ThrTrp: 2.471 ± 0.388
1.373ThrTyr: 1.373 ± 0.593
0.0ThrXaa: 0.0 ± 0.0
Val
2.746ValAla: 2.746 ± 0.826
0.549ValCys: 0.549 ± 0.626
3.569ValAsp: 3.569 ± 1.164
3.569ValGlu: 3.569 ± 1.143
0.824ValPhe: 0.824 ± 0.312
6.041ValGly: 6.041 ± 0.699
3.569ValHis: 3.569 ± 0.526
1.922ValIle: 1.922 ± 0.449
4.668ValLys: 4.668 ± 1.037
3.569ValLeu: 3.569 ± 0.631
0.275ValMet: 0.275 ± 0.208
2.471ValAsn: 2.471 ± 1.055
3.844ValPro: 3.844 ± 0.744
3.02ValGln: 3.02 ± 0.698
4.668ValArg: 4.668 ± 0.63
3.569ValSer: 3.569 ± 0.976
1.647ValThr: 1.647 ± 0.583
2.471ValVal: 2.471 ± 0.48
2.197ValTrp: 2.197 ± 0.511
1.373ValTyr: 1.373 ± 0.471
0.0ValXaa: 0.0 ± 0.0
Trp
1.373TrpAla: 1.373 ± 0.377
0.275TrpCys: 0.275 ± 0.31
1.647TrpAsp: 1.647 ± 0.431
1.922TrpGlu: 1.922 ± 0.48
0.275TrpPhe: 0.275 ± 0.208
1.922TrpGly: 1.922 ± 0.489
0.275TrpHis: 0.275 ± 0.402
1.098TrpIle: 1.098 ± 0.248
2.471TrpLys: 2.471 ± 0.579
1.373TrpLeu: 1.373 ± 0.62
1.922TrpMet: 1.922 ± 0.427
1.922TrpAsn: 1.922 ± 1.262
0.549TrpPro: 0.549 ± 0.274
2.197TrpGln: 2.197 ± 0.633
1.647TrpArg: 1.647 ± 0.518
0.824TrpSer: 0.824 ± 0.527
2.197TrpThr: 2.197 ± 0.87
1.373TrpVal: 1.373 ± 0.299
0.824TrpTrp: 0.824 ± 0.312
0.824TrpTyr: 0.824 ± 0.335
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.373TyrAla: 1.373 ± 0.539
1.647TyrCys: 1.647 ± 0.639
0.824TyrAsp: 0.824 ± 0.312
0.824TyrGlu: 0.824 ± 0.5
1.098TyrPhe: 1.098 ± 0.431
1.647TyrGly: 1.647 ± 0.757
1.098TyrHis: 1.098 ± 0.554
0.824TyrIle: 0.824 ± 0.346
1.647TyrLys: 1.647 ± 0.654
1.098TyrLeu: 1.098 ± 0.427
0.824TyrMet: 0.824 ± 0.359
1.922TyrAsn: 1.922 ± 0.568
1.098TyrPro: 1.098 ± 0.37
1.922TyrGln: 1.922 ± 0.769
1.922TyrArg: 1.922 ± 0.888
1.647TyrSer: 1.647 ± 0.364
1.098TyrThr: 1.098 ± 0.448
1.647TyrVal: 1.647 ± 0.604
0.824TyrTrp: 0.824 ± 0.335
1.373TyrTyr: 1.373 ± 0.369
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3643 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski