Amino acid dipepetide frequency for Human immunodeficiency virus type 1 group N (isolate YBF30) (HIV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.644AlaAla: 4.644 ± 0.901
2.458AlaCys: 2.458 ± 0.509
2.731AlaAsp: 2.731 ± 0.61
4.917AlaGlu: 4.917 ± 1.375
1.366AlaPhe: 1.366 ± 0.302
4.37AlaGly: 4.37 ± 0.716
1.912AlaHis: 1.912 ± 0.525
3.551AlaIle: 3.551 ± 1.136
3.005AlaLys: 3.005 ± 0.662
6.282AlaLeu: 6.282 ± 0.998
2.185AlaMet: 2.185 ± 0.715
2.458AlaAsn: 2.458 ± 0.905
1.912AlaPro: 1.912 ± 0.766
2.731AlaGln: 2.731 ± 0.466
4.097AlaArg: 4.097 ± 1.227
3.551AlaSer: 3.551 ± 0.682
4.37AlaThr: 4.37 ± 1.162
4.644AlaVal: 4.644 ± 1.208
1.912AlaTrp: 1.912 ± 0.599
1.366AlaTyr: 1.366 ± 0.602
0.0AlaXaa: 0.0 ± 0.0
Cys
1.093CysAla: 1.093 ± 0.387
0.546CysCys: 0.546 ± 0.64
0.546CysAsp: 0.546 ± 0.433
0.273CysGlu: 0.273 ± 0.38
1.639CysPhe: 1.639 ± 0.915
1.093CysGly: 1.093 ± 0.502
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.458CysLys: 2.458 ± 0.84
0.819CysLeu: 0.819 ± 0.878
0.273CysMet: 0.273 ± 0.366
2.458CysAsn: 2.458 ± 1.115
0.546CysPro: 0.546 ± 0.212
1.366CysGln: 1.366 ± 0.517
0.546CysArg: 0.546 ± 0.467
0.819CysSer: 0.819 ± 0.404
1.912CysThr: 1.912 ± 0.807
1.639CysVal: 1.639 ± 0.53
0.819CysTrp: 0.819 ± 0.352
1.366CysTyr: 1.366 ± 1.274
0.0CysXaa: 0.0 ± 0.0
Asp
1.366AspAla: 1.366 ± 0.48
1.639AspCys: 1.639 ± 0.613
1.093AspAsp: 1.093 ± 0.72
1.093AspGlu: 1.093 ± 0.595
1.093AspPhe: 1.093 ± 0.759
2.458AspGly: 2.458 ± 0.793
1.093AspHis: 1.093 ± 0.582
3.824AspIle: 3.824 ± 0.929
3.005AspLys: 3.005 ± 0.901
3.551AspLeu: 3.551 ± 1.214
0.273AspMet: 0.273 ± 0.397
1.639AspAsn: 1.639 ± 0.89
2.731AspPro: 2.731 ± 1.02
2.185AspGln: 2.185 ± 0.649
3.551AspArg: 3.551 ± 0.651
3.551AspSer: 3.551 ± 0.853
2.458AspThr: 2.458 ± 0.473
1.093AspVal: 1.093 ± 0.492
0.819AspTrp: 0.819 ± 0.38
0.819AspTyr: 0.819 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
6.556GluAla: 6.556 ± 1.21
0.0GluCys: 0.0 ± 0.0
2.731GluAsp: 2.731 ± 0.76
7.921GluGlu: 7.921 ± 2.219
1.093GluPhe: 1.093 ± 0.38
5.463GluGly: 5.463 ± 0.841
0.819GluHis: 0.819 ± 0.424
2.458GluIle: 2.458 ± 0.845
4.37GluLys: 4.37 ± 0.844
8.741GluLeu: 8.741 ± 1.864
2.185GluMet: 2.185 ± 0.716
1.912GluAsn: 1.912 ± 0.48
6.829GluPro: 6.829 ± 1.134
4.097GluGln: 4.097 ± 0.854
3.824GluArg: 3.824 ± 0.726
3.005GluSer: 3.005 ± 0.977
4.644GluThr: 4.644 ± 1.224
4.917GluVal: 4.917 ± 1.085
1.366GluTrp: 1.366 ± 0.926
0.819GluTyr: 0.819 ± 0.564
0.0GluXaa: 0.0 ± 0.0
Phe
1.639PheAla: 1.639 ± 0.464
0.273PheCys: 0.273 ± 0.234
0.819PheAsp: 0.819 ± 0.817
0.273PheGlu: 0.273 ± 0.234
0.819PhePhe: 0.819 ± 0.349
1.639PheGly: 1.639 ± 0.622
0.0PheHis: 0.0 ± 0.0
1.366PheIle: 1.366 ± 0.61
1.366PheLys: 1.366 ± 0.53
1.912PheLeu: 1.912 ± 0.521
0.546PheMet: 0.546 ± 0.251
2.185PheAsn: 2.185 ± 0.709
1.366PhePro: 1.366 ± 0.517
0.546PheGln: 0.546 ± 0.251
2.458PheArg: 2.458 ± 1.106
1.366PheSer: 1.366 ± 0.346
1.912PheThr: 1.912 ± 0.832
0.546PheVal: 0.546 ± 0.379
0.273PheTrp: 0.273 ± 0.19
2.185PheTyr: 2.185 ± 0.724
0.0PheXaa: 0.0 ± 0.0
Gly
6.009GlyAla: 6.009 ± 1.184
2.185GlyCys: 2.185 ± 0.574
3.551GlyAsp: 3.551 ± 0.841
3.278GlyGlu: 3.278 ± 1.017
2.458GlyPhe: 2.458 ± 0.536
6.009GlyGly: 6.009 ± 1.129
2.731GlyHis: 2.731 ± 1.602
7.375GlyIle: 7.375 ± 1.556
6.556GlyLys: 6.556 ± 1.497
4.37GlyLeu: 4.37 ± 1.169
0.546GlyMet: 0.546 ± 0.212
3.278GlyAsn: 3.278 ± 1.08
4.917GlyPro: 4.917 ± 0.759
4.644GlyGln: 4.644 ± 1.046
3.824GlyArg: 3.824 ± 0.657
3.551GlySer: 3.551 ± 0.578
3.005GlyThr: 3.005 ± 1.799
3.278GlyVal: 3.278 ± 0.909
1.366GlyTrp: 1.366 ± 0.722
1.366GlyTyr: 1.366 ± 0.519
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.546HisCys: 0.546 ± 0.64
0.273HisAsp: 0.273 ± 0.19
0.0HisGlu: 0.0 ± 0.0
0.273HisPhe: 0.273 ± 0.391
1.639HisGly: 1.639 ± 0.546
0.819HisHis: 0.819 ± 1.14
1.093HisIle: 1.093 ± 0.76
2.185HisLys: 2.185 ± 0.65
3.278HisLeu: 3.278 ± 0.649
0.546HisMet: 0.546 ± 0.715
1.366HisAsn: 1.366 ± 0.302
2.731HisPro: 2.731 ± 1.014
2.731HisGln: 2.731 ± 0.928
0.819HisArg: 0.819 ± 0.335
2.185HisSer: 2.185 ± 0.486
1.093HisThr: 1.093 ± 0.582
0.546HisVal: 0.546 ± 0.36
0.546HisTrp: 0.546 ± 0.212
1.093HisTyr: 1.093 ± 0.946
0.0HisXaa: 0.0 ± 0.0
Ile
2.458IleAla: 2.458 ± 0.722
0.819IleCys: 0.819 ± 0.349
0.273IleAsp: 0.273 ± 0.19
3.824IleGlu: 3.824 ± 1.001
1.093IlePhe: 1.093 ± 0.425
4.37IleGly: 4.37 ± 1.359
2.185IleHis: 2.185 ± 0.553
4.097IleIle: 4.097 ± 1.508
5.736IleLys: 5.736 ± 1.002
5.736IleLeu: 5.736 ± 1.099
1.093IleMet: 1.093 ± 0.745
1.639IleAsn: 1.639 ± 0.53
3.551IlePro: 3.551 ± 0.614
1.912IleGln: 1.912 ± 1.043
6.282IleArg: 6.282 ± 1.281
3.551IleSer: 3.551 ± 0.736
3.278IleThr: 3.278 ± 1.233
4.917IleVal: 4.917 ± 1.571
1.912IleTrp: 1.912 ± 0.631
2.458IleTyr: 2.458 ± 0.682
0.0IleXaa: 0.0 ± 0.0
Lys
5.19LysAla: 5.19 ± 1.621
1.639LysCys: 1.639 ± 0.464
1.639LysAsp: 1.639 ± 0.651
6.829LysGlu: 6.829 ± 1.739
0.546LysPhe: 0.546 ± 0.212
4.097LysGly: 4.097 ± 0.879
1.639LysHis: 1.639 ± 0.524
5.19LysIle: 5.19 ± 0.933
5.736LysLys: 5.736 ± 1.503
6.282LysLeu: 6.282 ± 1.713
1.093LysMet: 1.093 ± 0.347
3.005LysAsn: 3.005 ± 0.587
2.458LysPro: 2.458 ± 0.454
5.19LysGln: 5.19 ± 0.929
2.731LysArg: 2.731 ± 1.364
2.458LysSer: 2.458 ± 0.689
3.824LysThr: 3.824 ± 0.809
3.551LysVal: 3.551 ± 1.156
1.639LysTrp: 1.639 ± 0.593
1.639LysTyr: 1.639 ± 0.375
0.0LysXaa: 0.0 ± 0.0
Leu
4.644LeuAla: 4.644 ± 0.703
0.819LeuCys: 0.819 ± 0.404
4.097LeuAsp: 4.097 ± 1.223
7.648LeuGlu: 7.648 ± 0.949
2.185LeuPhe: 2.185 ± 1.077
5.463LeuGly: 5.463 ± 1.54
2.185LeuHis: 2.185 ± 0.753
3.551LeuIle: 3.551 ± 1.693
5.19LeuLys: 5.19 ± 1.87
8.194LeuLeu: 8.194 ± 2.484
1.093LeuMet: 1.093 ± 0.259
5.19LeuAsn: 5.19 ± 1.201
3.824LeuPro: 3.824 ± 0.906
3.278LeuGln: 3.278 ± 0.733
5.463LeuArg: 5.463 ± 0.985
4.097LeuSer: 4.097 ± 1.38
4.644LeuThr: 4.644 ± 1.11
7.648LeuVal: 7.648 ± 1.219
3.824LeuTrp: 3.824 ± 1.54
3.278LeuTyr: 3.278 ± 0.962
0.0LeuXaa: 0.0 ± 0.0
Met
2.458MetAla: 2.458 ± 0.611
0.0MetCys: 0.0 ± 0.0
1.093MetAsp: 1.093 ± 0.507
1.639MetGlu: 1.639 ± 0.89
0.819MetPhe: 0.819 ± 0.228
1.912MetGly: 1.912 ± 0.359
0.273MetHis: 0.273 ± 0.19
0.546MetIle: 0.546 ± 0.212
2.185MetLys: 2.185 ± 0.61
1.639MetLeu: 1.639 ± 0.461
0.546MetMet: 0.546 ± 0.251
0.546MetAsn: 0.546 ± 0.355
0.0MetPro: 0.0 ± 0.0
1.366MetGln: 1.366 ± 0.575
0.546MetArg: 0.546 ± 0.36
0.546MetSer: 0.546 ± 0.251
2.458MetThr: 2.458 ± 0.64
1.093MetVal: 1.093 ± 0.384
0.546MetTrp: 0.546 ± 0.467
1.093MetTyr: 1.093 ± 0.507
0.0MetXaa: 0.0 ± 0.0
Asn
2.458AsnAla: 2.458 ± 0.577
3.551AsnCys: 3.551 ± 0.895
1.639AsnAsp: 1.639 ± 0.422
5.463AsnGlu: 5.463 ± 1.547
3.005AsnPhe: 3.005 ± 1.056
1.639AsnGly: 1.639 ± 0.533
0.546AsnHis: 0.546 ± 0.64
3.005AsnIle: 3.005 ± 0.929
1.366AsnLys: 1.366 ± 0.519
2.458AsnLeu: 2.458 ± 1.015
0.819AsnMet: 0.819 ± 0.701
3.005AsnAsn: 3.005 ± 1.251
3.551AsnPro: 3.551 ± 0.976
1.366AsnGln: 1.366 ± 0.581
2.185AsnArg: 2.185 ± 0.512
0.819AsnSer: 0.819 ± 0.741
4.917AsnThr: 4.917 ± 1.306
1.093AsnVal: 1.093 ± 0.625
1.366AsnTrp: 1.366 ± 0.505
1.093AsnTyr: 1.093 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.824ProAla: 3.824 ± 1.021
0.819ProCys: 0.819 ± 0.589
2.731ProAsp: 2.731 ± 0.598
4.917ProGlu: 4.917 ± 1.119
1.366ProPhe: 1.366 ± 0.581
6.282ProGly: 6.282 ± 0.992
0.273ProHis: 0.273 ± 0.19
5.19ProIle: 5.19 ± 1.089
3.005ProLys: 3.005 ± 0.686
5.463ProLeu: 5.463 ± 1.352
0.819ProMet: 0.819 ± 0.228
0.819ProAsn: 0.819 ± 0.609
4.097ProPro: 4.097 ± 1.044
2.731ProGln: 2.731 ± 0.972
3.278ProArg: 3.278 ± 0.911
1.639ProSer: 1.639 ± 0.495
1.912ProThr: 1.912 ± 0.449
6.556ProVal: 6.556 ± 1.198
0.819ProTrp: 0.819 ± 0.656
1.093ProTyr: 1.093 ± 0.577
0.0ProXaa: 0.0 ± 0.0
Gln
4.917GlnAla: 4.917 ± 0.913
0.273GlnCys: 0.273 ± 0.234
2.185GlnAsp: 2.185 ± 0.925
6.556GlnGlu: 6.556 ± 1.283
0.273GlnPhe: 0.273 ± 0.19
5.19GlnGly: 5.19 ± 0.923
2.185GlnHis: 2.185 ± 0.761
3.551GlnIle: 3.551 ± 0.991
2.731GlnLys: 2.731 ± 1.359
4.917GlnLeu: 4.917 ± 1.167
3.005GlnMet: 3.005 ± 0.793
1.366GlnAsn: 1.366 ± 0.346
2.458GlnPro: 2.458 ± 0.986
3.278GlnGln: 3.278 ± 1.569
3.005GlnArg: 3.005 ± 1.198
2.185GlnSer: 2.185 ± 0.849
2.185GlnThr: 2.185 ± 0.53
3.551GlnVal: 3.551 ± 1.283
1.912GlnTrp: 1.912 ± 0.527
1.366GlnTyr: 1.366 ± 0.709
0.0GlnXaa: 0.0 ± 0.0
Arg
3.551ArgAla: 3.551 ± 1.357
0.819ArgCys: 0.819 ± 0.835
3.551ArgAsp: 3.551 ± 1.006
6.556ArgGlu: 6.556 ± 1.509
1.366ArgPhe: 1.366 ± 0.53
4.37ArgGly: 4.37 ± 0.839
0.819ArgHis: 0.819 ± 0.726
4.644ArgIle: 4.644 ± 1.638
4.644ArgLys: 4.644 ± 1.494
2.458ArgLeu: 2.458 ± 1.257
1.639ArgMet: 1.639 ± 0.607
3.551ArgAsn: 3.551 ± 0.898
2.458ArgPro: 2.458 ± 0.706
4.37ArgGln: 4.37 ± 1.676
5.463ArgArg: 5.463 ± 2.701
2.458ArgSer: 2.458 ± 0.944
3.551ArgThr: 3.551 ± 0.946
2.185ArgVal: 2.185 ± 0.451
1.639ArgTrp: 1.639 ± 0.881
1.093ArgTyr: 1.093 ± 0.38
0.0ArgXaa: 0.0 ± 0.0
Ser
1.639SerAla: 1.639 ± 0.43
1.093SerCys: 1.093 ± 0.393
2.185SerAsp: 2.185 ± 0.526
2.458SerGlu: 2.458 ± 0.583
0.819SerPhe: 0.819 ± 0.563
4.644SerGly: 4.644 ± 1.519
0.819SerHis: 0.819 ± 0.656
3.824SerIle: 3.824 ± 0.42
2.458SerLys: 2.458 ± 1.662
6.009SerLeu: 6.009 ± 2.574
0.546SerMet: 0.546 ± 0.372
2.185SerAsn: 2.185 ± 0.496
2.731SerPro: 2.731 ± 0.914
4.097SerGln: 4.097 ± 1.664
3.551SerArg: 3.551 ± 0.8
2.731SerSer: 2.731 ± 1.76
2.458SerThr: 2.458 ± 0.709
2.185SerVal: 2.185 ± 0.536
1.093SerTrp: 1.093 ± 0.384
1.366SerTyr: 1.366 ± 0.767
0.0SerXaa: 0.0 ± 0.0
Thr
5.736ThrAla: 5.736 ± 1.065
0.0ThrCys: 0.0 ± 0.0
3.278ThrAsp: 3.278 ± 0.992
4.917ThrGlu: 4.917 ± 1.169
1.366ThrPhe: 1.366 ± 0.447
3.551ThrGly: 3.551 ± 0.594
1.912ThrHis: 1.912 ± 0.526
1.639ThrIle: 1.639 ± 0.867
2.458ThrLys: 2.458 ± 0.659
5.736ThrLeu: 5.736 ± 0.969
1.366ThrMet: 1.366 ± 0.346
1.639ThrAsn: 1.639 ± 0.637
4.37ThrPro: 4.37 ± 0.983
3.824ThrGln: 3.824 ± 0.568
2.458ThrArg: 2.458 ± 0.771
3.005ThrSer: 3.005 ± 0.827
4.917ThrThr: 4.917 ± 0.925
4.917ThrVal: 4.917 ± 1.344
1.093ThrTrp: 1.093 ± 0.49
1.366ThrTyr: 1.366 ± 0.614
0.0ThrXaa: 0.0 ± 0.0
Val
3.551ValAla: 3.551 ± 1.045
0.0ValCys: 0.0 ± 0.0
3.005ValAsp: 3.005 ± 1.124
1.912ValGlu: 1.912 ± 0.731
0.819ValPhe: 0.819 ± 0.352
6.282ValGly: 6.282 ± 0.967
1.639ValHis: 1.639 ± 0.613
5.19ValIle: 5.19 ± 0.865
3.551ValLys: 3.551 ± 1.287
4.917ValLeu: 4.917 ± 0.7
0.546ValMet: 0.546 ± 0.527
2.458ValAsn: 2.458 ± 0.854
4.097ValPro: 4.097 ± 0.812
4.37ValGln: 4.37 ± 1.039
3.005ValArg: 3.005 ± 0.494
4.37ValSer: 4.37 ± 0.934
3.551ValThr: 3.551 ± 1.215
2.458ValVal: 2.458 ± 0.716
3.005ValTrp: 3.005 ± 0.627
1.639ValTyr: 1.639 ± 0.637
0.0ValXaa: 0.0 ± 0.0
Trp
1.912TrpAla: 1.912 ± 0.458
0.273TrpCys: 0.273 ± 0.365
1.093TrpAsp: 1.093 ± 0.387
2.731TrpGlu: 2.731 ± 0.677
0.273TrpPhe: 0.273 ± 0.234
1.912TrpGly: 1.912 ± 0.773
0.546TrpHis: 0.546 ± 0.522
0.819TrpIle: 0.819 ± 0.352
1.912TrpLys: 1.912 ± 1.043
1.639TrpLeu: 1.639 ± 1.093
1.639TrpMet: 1.639 ± 0.572
2.185TrpAsn: 2.185 ± 1.354
0.819TrpPro: 0.819 ± 0.445
1.639TrpGln: 1.639 ± 0.669
1.639TrpArg: 1.639 ± 0.688
1.366TrpSer: 1.366 ± 0.832
1.366TrpThr: 1.366 ± 0.67
1.912TrpVal: 1.912 ± 0.384
0.546TrpTrp: 0.546 ± 0.379
0.819TrpTyr: 0.819 ± 0.335
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.819TyrAla: 0.819 ± 0.328
2.458TyrCys: 2.458 ± 1.264
0.819TyrAsp: 0.819 ± 0.328
0.546TyrGlu: 0.546 ± 0.355
0.819TyrPhe: 0.819 ± 0.424
1.912TyrGly: 1.912 ± 0.773
1.366TyrHis: 1.366 ± 0.729
0.273TyrIle: 0.273 ± 0.234
2.458TyrLys: 2.458 ± 0.974
1.912TyrLeu: 1.912 ± 0.354
0.273TyrMet: 0.273 ± 0.19
2.458TyrAsn: 2.458 ± 0.709
2.185TyrPro: 2.185 ± 0.703
1.639TyrGln: 1.639 ± 0.881
2.458TyrArg: 2.458 ± 0.636
1.639TyrSer: 1.639 ± 0.82
1.093TyrThr: 1.093 ± 0.38
1.639TyrVal: 1.639 ± 0.375
0.546TyrTrp: 0.546 ± 0.42
1.093TyrTyr: 1.093 ± 0.347
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3662 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski