Amino acid dipeptide frequency for Human immunodeficiency virus type 2 subtype A (isolate ROD) (HIV-2)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
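As a rough illustration of how such frequencies can be obtained, the following minimal sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the counting convention (pairs counted within each protein and normalised by the total number of pairs) are assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs inside this protein only (assumed convention).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the HIV-2 proteome).
freqs = dipeptide_per_mille(["MKKLAA", "AAKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the sequences would come from the proteome's FASTA file, and each resulting value could be compared with the uniform expectation of 2.5‰.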

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
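The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using exactly 2.5‰ (1/400) as the threshold for over- and underrepresentation is an assumption; the page does not state the precise cut-off used for the colouring.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaGlu (8.29) and PhePhe (0.259).
for name, value in [("AlaGlu", 8.29), ("PhePhe", 0.259)]:
    print(name, per_mille_to_fraction(value), classify(value))
```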

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.663 ± 1.1
AlaCys: 1.295 ± 0.425
AlaAsp: 2.591 ± 1.22
AlaGlu: 8.29 ± 1.902
AlaPhe: 2.85 ± 0.605
AlaGly: 4.404 ± 0.669
AlaHis: 0.777 ± 0.356
AlaIle: 3.368 ± 0.683
AlaLys: 2.332 ± 0.883
AlaLeu: 4.404 ± 1.462
AlaMet: 2.332 ± 0.612
AlaAsn: 2.591 ± 0.669
AlaPro: 5.181 ± 0.889
AlaGln: 2.85 ± 0.551
AlaArg: 4.922 ± 1.023
AlaSer: 2.332 ± 0.541
AlaThr: 2.591 ± 1.031
AlaVal: 2.332 ± 0.841
AlaTrp: 2.073 ± 0.491
AlaTyr: 1.554 ± 0.485
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.554 ± 0.878
CysCys: 0.777 ± 0.757
CysAsp: 0.259 ± 0.213
CysGlu: 1.036 ± 0.402
CysPhe: 0.777 ± 0.757
CysGly: 1.554 ± 0.766
CysHis: 0.518 ± 0.426
CysIle: 0.777 ± 0.46
CysLys: 1.295 ± 0.629
CysLeu: 1.036 ± 0.678
CysMet: 0.259 ± 0.185
CysAsn: 2.591 ± 1.509
CysPro: 0.518 ± 0.255
CysGln: 2.073 ± 0.587
CysArg: 1.554 ± 0.508
CysSer: 1.036 ± 0.746
CysThr: 1.554 ± 0.464
CysVal: 2.073 ± 0.664
CysTrp: 1.295 ± 0.444
CysTyr: 2.073 ± 2.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.036 ± 0.582
AspCys: 1.036 ± 0.366
AspAsp: 2.073 ± 1.624
AspGlu: 1.554 ± 0.946
AspPhe: 1.554 ± 0.871
AspGly: 1.295 ± 0.589
AspHis: 0.777 ± 0.343
AspIle: 2.591 ± 0.914
AspLys: 1.813 ± 0.695
AspLeu: 1.813 ± 0.803
AspMet: 0.518 ± 0.34
AspAsn: 1.554 ± 0.765
AspPro: 3.886 ± 1.675
AspGln: 1.813 ± 0.462
AspArg: 3.627 ± 0.822
AspSer: 2.85 ± 0.942
AspThr: 3.368 ± 0.607
AspVal: 3.368 ± 0.924
AspTrp: 1.295 ± 0.566
AspTyr: 1.036 ± 0.501
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.477 ± 1.696
GluCys: 0.0 ± 0.0
GluAsp: 2.332 ± 0.631
GluGlu: 8.031 ± 1.203
GluPhe: 0.777 ± 0.693
GluGly: 6.218 ± 2.006
GluHis: 0.518 ± 0.37
GluIle: 4.922 ± 1.225
GluLys: 7.513 ± 1.728
GluLeu: 6.218 ± 1.555
GluMet: 1.295 ± 0.654
GluAsn: 1.554 ± 0.415
GluPro: 3.109 ± 1.141
GluGln: 4.922 ± 0.696
GluArg: 3.886 ± 1.038
GluSer: 3.886 ± 0.673
GluThr: 6.218 ± 2.102
GluVal: 4.145 ± 1.047
GluTrp: 1.554 ± 0.593
GluTyr: 0.518 ± 0.34
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.073 ± 0.56
PheCys: 0.259 ± 0.213
PheAsp: 1.295 ± 0.885
PheGlu: 0.259 ± 0.213
PhePhe: 0.259 ± 0.354
PheGly: 3.109 ± 0.801
PheHis: 0.518 ± 0.212
PheIle: 0.777 ± 0.417
PheLys: 1.036 ± 0.272
PheLeu: 3.886 ± 1.218
PheMet: 0.259 ± 0.354
PheAsn: 1.295 ± 0.347
PhePro: 1.295 ± 0.972
PheGln: 2.332 ± 0.557
PheArg: 2.591 ± 0.788
PheSer: 2.073 ± 0.646
PheThr: 1.295 ± 0.628
PheVal: 0.518 ± 0.212
PheTrp: 0.259 ± 0.284
PheTyr: 1.295 ± 0.325
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.922 ± 1.036
GlyCys: 2.591 ± 0.786
GlyAsp: 3.368 ± 1.132
GlyGlu: 4.663 ± 1.271
GlyPhe: 3.109 ± 0.864
GlyGly: 5.44 ± 1.214
GlyHis: 2.332 ± 0.702
GlyIle: 4.404 ± 1.562
GlyLys: 5.959 ± 1.932
GlyLeu: 6.218 ± 1.466
GlyMet: 1.295 ± 0.537
GlyAsn: 3.627 ± 0.574
GlyPro: 4.922 ± 1.506
GlyGln: 2.332 ± 0.715
GlyArg: 3.886 ± 1.026
GlySer: 3.886 ± 0.936
GlyThr: 3.109 ± 0.694
GlyVal: 2.591 ± 0.529
GlyTrp: 1.295 ± 0.735
GlyTyr: 1.554 ± 0.525
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.518 ± 0.255
HisCys: 1.554 ± 0.755
HisAsp: 0.518 ± 0.414
HisGlu: 0.518 ± 0.37
HisPhe: 0.777 ± 1.097
HisGly: 1.813 ± 0.919
HisHis: 0.518 ± 0.306
HisIle: 2.073 ± 0.664
HisLys: 2.073 ± 0.84
HisLeu: 4.404 ± 1.152
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.813 ± 0.649
HisGln: 1.036 ± 0.507
HisArg: 0.777 ± 0.492
HisSer: 2.073 ± 0.597
HisThr: 1.295 ± 0.53
HisVal: 1.295 ± 0.406
HisTrp: 0.259 ± 0.185
HisTyr: 0.777 ± 0.382
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.591 ± 0.81
IleCys: 1.036 ± 0.644
IleAsp: 0.777 ± 0.305
IleGlu: 3.368 ± 1.378
IlePhe: 1.295 ± 0.498
IleGly: 3.368 ± 0.79
IleHis: 2.332 ± 0.633
IleIle: 5.181 ± 1.071
IleLys: 3.109 ± 0.995
IleLeu: 4.145 ± 0.822
IleMet: 1.295 ± 0.4
IleAsn: 2.85 ± 0.616
IlePro: 4.404 ± 1.483
IleGln: 5.181 ± 1.044
IleArg: 4.145 ± 0.651
IleSer: 1.813 ± 0.771
IleThr: 2.073 ± 0.851
IleVal: 3.886 ± 1.206
IleTrp: 1.295 ± 0.654
IleTyr: 2.85 ± 0.562
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.627 ± 1.097
LysCys: 1.813 ± 0.684
LysAsp: 3.886 ± 0.912
LysGlu: 6.477 ± 2.124
LysPhe: 1.813 ± 0.615
LysGly: 4.922 ± 1.108
LysHis: 1.813 ± 0.486
LysIle: 3.886 ± 2.187
LysLys: 5.699 ± 1.267
LysLeu: 4.663 ± 0.919
LysMet: 1.295 ± 0.655
LysAsn: 2.85 ± 0.725
LysPro: 1.295 ± 0.444
LysGln: 3.368 ± 1.141
LysArg: 3.109 ± 0.709
LysSer: 2.85 ± 1.115
LysThr: 2.85 ± 1.644
LysVal: 4.404 ± 1.245
LysTrp: 1.036 ± 0.416
LysTyr: 3.109 ± 0.696
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.477 ± 1.341
LeuCys: 0.777 ± 0.43
LeuAsp: 2.591 ± 0.839
LeuGlu: 8.031 ± 0.836
LeuPhe: 2.073 ± 0.53
LeuGly: 4.663 ± 0.782
LeuHis: 1.554 ± 0.592
LeuIle: 3.886 ± 1.179
LeuLys: 5.959 ± 1.666
LeuLeu: 6.736 ± 1.238
LeuMet: 1.295 ± 0.351
LeuAsn: 4.922 ± 0.551
LeuPro: 3.886 ± 1.086
LeuGln: 3.886 ± 0.933
LeuArg: 5.959 ± 1.137
LeuSer: 3.886 ± 0.866
LeuThr: 4.663 ± 0.964
LeuVal: 5.959 ± 1.307
LeuTrp: 1.554 ± 0.608
LeuTyr: 1.813 ± 0.833
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.554 ± 0.729
MetCys: 0.518 ± 0.711
MetAsp: 1.036 ± 0.548
MetGlu: 1.554 ± 0.845
MetPhe: 1.036 ± 0.73
MetGly: 1.554 ± 0.384
MetHis: 0.259 ± 0.442
MetIle: 0.518 ± 0.37
MetLys: 0.259 ± 0.213
MetLeu: 2.073 ± 0.543
MetMet: 0.518 ± 0.426
MetAsn: 2.073 ± 0.414
MetPro: 0.518 ± 0.255
MetGln: 1.295 ± 0.67
MetArg: 0.259 ± 0.185
MetSer: 1.036 ± 0.501
MetThr: 3.109 ± 0.508
MetVal: 0.777 ± 0.343
MetTrp: 0.259 ± 0.213
MetTyr: 0.777 ± 0.237
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.813 ± 0.581
AsnCys: 2.591 ± 0.811
AsnAsp: 1.554 ± 0.878
AsnGlu: 2.073 ± 0.867
AsnPhe: 0.777 ± 0.343
AsnGly: 1.295 ± 0.53
AsnHis: 0.777 ± 0.43
AsnIle: 3.109 ± 0.617
AsnLys: 3.109 ± 0.687
AsnLeu: 2.591 ± 0.529
AsnMet: 1.813 ± 0.791
AsnAsn: 2.332 ± 1.024
AsnPro: 3.109 ± 1.268
AsnGln: 2.591 ± 0.858
AsnArg: 2.332 ± 0.996
AsnSer: 3.627 ± 0.728
AsnThr: 2.85 ± 0.663
AsnVal: 1.036 ± 0.424
AsnTrp: 1.295 ± 0.347
AsnTyr: 2.591 ± 0.481
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.886 ± 1.185
ProCys: 1.036 ± 0.746
ProAsp: 3.368 ± 0.853
ProGlu: 1.813 ± 1.199
ProPhe: 2.073 ± 0.776
ProGly: 7.513 ± 1.593
ProHis: 0.777 ± 0.419
ProIle: 2.85 ± 0.787
ProLys: 2.332 ± 0.819
ProLeu: 5.181 ± 1.939
ProMet: 1.036 ± 0.73
ProAsn: 0.518 ± 0.212
ProPro: 4.922 ± 3.421
ProGln: 3.109 ± 1.01
ProArg: 5.44 ± 1.082
ProSer: 3.109 ± 0.83
ProThr: 6.477 ± 1.643
ProVal: 3.886 ± 1.223
ProTrp: 1.036 ± 0.556
ProTyr: 2.332 ± 0.964
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.145 ± 1.053
GlnCys: 1.295 ± 0.347
GlnAsp: 1.295 ± 0.695
GlnGlu: 6.477 ± 1.302
GlnPhe: 1.036 ± 0.416
GlnGly: 5.44 ± 1.626
GlnHis: 1.554 ± 0.688
GlnIle: 4.145 ± 0.951
GlnLys: 4.404 ± 1.167
GlnLeu: 3.886 ± 0.969
GlnMet: 2.073 ± 0.992
GlnAsn: 2.332 ± 0.754
GlnPro: 1.295 ± 0.459
GlnGln: 3.886 ± 1.252
GlnArg: 3.368 ± 2.384
GlnSer: 1.554 ± 0.561
GlnThr: 3.368 ± 0.967
GlnVal: 2.85 ± 0.525
GlnTrp: 1.554 ± 0.827
GlnTyr: 1.813 ± 0.7
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.663 ± 1.084
ArgCys: 0.777 ± 0.717
ArgAsp: 2.332 ± 1.177
ArgGlu: 6.218 ± 1.991
ArgPhe: 1.554 ± 0.506
ArgGly: 5.959 ± 0.748
ArgHis: 1.295 ± 0.599
ArgIle: 2.85 ± 0.579
ArgLys: 2.591 ± 0.809
ArgLeu: 6.477 ± 1.35
ArgMet: 1.554 ± 0.372
ArgAsn: 2.591 ± 0.87
ArgPro: 4.663 ± 0.931
ArgGln: 6.218 ± 0.64
ArgArg: 6.477 ± 2.561
ArgSer: 1.036 ± 0.774
ArgThr: 3.886 ± 1.646
ArgVal: 1.813 ± 0.693
ArgTrp: 1.554 ± 0.87
ArgTyr: 2.591 ± 1.358
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.591 ± 0.798
SerCys: 2.591 ± 1.355
SerAsp: 1.813 ± 0.621
SerGlu: 3.627 ± 0.743
SerPhe: 0.777 ± 0.382
SerGly: 3.368 ± 0.98
SerHis: 1.554 ± 0.634
SerIle: 1.813 ± 0.598
SerLys: 2.85 ± 0.678
SerLeu: 4.663 ± 0.93
SerMet: 0.259 ± 0.32
SerAsn: 0.518 ± 0.212
SerPro: 2.591 ± 0.698
SerGln: 2.85 ± 1.321
SerArg: 3.627 ± 1.062
SerSer: 3.368 ± 0.845
SerThr: 4.922 ± 1.409
SerVal: 1.554 ± 0.381
SerTrp: 1.036 ± 0.782
SerTyr: 1.295 ± 0.609
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.404 ± 0.991
ThrCys: 1.295 ± 0.962
ThrAsp: 3.109 ± 0.879
ThrGlu: 4.663 ± 0.547
ThrPhe: 1.036 ± 0.401
ThrGly: 3.109 ± 0.737
ThrHis: 3.368 ± 1.408
ThrIle: 3.109 ± 0.979
ThrLys: 2.85 ± 1.006
ThrLeu: 3.886 ± 0.652
ThrMet: 0.518 ± 0.366
ThrAsn: 4.145 ± 0.975
ThrPro: 6.995 ± 2.296
ThrGln: 2.073 ± 0.653
ThrArg: 3.109 ± 1.796
ThrSer: 3.886 ± 1.156
ThrThr: 2.85 ± 1.122
ThrVal: 5.181 ± 1.215
ThrTrp: 2.85 ± 1.558
ThrTyr: 1.036 ± 0.609
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.663 ± 0.833
ValCys: 1.295 ± 0.469
ValAsp: 2.332 ± 1.089
ValGlu: 3.368 ± 0.888
ValPhe: 1.036 ± 0.852
ValGly: 4.145 ± 1.46
ValHis: 0.518 ± 0.34
ValIle: 2.85 ± 0.667
ValLys: 4.404 ± 1.059
ValLeu: 5.699 ± 1.579
ValMet: 0.518 ± 0.358
ValAsn: 2.073 ± 0.957
ValPro: 5.181 ± 1.442
ValGln: 3.109 ± 0.798
ValArg: 3.109 ± 0.807
ValSer: 1.036 ± 0.342
ValThr: 3.109 ± 0.818
ValVal: 3.368 ± 1.117
ValTrp: 1.554 ± 0.592
ValTyr: 1.036 ± 0.538
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.036 ± 0.366
TrpCys: 0.777 ± 0.237
TrpAsp: 1.813 ± 0.525
TrpGlu: 0.777 ± 0.305
TrpPhe: 1.036 ± 0.852
TrpGly: 1.295 ± 0.556
TrpHis: 1.295 ± 0.868
TrpIle: 2.073 ± 0.424
TrpLys: 3.368 ± 0.957
TrpLeu: 1.036 ± 0.672
TrpMet: 1.295 ± 0.575
TrpAsn: 1.295 ± 0.44
TrpPro: 1.295 ± 0.53
TrpGln: 1.554 ± 0.503
TrpArg: 1.554 ± 1.042
TrpSer: 0.0 ± 0.0
TrpThr: 1.554 ± 0.698
TrpVal: 1.295 ± 0.347
TrpTrp: 0.777 ± 0.417
TrpTyr: 0.518 ± 0.386
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.036 ± 0.424
TyrCys: 1.295 ± 0.897
TyrAsp: 0.518 ± 0.255
TyrGlu: 1.295 ± 0.733
TyrPhe: 1.036 ± 0.611
TyrGly: 1.036 ± 0.582
TyrHis: 1.036 ± 0.665
TyrIle: 1.813 ± 0.756
TyrLys: 2.073 ± 0.558
TyrLeu: 1.554 ± 0.894
TyrMet: 1.295 ± 0.539
TyrAsn: 1.554 ± 0.415
TyrPro: 1.813 ± 0.84
TyrGln: 1.295 ± 0.566
TyrArg: 3.368 ± 1.038
TyrSer: 2.332 ± 1.172
TyrThr: 2.332 ± 0.912
TyrVal: 2.073 ± 0.557
TyrTrp: 1.813 ± 0.779
TyrTyr: 0.518 ± 0.212
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3861 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
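A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values serves as the error. The function names, the toy sequences, the fixed random seed, and the use of the standard deviation as the reported error are assumptions made for illustration; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so that the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the 10 HIV-2 proteins).
proteins = ["MKKLAAGE", "AAKKE", "GEGEKK"]
err = bootstrap_error(proteins, "KK")
print("KK:", round(pair_per_mille(proteins, "KK"), 3), "±", round(err, 3))
```

With only 10 proteins, as here, the resamples differ considerably from one another, which may explain why some errors in the table are as large as the values themselves.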

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski