Amino acid dipeptide frequency for HIV-1 M_02CD.MBTB047

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
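As a rough illustration of the idea above, the following sketch counts dipeptides in a set of sequences and labels each one relative to the uniform expectation of 2.5‰. The function names, the overlapping-window counting scheme, the mapping of non-standard letters to X, and the toy sequences are all assumptions made for this example; the source does not state how the table was actually computed.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Dipeptides are counted as overlapping windows inside each sequence and
    any non-standard letter is mapped to 'X' (Xaa). This counting scheme is
    an assumption, not necessarily the one used for the table.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation."""
    expected = 1000.0 / alphabet_size ** 2  # 2.5 per mille for 400 dipeptides
    if freq_per_mille > expected:
        return "over"   # would be marked in red
    if freq_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"


# Toy input for illustration only, not real HIV-1 sequences
toy = ["MGARASVLSGG", "MKKLLAXA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With the table values, `classify(4.541)` (AlaAla) falls above the uniform expectation and `classify(1.746)` (AlaCys) falls below it.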

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
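The unit conversion can be sketched as below, using the AlaAla value from the table as input. The helper names are hypothetical, and the comparison against the uniform 2.5‰ expectation assumes the 400-dipeptide case described above.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


ala_ala = 4.541          # AlaAla value taken from the table, in per mille
uniform = 1000.0 / 400   # uniform expectation for 400 dipeptides, in per mille

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
print(ala_ala / uniform)               # ratio to the uniform expectation
```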

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.541 ± 0.842
AlaCys: 1.746 ± 0.676
AlaAsp: 1.746 ± 0.444
AlaGlu: 5.938 ± 1.557
AlaPhe: 1.397 ± 0.382
AlaGly: 4.89 ± 1.103
AlaHis: 1.048 ± 0.674
AlaIle: 5.589 ± 1.613
AlaLys: 3.493 ± 0.73
AlaLeu: 6.287 ± 1.613
AlaMet: 1.746 ± 0.426
AlaAsn: 2.794 ± 1.39
AlaPro: 2.096 ± 0.623
AlaGln: 3.144 ± 0.518
AlaArg: 4.191 ± 1.082
AlaSer: 3.493 ± 1.155
AlaThr: 2.794 ± 0.317
AlaVal: 5.589 ± 0.879
AlaTrp: 1.746 ± 0.426
AlaTyr: 1.397 ± 0.437
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.048 ± 0.78
CysCys: 0.0 ± 0.0
CysAsp: 0.349 ± 0.225
CysGlu: 0.349 ± 0.473
CysPhe: 1.048 ± 0.709
CysGly: 1.048 ± 0.674
CysHis: 0.0 ± 0.0
CysIle: 0.349 ± 0.3
CysLys: 1.048 ± 0.674
CysLeu: 0.349 ± 0.3
CysMet: 0.0 ± 0.543
CysAsn: 1.746 ± 1.06
CysPro: 0.349 ± 0.3
CysGln: 1.397 ± 0.515
CysArg: 1.048 ± 0.901
CysSer: 2.096 ± 1.01
CysThr: 2.794 ± 1.147
CysVal: 1.746 ± 0.676
CysTrp: 0.699 ± 0.449
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.397 ± 0.581
AspCys: 2.445 ± 1.153
AspAsp: 1.048 ± 0.674
AspGlu: 0.699 ± 0.219
AspPhe: 1.746 ± 0.508
AspGly: 2.445 ± 0.643
AspHis: 1.048 ± 0.62
AspIle: 5.589 ± 1.136
AspLys: 5.589 ± 1.227
AspLeu: 3.493 ± 1.266
AspMet: 0.349 ± 0.217
AspAsn: 2.096 ± 0.652
AspPro: 2.445 ± 0.494
AspGln: 1.746 ± 0.778
AspArg: 3.842 ± 2.269
AspSer: 3.144 ± 1.26
AspThr: 3.493 ± 0.859
AspVal: 0.699 ± 0.449
AspTrp: 0.699 ± 0.615
AspTyr: 0.699 ± 0.449
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.541 ± 1.808
GluCys: 0.699 ± 0.558
GluAsp: 2.794 ± 1.317
GluGlu: 4.89 ± 1.676
GluPhe: 1.746 ± 0.725
GluGly: 4.89 ± 0.841
GluHis: 0.349 ± 0.225
GluIle: 5.239 ± 1.523
GluLys: 5.239 ± 1.642
GluLeu: 5.938 ± 1.816
GluMet: 2.096 ± 0.525
GluAsn: 2.794 ± 0.542
GluPro: 2.794 ± 0.714
GluGln: 3.493 ± 0.768
GluArg: 2.794 ± 0.542
GluSer: 2.096 ± 1.209
GluThr: 4.191 ± 1.939
GluVal: 4.89 ± 1.685
GluTrp: 2.096 ± 1.014
GluTyr: 2.096 ± 1.218
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.699 ± 0.219
PheCys: 0.699 ± 0.601
PheAsp: 1.048 ± 1.149
PheGlu: 0.699 ± 0.558
PhePhe: 1.746 ± 0.916
PheGly: 0.699 ± 0.504
PheHis: 0.0 ± 0.0
PheIle: 1.397 ± 0.515
PheLys: 1.746 ± 0.799
PheLeu: 2.445 ± 0.907
PheMet: 0.0 ± 0.0
PheAsn: 3.144 ± 1.425
PhePro: 2.445 ± 1.226
PheGln: 0.699 ± 0.219
PheArg: 3.493 ± 1.506
PheSer: 2.096 ± 0.567
PheThr: 1.746 ± 0.426
PheVal: 1.048 ± 0.675
PheTrp: 0.349 ± 0.225
PheTyr: 1.397 ± 0.764
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.89 ± 1.002
GlyCys: 1.746 ± 0.581
GlyAsp: 3.493 ± 1.633
GlyGlu: 4.191 ± 1.218
GlyPhe: 2.794 ± 1.516
GlyGly: 7.684 ± 1.192
GlyHis: 3.144 ± 1.549
GlyIle: 5.589 ± 0.825
GlyLys: 5.239 ± 2.039
GlyLeu: 6.986 ± 2.247
GlyMet: 1.397 ± 0.591
GlyAsn: 2.445 ± 0.643
GlyPro: 3.493 ± 1.19
GlyGln: 5.938 ± 1.737
GlyArg: 3.493 ± 1.506
GlySer: 3.144 ± 0.802
GlyThr: 2.096 ± 0.95
GlyVal: 4.191 ± 0.523
GlyTrp: 1.746 ± 1.186
GlyTyr: 2.096 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.746 ± 0.817
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.349 ± 0.225
HisPhe: 1.048 ± 1.218
HisGly: 2.445 ± 1.425
HisHis: 0.699 ± 0.558
HisIle: 1.048 ± 0.873
HisLys: 1.397 ± 0.821
HisLeu: 3.493 ± 0.704
HisMet: 1.048 ± 0.919
HisAsn: 1.048 ± 0.62
HisPro: 1.746 ± 0.668
HisGln: 1.746 ± 1.123
HisArg: 0.699 ± 0.41
HisSer: 1.746 ± 0.778
HisThr: 1.746 ± 0.534
HisVal: 0.349 ± 0.225
HisTrp: 0.0 ± 0.0
HisTyr: 0.699 ± 0.558
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.842 ± 1.134
IleCys: 1.048 ± 0.326
IleAsp: 1.746 ± 0.869
IleGlu: 3.144 ± 0.743
IlePhe: 1.746 ± 0.479
IleGly: 4.541 ± 1.618
IleHis: 2.096 ± 0.807
IleIle: 6.636 ± 1.067
IleLys: 4.89 ± 1.665
IleLeu: 6.986 ± 1.079
IleMet: 1.048 ± 0.475
IleAsn: 3.493 ± 1.093
IlePro: 3.493 ± 1.003
IleGln: 3.144 ± 1.605
IleArg: 4.89 ± 1.682
IleSer: 2.096 ± 0.95
IleThr: 3.144 ± 1.823
IleVal: 6.986 ± 2.58
IleTrp: 1.397 ± 0.437
IleTyr: 3.144 ± 1.077
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.986 ± 1.216
LysCys: 1.746 ± 0.41
LysAsp: 5.239 ± 0.957
LysGlu: 4.89 ± 2.246
LysPhe: 1.048 ± 0.53
LysGly: 4.89 ± 1.357
LysHis: 1.397 ± 1.007
LysIle: 5.239 ± 1.796
LysLys: 5.589 ± 2.294
LysLeu: 5.938 ± 1.589
LysMet: 0.349 ± 0.225
LysAsn: 2.794 ± 1.955
LysPro: 2.096 ± 0.807
LysGln: 4.191 ± 0.647
LysArg: 2.096 ± 0.567
LysSer: 2.445 ± 0.516
LysThr: 2.794 ± 1.398
LysVal: 6.986 ± 1.687
LysTrp: 2.445 ± 0.42
LysTyr: 2.445 ± 0.467
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.636 ± 1.403
LeuCys: 1.746 ± 1.06
LeuAsp: 4.89 ± 1.39
LeuGlu: 6.287 ± 1.415
LeuPhe: 2.794 ± 1.116
LeuGly: 5.938 ± 1.144
LeuHis: 2.794 ± 1.725
LeuIle: 3.493 ± 1.828
LeuLys: 8.034 ± 0.552
LeuLeu: 7.335 ± 2.222
LeuMet: 1.048 ± 1.002
LeuAsn: 3.144 ± 0.978
LeuPro: 2.445 ± 0.76
LeuGln: 4.191 ± 1.623
LeuArg: 4.89 ± 0.875
LeuSer: 3.493 ± 0.715
LeuThr: 4.191 ± 0.716
LeuVal: 5.938 ± 2.03
LeuTrp: 3.493 ± 0.915
LeuTyr: 1.397 ± 0.339
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.048 ± 0.674
MetCys: 0.0 ± 0.0
MetAsp: 0.699 ± 0.449
MetGlu: 2.096 ± 1.044
MetPhe: 0.699 ± 0.219
MetGly: 2.794 ± 0.715
MetHis: 0.699 ± 0.601
MetIle: 1.746 ± 0.444
MetLys: 0.699 ± 0.219
MetLeu: 0.699 ± 0.219
MetMet: 0.699 ± 0.449
MetAsn: 1.746 ± 0.444
MetPro: 0.0 ± 0.0
MetGln: 0.699 ± 0.449
MetArg: 1.397 ± 0.979
MetSer: 0.699 ± 0.504
MetThr: 2.794 ± 0.562
MetVal: 0.699 ± 0.219
MetTrp: 1.048 ± 0.78
MetTyr: 1.048 ± 0.403
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.096 ± 0.652
AsnCys: 2.794 ± 1.528
AsnAsp: 1.746 ± 0.508
AsnGlu: 3.144 ± 0.684
AsnPhe: 2.794 ± 1.383
AsnGly: 3.144 ± 1.182
AsnHis: 0.349 ± 0.3
AsnIle: 2.794 ± 0.874
AsnLys: 2.794 ± 0.317
AsnLeu: 3.144 ± 1.425
AsnMet: 2.096 ± 1.357
AsnAsn: 2.445 ± 1.359
AsnPro: 3.493 ± 1.04
AsnGln: 1.397 ± 0.437
AsnArg: 2.445 ± 0.95
AsnSer: 1.746 ± 0.437
AsnThr: 4.89 ± 1.032
AsnVal: 2.445 ± 0.996
AsnTrp: 2.096 ± 0.656
AsnTyr: 1.048 ± 0.46
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.144 ± 1.044
ProCys: 0.699 ± 0.601
ProAsp: 2.096 ± 1.06
ProGlu: 3.493 ± 1.249
ProPhe: 1.397 ± 0.899
ProGly: 4.541 ± 0.633
ProHis: 0.349 ± 0.225
ProIle: 4.191 ± 1.884
ProLys: 2.096 ± 0.567
ProLeu: 4.541 ± 1.316
ProMet: 1.048 ± 0.838
ProAsn: 1.048 ± 0.901
ProPro: 2.445 ± 0.494
ProGln: 2.794 ± 0.841
ProArg: 2.096 ± 0.724
ProSer: 1.397 ± 0.339
ProThr: 1.048 ± 0.475
ProVal: 4.541 ± 0.948
ProTrp: 1.048 ± 1.097
ProTyr: 1.048 ± 0.675
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.589 ± 0.875
GlnCys: 0.699 ± 0.601
GlnAsp: 2.445 ± 0.833
GlnGlu: 4.89 ± 0.829
GlnPhe: 0.349 ± 0.3
GlnGly: 4.89 ± 1.52
GlnHis: 1.397 ± 0.582
GlnIle: 4.89 ± 1.436
GlnLys: 3.144 ± 1.614
GlnLeu: 5.938 ± 0.469
GlnMet: 2.445 ± 0.867
GlnAsn: 3.144 ± 1.154
GlnPro: 1.048 ± 0.674
GlnGln: 3.842 ± 0.846
GlnArg: 2.445 ± 1.104
GlnSer: 2.096 ± 0.95
GlnThr: 1.048 ± 0.403
GlnVal: 3.144 ± 1.336
GlnTrp: 1.048 ± 0.326
GlnTyr: 2.096 ± 0.924
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.842 ± 0.744
ArgCys: 0.0 ± 0.0
ArgAsp: 3.842 ± 0.933
ArgGlu: 6.636 ± 1.63
ArgPhe: 0.349 ± 0.225
ArgGly: 3.493 ± 0.768
ArgHis: 1.048 ± 1.269
ArgIle: 3.493 ± 2.145
ArgLys: 4.89 ± 0.819
ArgLeu: 3.493 ± 1.016
ArgMet: 1.397 ± 0.676
ArgAsn: 2.096 ± 0.623
ArgPro: 3.144 ± 1.714
ArgGln: 3.144 ± 0.92
ArgArg: 2.794 ± 1.33
ArgSer: 2.096 ± 0.924
ArgThr: 3.842 ± 1.413
ArgVal: 2.794 ± 1.147
ArgTrp: 1.746 ± 0.844
ArgTyr: 1.397 ± 0.676
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.794 ± 0.651
SerCys: 0.349 ± 0.225
SerAsp: 2.445 ± 0.71
SerGlu: 3.144 ± 0.372
SerPhe: 2.096 ± 1.545
SerGly: 3.842 ± 0.907
SerHis: 0.699 ± 0.884
SerIle: 4.191 ± 0.911
SerLys: 2.794 ± 0.542
SerLeu: 3.842 ± 1.706
SerMet: 0.699 ± 0.449
SerAsn: 2.794 ± 0.465
SerPro: 2.794 ± 1.031
SerGln: 2.794 ± 1.28
SerArg: 3.493 ± 1.32
SerSer: 4.89 ± 0.724
SerThr: 2.445 ± 0.571
SerVal: 1.397 ± 0.651
SerTrp: 1.048 ± 0.475
SerTyr: 0.349 ± 0.3
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.096 ± 0.984
ThrCys: 0.0 ± 0.0
ThrAsp: 3.493 ± 1.015
ThrGlu: 4.191 ± 0.742
ThrPhe: 0.699 ± 0.504
ThrGly: 3.493 ± 0.288
ThrHis: 1.048 ± 0.475
ThrIle: 3.144 ± 0.92
ThrLys: 3.144 ± 1.141
ThrLeu: 5.239 ± 1.087
ThrMet: 1.746 ± 0.444
ThrAsn: 3.493 ± 1.768
ThrPro: 3.842 ± 1.191
ThrGln: 2.794 ± 0.542
ThrArg: 1.746 ± 1.06
ThrSer: 3.493 ± 0.874
ThrThr: 2.794 ± 1.147
ThrVal: 4.89 ± 1.47
ThrTrp: 2.445 ± 0.526
ThrTyr: 1.397 ± 1.138
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.89 ± 1.034
ValCys: 0.0 ± 0.0
ValAsp: 2.445 ± 0.973
ValGlu: 3.842 ± 1.491
ValPhe: 0.349 ± 0.225
ValGly: 5.938 ± 1.451
ValHis: 3.144 ± 0.592
ValIle: 3.842 ± 0.778
ValLys: 4.89 ± 1.519
ValLeu: 4.541 ± 1.03
ValMet: 0.699 ± 0.558
ValAsn: 2.445 ± 0.833
ValPro: 3.144 ± 0.763
ValGln: 4.89 ± 1.47
ValArg: 3.842 ± 0.916
ValSer: 3.842 ± 0.588
ValThr: 4.191 ± 1.014
ValVal: 4.541 ± 0.508
ValTrp: 2.445 ± 0.955
ValTyr: 1.746 ± 0.426
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.746 ± 0.508
TrpCys: 0.349 ± 0.572
TrpAsp: 2.096 ± 0.95
TrpGlu: 1.746 ± 0.508
TrpPhe: 0.349 ± 0.3
TrpGly: 3.144 ± 1.28
TrpHis: 0.699 ± 0.947
TrpIle: 0.699 ± 0.449
TrpLys: 2.794 ± 0.567
TrpLeu: 1.746 ± 1.269
TrpMet: 1.048 ± 0.674
TrpAsn: 1.746 ± 1.287
TrpPro: 1.048 ± 0.53
TrpGln: 2.445 ± 0.834
TrpArg: 1.746 ± 0.444
TrpSer: 1.048 ± 1.057
TrpThr: 2.096 ± 0.724
TrpVal: 1.397 ± 0.382
TrpTrp: 0.699 ± 0.449
TrpTyr: 0.699 ± 0.219
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.096 ± 0.525
TyrCys: 1.048 ± 0.475
TyrAsp: 1.048 ± 0.326
TyrGlu: 1.048 ± 0.807
TyrPhe: 1.397 ± 0.339
TyrGly: 1.048 ± 0.518
TyrHis: 1.048 ± 0.403
TyrIle: 0.349 ± 0.3
TyrLys: 2.096 ± 1.397
TyrLeu: 1.746 ± 0.444
TyrMet: 0.699 ± 0.449
TyrAsn: 2.445 ± 1.104
TyrPro: 0.699 ± 0.504
TyrGln: 2.096 ± 0.977
TyrArg: 2.096 ± 0.807
TyrSer: 1.746 ± 0.534
TyrThr: 1.048 ± 0.46
TyrVal: 1.397 ± 0.899
TyrTrp: 1.048 ± 0.403
TyrTyr: 1.397 ± 0.382
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2864 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
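A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Using the standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the source only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Each resample draws len(sequences) whole proteins with replacement.
    The population standard deviation of the resampled frequencies is
    returned; whether the table uses this exact statistic is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)


# Toy proteins in one-letter code, for illustration only
toy = ["MGARASVLSGG", "MKKLLAAA", "AAGGKKLL", "MLLKAAV"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.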

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski