Amino acid dipepetide frequency for Wolkberg virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.28AlaAla: 1.28 ± 0.615
1.28AlaCys: 1.28 ± 0.241
1.536AlaAsp: 1.536 ± 0.6
2.817AlaGlu: 2.817 ± 1.133
1.28AlaPhe: 1.28 ± 0.587
1.28AlaGly: 1.28 ± 0.837
0.512AlaHis: 0.512 ± 0.32
3.585AlaIle: 3.585 ± 1.672
4.866AlaLys: 4.866 ± 0.742
5.122AlaLeu: 5.122 ± 1.036
1.536AlaMet: 1.536 ± 0.959
2.561AlaAsn: 2.561 ± 1.241
0.256AlaPro: 0.256 ± 0.16
1.28AlaGln: 1.28 ± 0.615
1.536AlaArg: 1.536 ± 1.367
2.049AlaSer: 2.049 ± 1.011
2.561AlaThr: 2.561 ± 1.174
3.073AlaVal: 3.073 ± 1.174
0.256AlaTrp: 0.256 ± 0.16
2.817AlaTyr: 2.817 ± 0.553
0.0AlaXaa: 0.0 ± 0.0
Cys
1.28CysAla: 1.28 ± 0.451
0.256CysCys: 0.256 ± 0.233
0.768CysAsp: 0.768 ± 0.164
1.28CysGlu: 1.28 ± 1.166
1.024CysPhe: 1.024 ± 0.566
1.536CysGly: 1.536 ± 1.399
0.256CysHis: 0.256 ± 0.233
2.817CysIle: 2.817 ± 1.12
3.073CysLys: 3.073 ± 2.059
1.793CysLeu: 1.793 ± 0.567
0.768CysMet: 0.768 ± 0.336
1.536CysAsn: 1.536 ± 0.328
1.536CysPro: 1.536 ± 0.328
1.536CysGln: 1.536 ± 0.673
0.768CysArg: 0.768 ± 0.336
1.28CysSer: 1.28 ± 0.241
1.536CysThr: 1.536 ± 1.399
1.024CysVal: 1.024 ± 0.932
0.0CysTrp: 0.0 ± 0.0
1.793CysTyr: 1.793 ± 1.262
0.0CysXaa: 0.0 ± 0.0
Asp
1.793AspAla: 1.793 ± 1.414
1.024AspCys: 1.024 ± 0.566
4.609AspAsp: 4.609 ± 0.983
3.329AspGlu: 3.329 ± 0.24
5.122AspPhe: 5.122 ± 1.232
2.049AspGly: 2.049 ± 0.452
0.512AspHis: 0.512 ± 0.32
6.914AspIle: 6.914 ± 2.568
3.073AspLys: 3.073 ± 0.242
2.817AspLeu: 2.817 ± 1.091
1.536AspMet: 1.536 ± 0.607
3.841AspAsn: 3.841 ± 0.819
2.305AspPro: 2.305 ± 0.749
2.305AspGln: 2.305 ± 0.376
2.817AspArg: 2.817 ± 1.176
2.817AspSer: 2.817 ± 0.577
2.561AspThr: 2.561 ± 1.418
2.561AspVal: 2.561 ± 0.613
0.768AspTrp: 0.768 ± 0.336
3.073AspTyr: 3.073 ± 0.584
0.0AspXaa: 0.0 ± 0.0
Glu
3.585GluAla: 3.585 ± 0.858
0.768GluCys: 0.768 ± 0.336
3.329GluAsp: 3.329 ± 0.777
4.609GluGlu: 4.609 ± 0.538
2.561GluPhe: 2.561 ± 0.903
1.28GluGly: 1.28 ± 1.166
2.305GluHis: 2.305 ± 1.009
4.866GluIle: 4.866 ± 1.144
6.658GluLys: 6.658 ± 1.365
6.658GluLeu: 6.658 ± 2.744
3.073GluMet: 3.073 ± 1.568
2.817GluAsn: 2.817 ± 0.806
2.817GluPro: 2.817 ± 0.29
2.049GluGln: 2.049 ± 0.479
2.561GluArg: 2.561 ± 1.599
4.866GluSer: 4.866 ± 0.771
4.353GluThr: 4.353 ± 0.966
3.329GluVal: 3.329 ± 1.874
0.512GluTrp: 0.512 ± 0.123
2.817GluTyr: 2.817 ± 1.058
0.0GluXaa: 0.0 ± 0.0
Phe
1.024PheAla: 1.024 ± 0.566
1.024PheCys: 1.024 ± 0.3
3.329PheAsp: 3.329 ± 0.696
3.329PheGlu: 3.329 ± 0.777
1.793PhePhe: 1.793 ± 2.091
4.353PheGly: 4.353 ± 0.829
0.768PheHis: 0.768 ± 0.699
3.841PheIle: 3.841 ± 1.053
4.866PheLys: 4.866 ± 1.582
4.866PheLeu: 4.866 ± 2.612
0.512PheMet: 0.512 ± 0.32
3.841PheAsn: 3.841 ± 1.122
1.024PhePro: 1.024 ± 1.447
0.512PheGln: 0.512 ± 0.466
1.536PheArg: 1.536 ± 0.328
3.841PheSer: 3.841 ± 1.053
3.073PheThr: 3.073 ± 0.508
2.561PheVal: 2.561 ± 0.616
0.768PheTrp: 0.768 ± 0.48
2.305PheTyr: 2.305 ± 1.194
0.0PheXaa: 0.0 ± 0.0
Gly
2.305GlyAla: 2.305 ± 0.853
1.793GlyCys: 1.793 ± 0.901
2.817GlyAsp: 2.817 ± 0.29
2.817GlyGlu: 2.817 ± 1.097
1.28GlyPhe: 1.28 ± 0.837
0.768GlyGly: 0.768 ± 0.699
0.256GlyHis: 0.256 ± 0.16
3.841GlyIle: 3.841 ± 1.735
3.329GlyLys: 3.329 ± 1.002
3.841GlyLeu: 3.841 ± 1.376
0.768GlyMet: 0.768 ± 1.526
2.561GlyAsn: 2.561 ± 1.174
1.28GlyPro: 1.28 ± 0.451
1.536GlyGln: 1.536 ± 0.328
1.28GlyArg: 1.28 ± 0.775
3.841GlySer: 3.841 ± 2.181
3.585GlyThr: 3.585 ± 1.153
1.536GlyVal: 1.536 ± 0.673
0.512GlyTrp: 0.512 ± 0.123
2.305GlyTyr: 2.305 ± 0.853
0.0GlyXaa: 0.0 ± 0.0
His
1.28HisAla: 1.28 ± 0.45
1.024HisCys: 1.024 ± 0.932
1.024HisAsp: 1.024 ± 0.3
1.28HisGlu: 1.28 ± 0.241
0.768HisPhe: 0.768 ± 0.684
1.793HisGly: 1.793 ± 0.564
0.256HisHis: 0.256 ± 0.16
1.793HisIle: 1.793 ± 0.567
1.793HisLys: 1.793 ± 0.456
1.793HisLeu: 1.793 ± 0.456
0.256HisMet: 0.256 ± 0.233
1.536HisAsn: 1.536 ± 0.368
0.768HisPro: 0.768 ± 0.699
1.024HisGln: 1.024 ± 0.566
0.512HisArg: 0.512 ± 0.778
2.049HisSer: 2.049 ± 0.599
1.024HisThr: 1.024 ± 0.3
1.536HisVal: 1.536 ± 0.328
0.256HisTrp: 0.256 ± 0.16
0.256HisTyr: 0.256 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
5.122IleAla: 5.122 ± 0.216
2.561IleCys: 2.561 ± 0.899
3.585IleAsp: 3.585 ± 1.2
5.634IleGlu: 5.634 ± 1.507
3.585IlePhe: 3.585 ± 0.714
4.609IleGly: 4.609 ± 0.751
2.305IleHis: 2.305 ± 0.459
7.426IleIle: 7.426 ± 1.667
7.682IleLys: 7.682 ± 1.447
12.036IleLeu: 12.036 ± 2.311
3.329IleMet: 3.329 ± 0.888
5.89IleAsn: 5.89 ± 0.613
1.536IlePro: 1.536 ± 0.368
3.073IleGln: 3.073 ± 1.346
3.841IleArg: 3.841 ± 1.706
6.146IleSer: 6.146 ± 0.348
5.634IleThr: 5.634 ± 0.6
2.305IleVal: 2.305 ± 0.538
0.512IleTrp: 0.512 ± 0.32
2.817IleTyr: 2.817 ± 0.29
0.0IleXaa: 0.0 ± 0.0
Lys
2.561LysAla: 2.561 ± 0.621
2.817LysCys: 2.817 ± 1.826
5.122LysAsp: 5.122 ± 0.553
7.939LysGlu: 7.939 ± 1.512
5.122LysPhe: 5.122 ± 1.069
3.841LysGly: 3.841 ± 0.172
1.536LysHis: 1.536 ± 0.368
6.146LysIle: 6.146 ± 1.311
5.378LysLys: 5.378 ± 2.262
8.195LysLeu: 8.195 ± 1.004
3.841LysMet: 3.841 ± 0.729
4.097LysAsn: 4.097 ± 1.198
2.305LysPro: 2.305 ± 0.376
1.536LysGln: 1.536 ± 0.328
3.073LysArg: 3.073 ± 0.899
5.378LysSer: 5.378 ± 0.921
7.17LysThr: 7.17 ± 0.587
4.609LysVal: 4.609 ± 0.872
1.28LysTrp: 1.28 ± 0.241
3.329LysTyr: 3.329 ± 1.573
0.0LysXaa: 0.0 ± 0.0
Leu
4.097LeuAla: 4.097 ± 1.746
2.561LeuCys: 2.561 ± 1.594
5.634LeuAsp: 5.634 ± 0.262
7.17LeuGlu: 7.17 ± 0.269
6.146LeuPhe: 6.146 ± 1.526
3.585LeuGly: 3.585 ± 0.955
2.305LeuHis: 2.305 ± 0.538
8.451LeuIle: 8.451 ± 2.146
8.451LeuLys: 8.451 ± 1.897
8.195LeuLeu: 8.195 ± 1.545
2.305LeuMet: 2.305 ± 0.686
6.146LeuAsn: 6.146 ± 0.348
5.122LeuPro: 5.122 ± 0.553
2.561LeuGln: 2.561 ± 0.613
3.329LeuArg: 3.329 ± 0.632
5.89LeuSer: 5.89 ± 1.393
5.378LeuThr: 5.378 ± 1.719
6.146LeuVal: 6.146 ± 1.192
0.512LeuTrp: 0.512 ± 0.466
4.097LeuTyr: 4.097 ± 1.509
0.0LeuXaa: 0.0 ± 0.0
Met
1.536MetAla: 1.536 ± 1.402
0.512MetCys: 0.512 ± 0.32
2.305MetAsp: 2.305 ± 1.295
1.28MetGlu: 1.28 ± 0.451
1.536MetPhe: 1.536 ± 0.6
1.28MetGly: 1.28 ± 0.797
0.256MetHis: 0.256 ± 0.16
2.561MetIle: 2.561 ± 0.903
3.585MetLys: 3.585 ± 0.714
3.073MetLeu: 3.073 ± 0.584
0.512MetMet: 0.512 ± 0.717
2.049MetAsn: 2.049 ± 0.599
0.768MetPro: 0.768 ± 0.164
1.024MetGln: 1.024 ± 1.447
2.305MetArg: 2.305 ± 0.749
2.817MetSer: 2.817 ± 0.41
1.28MetThr: 1.28 ± 0.587
0.256MetVal: 0.256 ± 0.233
0.0MetTrp: 0.0 ± 0.0
0.256MetTyr: 0.256 ± 0.16
0.0MetXaa: 0.0 ± 0.0
Asn
2.561AsnAla: 2.561 ± 1.14
1.793AsnCys: 1.793 ± 0.567
2.561AsnAsp: 2.561 ± 0.616
4.609AsnGlu: 4.609 ± 0.983
3.585AsnPhe: 3.585 ± 0.134
1.024AsnGly: 1.024 ± 0.728
1.793AsnHis: 1.793 ± 0.564
4.866AsnIle: 4.866 ± 1.291
4.609AsnLys: 4.609 ± 0.919
7.682AsnLeu: 7.682 ± 2.105
1.536AsnMet: 1.536 ± 0.328
4.097AsnAsn: 4.097 ± 0.088
2.817AsnPro: 2.817 ± 1.367
2.049AsnGln: 2.049 ± 0.599
4.097AsnArg: 4.097 ± 0.958
4.609AsnSer: 4.609 ± 1.209
2.561AsnThr: 2.561 ± 0.695
1.536AsnVal: 1.536 ± 0.328
0.512AsnTrp: 0.512 ± 0.32
3.585AsnTyr: 3.585 ± 1.2
0.0AsnXaa: 0.0 ± 0.0
Pro
1.28ProAla: 1.28 ± 0.241
0.256ProCys: 0.256 ± 0.233
2.049ProAsp: 2.049 ± 1.072
2.049ProGlu: 2.049 ± 0.738
0.768ProPhe: 0.768 ± 0.164
2.305ProGly: 2.305 ± 1.295
0.512ProHis: 0.512 ± 0.123
3.841ProIle: 3.841 ± 0.618
1.024ProLys: 1.024 ± 0.639
2.305ProLeu: 2.305 ± 0.844
0.768ProMet: 0.768 ± 0.48
1.536ProAsn: 1.536 ± 0.368
0.512ProPro: 0.512 ± 0.32
0.512ProGln: 0.512 ± 0.123
1.024ProArg: 1.024 ± 0.3
2.561ProSer: 2.561 ± 0.402
1.28ProThr: 1.28 ± 0.45
3.073ProVal: 3.073 ± 0.655
1.28ProTrp: 1.28 ± 0.587
0.768ProTyr: 0.768 ± 0.48
0.0ProXaa: 0.0 ± 0.0
Gln
1.024GlnAla: 1.024 ± 0.617
0.256GlnCys: 0.256 ± 0.233
2.049GlnAsp: 2.049 ± 0.49
0.512GlnGlu: 0.512 ± 0.123
1.536GlnPhe: 1.536 ± 1.331
1.28GlnGly: 1.28 ± 0.241
0.256GlnHis: 0.256 ± 0.16
2.561GlnIle: 2.561 ± 0.402
3.329GlnLys: 3.329 ± 1.372
2.817GlnLeu: 2.817 ± 1.467
0.256GlnMet: 0.256 ± 0.16
2.049GlnAsn: 2.049 ± 0.394
0.512GlnPro: 0.512 ± 0.32
0.768GlnGln: 0.768 ± 0.336
2.305GlnArg: 2.305 ± 0.844
2.817GlnSer: 2.817 ± 1.292
2.817GlnThr: 2.817 ± 1.12
1.536GlnVal: 1.536 ± 0.368
0.768GlnTrp: 0.768 ± 0.856
1.536GlnTyr: 1.536 ± 0.527
0.0GlnXaa: 0.0 ± 0.0
Arg
1.536ArgAla: 1.536 ± 0.959
1.024ArgCys: 1.024 ± 0.245
2.049ArgAsp: 2.049 ± 0.479
3.073ArgGlu: 3.073 ± 0.899
2.561ArgPhe: 2.561 ± 1.322
0.512ArgGly: 0.512 ± 0.123
1.793ArgHis: 1.793 ± 0.456
2.817ArgIle: 2.817 ± 0.539
2.561ArgLys: 2.561 ± 0.613
3.585ArgLeu: 3.585 ± 1.879
2.561ArgMet: 2.561 ± 0.359
3.585ArgAsn: 3.585 ± 1.53
0.512ArgPro: 0.512 ± 0.466
1.536ArgGln: 1.536 ± 1.331
1.024ArgArg: 1.024 ± 0.3
3.329ArgSer: 3.329 ± 0.289
1.536ArgThr: 1.536 ± 0.856
2.049ArgVal: 2.049 ± 2.048
0.256ArgTrp: 0.256 ± 0.766
3.073ArgTyr: 3.073 ± 1.053
0.0ArgXaa: 0.0 ± 0.0
Ser
2.049SerAla: 2.049 ± 0.49
2.561SerCys: 2.561 ± 1.594
4.866SerAsp: 4.866 ± 0.397
5.122SerGlu: 5.122 ± 1.301
1.536SerPhe: 1.536 ± 0.368
4.097SerGly: 4.097 ± 1.666
2.049SerHis: 2.049 ± 0.479
9.475SerIle: 9.475 ± 1.915
6.914SerLys: 6.914 ± 0.973
7.682SerLeu: 7.682 ± 0.831
1.793SerMet: 1.793 ± 0.477
3.585SerAsn: 3.585 ± 1.375
1.793SerPro: 1.793 ± 1.119
1.793SerGln: 1.793 ± 0.456
3.073SerArg: 3.073 ± 0.865
4.353SerSer: 4.353 ± 0.876
3.329SerThr: 3.329 ± 0.777
2.817SerVal: 2.817 ± 0.553
1.024SerTrp: 1.024 ± 0.728
2.049SerTyr: 2.049 ± 0.785
0.0SerXaa: 0.0 ± 0.0
Thr
3.329ThrAla: 3.329 ± 1.819
1.024ThrCys: 1.024 ± 0.245
3.585ThrAsp: 3.585 ± 0.134
2.305ThrGlu: 2.305 ± 0.376
3.329ThrPhe: 3.329 ± 0.632
3.329ThrGly: 3.329 ± 0.24
1.024ThrHis: 1.024 ± 0.245
6.658ThrIle: 6.658 ± 2.017
6.402ThrLys: 6.402 ± 1.067
4.609ThrLeu: 4.609 ± 1.58
0.0ThrMet: 0.0 ± 0.0
2.817ThrAsn: 2.817 ± 0.577
1.536ThrPro: 1.536 ± 0.607
1.793ThrGln: 1.793 ± 0.477
2.561ThrArg: 2.561 ± 0.613
4.866ThrSer: 4.866 ± 0.393
3.329ThrThr: 3.329 ± 0.926
2.049ThrVal: 2.049 ± 0.49
0.512ThrTrp: 0.512 ± 0.123
2.817ThrTyr: 2.817 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
2.049ValAla: 2.049 ± 0.394
1.536ValCys: 1.536 ± 0.673
0.768ValAsp: 0.768 ± 0.336
3.329ValGlu: 3.329 ± 0.632
2.305ValPhe: 2.305 ± 0.531
1.536ValGly: 1.536 ± 0.673
2.049ValHis: 2.049 ± 0.394
3.329ValIle: 3.329 ± 0.567
3.585ValLys: 3.585 ± 1.153
4.866ValLeu: 4.866 ± 0.717
1.536ValMet: 1.536 ± 0.527
3.585ValAsn: 3.585 ± 0.134
1.28ValPro: 1.28 ± 0.451
2.049ValGln: 2.049 ± 0.452
1.024ValArg: 1.024 ± 0.245
4.353ValSer: 4.353 ± 0.454
2.049ValThr: 2.049 ± 0.923
1.793ValVal: 1.793 ± 0.567
0.0ValTrp: 0.0 ± 0.0
3.073ValTyr: 3.073 ± 0.242
0.0ValXaa: 0.0 ± 0.0
Trp
0.512TrpAla: 0.512 ± 0.778
0.256TrpCys: 0.256 ± 0.16
0.512TrpAsp: 0.512 ± 0.32
0.512TrpGlu: 0.512 ± 0.32
0.768TrpPhe: 0.768 ± 0.336
0.768TrpGly: 0.768 ± 0.684
0.256TrpHis: 0.256 ± 0.233
0.512TrpIle: 0.512 ± 0.32
0.256TrpLys: 0.256 ± 0.233
1.28TrpLeu: 1.28 ± 0.241
0.256TrpMet: 0.256 ± 0.766
1.28TrpAsn: 1.28 ± 0.241
0.0TrpPro: 0.0 ± 0.0
0.256TrpGln: 0.256 ± 0.16
0.256TrpArg: 0.256 ± 0.233
1.024TrpSer: 1.024 ± 0.639
0.512TrpThr: 0.512 ± 0.717
0.512TrpVal: 0.512 ± 0.466
0.0TrpTrp: 0.0 ± 0.0
0.256TrpTyr: 0.256 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.28TyrAla: 1.28 ± 0.797
1.536TyrCys: 1.536 ± 1.399
3.329TyrAsp: 3.329 ± 1.066
2.817TyrGlu: 2.817 ± 0.553
2.561TyrPhe: 2.561 ± 0.616
1.024TyrGly: 1.024 ± 0.566
1.024TyrHis: 1.024 ± 0.566
3.585TyrIle: 3.585 ± 0.714
3.841TyrLys: 3.841 ± 0.816
4.866TyrLeu: 4.866 ± 1.584
1.793TyrMet: 1.793 ± 0.456
3.073TyrAsn: 3.073 ± 0.865
1.024TyrPro: 1.024 ± 0.617
1.793TyrGln: 1.793 ± 0.456
2.049TyrArg: 2.049 ± 0.479
3.073TyrSer: 3.073 ± 0.899
2.305TyrThr: 2.305 ± 0.459
1.793TyrVal: 1.793 ± 0.653
0.256TyrTrp: 0.256 ± 0.16
2.305TyrTyr: 2.305 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3906 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski