Amino acid dipepetide frequency for Tunis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.253AlaAla: 3.253 ± 1.454
1.027AlaCys: 1.027 ± 0.315
2.74AlaAsp: 2.74 ± 0.3
3.425AlaGlu: 3.425 ± 1.065
1.884AlaPhe: 1.884 ± 0.438
2.397AlaGly: 2.397 ± 1.588
1.199AlaHis: 1.199 ± 0.603
2.397AlaIle: 2.397 ± 0.66
2.226AlaLys: 2.226 ± 0.358
5.479AlaLeu: 5.479 ± 1.404
1.712AlaMet: 1.712 ± 0.228
1.541AlaAsn: 1.541 ± 0.349
1.027AlaPro: 1.027 ± 0.34
1.027AlaGln: 1.027 ± 0.233
3.425AlaArg: 3.425 ± 1.57
3.253AlaSer: 3.253 ± 1.225
2.397AlaThr: 2.397 ± 0.51
4.281AlaVal: 4.281 ± 0.971
1.199AlaTrp: 1.199 ± 1.056
1.027AlaTyr: 1.027 ± 0.577
0.0AlaXaa: 0.0 ± 0.0
Cys
1.884CysAla: 1.884 ± 0.454
1.37CysCys: 1.37 ± 0.307
1.541CysAsp: 1.541 ± 0.349
1.37CysGlu: 1.37 ± 0.307
2.055CysPhe: 2.055 ± 0.877
0.856CysGly: 0.856 ± 0.433
1.027CysHis: 1.027 ± 0.315
1.884CysIle: 1.884 ± 0.782
3.082CysLys: 3.082 ± 0.698
2.74CysLeu: 2.74 ± 0.614
0.342CysMet: 0.342 ± 0.192
1.712CysAsn: 1.712 ± 0.497
1.541CysPro: 1.541 ± 1.092
0.342CysGln: 0.342 ± 0.192
1.541CysArg: 1.541 ± 0.349
1.712CysSer: 1.712 ± 0.497
2.74CysThr: 2.74 ± 1.186
1.199CysVal: 1.199 ± 0.948
0.685CysTrp: 0.685 ± 0.297
1.027CysTyr: 1.027 ± 0.907
0.0CysXaa: 0.0 ± 0.0
Asp
2.911AspAla: 2.911 ± 1.186
2.397AspCys: 2.397 ± 0.51
1.712AspAsp: 1.712 ± 0.711
3.082AspGlu: 3.082 ± 0.754
2.568AspPhe: 2.568 ± 1.163
2.055AspGly: 2.055 ± 0.64
0.685AspHis: 0.685 ± 0.154
3.425AspIle: 3.425 ± 0.627
2.911AspLys: 2.911 ± 0.447
5.137AspLeu: 5.137 ± 0.726
1.199AspMet: 1.199 ± 1.46
3.938AspAsn: 3.938 ± 0.443
1.541AspPro: 1.541 ± 0.593
1.884AspGln: 1.884 ± 0.133
3.082AspArg: 3.082 ± 0.771
4.795AspSer: 4.795 ± 0.903
3.253AspThr: 3.253 ± 0.381
4.281AspVal: 4.281 ± 1.111
0.514AspTrp: 0.514 ± 0.289
1.884AspTyr: 1.884 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
2.055GluAla: 2.055 ± 0.63
1.884GluCys: 1.884 ± 0.133
4.452GluAsp: 4.452 ± 0.832
3.938GluGlu: 3.938 ± 0.904
2.74GluPhe: 2.74 ± 0.789
3.082GluGly: 3.082 ± 0.543
1.884GluHis: 1.884 ± 0.438
3.596GluIle: 3.596 ± 1.469
2.911GluLys: 2.911 ± 0.856
7.705GluLeu: 7.705 ± 1.32
1.199GluMet: 1.199 ± 0.255
3.082GluAsn: 3.082 ± 0.259
2.055GluPro: 2.055 ± 0.043
3.596GluGln: 3.596 ± 0.367
3.425GluArg: 3.425 ± 0.457
3.938GluSer: 3.938 ± 1.403
3.082GluThr: 3.082 ± 1.622
6.336GluVal: 6.336 ± 0.96
1.37GluTrp: 1.37 ± 0.812
1.712GluTyr: 1.712 ± 0.501
0.0GluXaa: 0.0 ± 0.0
Phe
1.027PheAla: 1.027 ± 0.212
0.342PheCys: 0.342 ± 0.192
3.596PheAsp: 3.596 ± 1.19
2.226PheGlu: 2.226 ± 0.066
2.911PhePhe: 2.911 ± 1.129
2.226PheGly: 2.226 ± 0.873
1.027PheHis: 1.027 ± 0.728
3.082PheIle: 3.082 ± 0.364
2.74PheLys: 2.74 ± 0.326
3.596PheLeu: 3.596 ± 0.579
1.199PheMet: 1.199 ± 0.468
2.055PheAsn: 2.055 ± 0.568
1.37PhePro: 1.37 ± 0.541
1.884PheGln: 1.884 ± 1.199
2.226PheArg: 2.226 ± 0.406
4.452PheSer: 4.452 ± 0.592
2.226PheThr: 2.226 ± 0.529
2.055PheVal: 2.055 ± 0.465
0.342PheTrp: 0.342 ± 0.148
1.027PheTyr: 1.027 ± 0.577
0.0PheXaa: 0.0 ± 0.0
Gly
1.199GlyAla: 1.199 ± 1.056
1.884GlyCys: 1.884 ± 1.532
2.911GlyAsp: 2.911 ± 0.719
1.37GlyGlu: 1.37 ± 0.163
1.199GlyPhe: 1.199 ± 0.249
1.712GlyGly: 1.712 ± 0.867
1.884GlyHis: 1.884 ± 0.686
3.767GlyIle: 3.767 ± 1.184
4.452GlyLys: 4.452 ± 0.832
6.164GlyLeu: 6.164 ± 0.461
0.342GlyMet: 0.342 ± 0.192
1.37GlyAsn: 1.37 ± 0.42
1.37GlyPro: 1.37 ± 0.718
1.884GlyGln: 1.884 ± 0.755
2.74GlyArg: 2.74 ± 0.972
4.11GlySer: 4.11 ± 1.619
3.425GlyThr: 3.425 ± 1.225
2.74GlyVal: 2.74 ± 0.627
0.342GlyTrp: 0.342 ± 0.476
1.884GlyTyr: 1.884 ± 1.037
0.0GlyXaa: 0.0 ± 0.0
His
1.541HisAla: 1.541 ± 0.621
0.856HisCys: 0.856 ± 0.249
0.685HisAsp: 0.685 ± 0.584
1.199HisGlu: 1.199 ± 0.249
0.685HisPhe: 0.685 ± 0.297
1.541HisGly: 1.541 ± 0.324
1.199HisHis: 1.199 ± 0.948
1.37HisIle: 1.37 ± 0.593
1.199HisLys: 1.199 ± 0.406
1.884HisLeu: 1.884 ± 0.542
0.856HisMet: 0.856 ± 0.676
1.027HisAsn: 1.027 ± 1.025
1.884HisPro: 1.884 ± 0.555
1.199HisGln: 1.199 ± 0.516
1.884HisArg: 1.884 ± 0.438
2.055HisSer: 2.055 ± 0.266
1.541HisThr: 1.541 ± 0.324
0.514HisVal: 0.514 ± 0.116
0.514HisTrp: 0.514 ± 0.364
0.685HisTyr: 0.685 ± 0.297
0.0HisXaa: 0.0 ± 0.0
Ile
2.74IleAla: 2.74 ± 0.942
1.37IleCys: 1.37 ± 0.307
3.082IleAsp: 3.082 ± 0.648
2.911IleGlu: 2.911 ± 0.573
2.568IlePhe: 2.568 ± 0.546
2.055IleGly: 2.055 ± 0.941
2.055IleHis: 2.055 ± 0.568
2.911IleIle: 2.911 ± 0.743
5.479IleLys: 5.479 ± 0.987
5.822IleLeu: 5.822 ± 1.105
1.199IleMet: 1.199 ± 0.215
3.938IleAsn: 3.938 ± 1.171
1.199IlePro: 1.199 ± 0.255
2.74IleGln: 2.74 ± 0.998
2.226IleArg: 2.226 ± 0.358
5.479IleSer: 5.479 ± 1.062
3.596IleThr: 3.596 ± 1.19
4.623IleVal: 4.623 ± 0.449
0.514IleTrp: 0.514 ± 0.116
2.055IleTyr: 2.055 ± 0.461
0.0IleXaa: 0.0 ± 0.0
Lys
4.11LysAla: 4.11 ± 2.013
1.027LysCys: 1.027 ± 0.315
3.938LysAsp: 3.938 ± 0.136
7.021LysGlu: 7.021 ± 1.938
3.425LysPhe: 3.425 ± 0.735
2.397LysGly: 2.397 ± 0.75
1.37LysHis: 1.37 ± 0.42
3.082LysIle: 3.082 ± 0.643
4.11LysLys: 4.11 ± 1.755
9.418LysLeu: 9.418 ± 1.41
1.027LysMet: 1.027 ± 0.224
2.226LysAsn: 2.226 ± 0.569
2.055LysPro: 2.055 ± 1.154
2.911LysGln: 2.911 ± 0.463
2.911LysArg: 2.911 ± 1.036
5.308LysSer: 5.308 ± 0.711
4.281LysThr: 4.281 ± 1.173
5.993LysVal: 5.993 ± 0.506
1.199LysTrp: 1.199 ± 0.215
1.37LysTyr: 1.37 ± 0.769
0.0LysXaa: 0.0 ± 0.0
Leu
4.966LeuAla: 4.966 ± 0.285
3.082LeuCys: 3.082 ± 0.945
6.336LeuAsp: 6.336 ± 1.258
8.39LeuGlu: 8.39 ± 0.543
3.596LeuPhe: 3.596 ± 0.997
4.452LeuGly: 4.452 ± 0.527
2.226LeuHis: 2.226 ± 0.588
5.651LeuIle: 5.651 ± 0.362
9.247LeuLys: 9.247 ± 0.813
11.473LeuLeu: 11.473 ± 1.438
2.74LeuMet: 2.74 ± 0.155
5.651LeuAsn: 5.651 ± 1.377
3.938LeuPro: 3.938 ± 0.572
3.767LeuGln: 3.767 ± 0.824
4.623LeuArg: 4.623 ± 0.68
10.103LeuSer: 10.103 ± 0.319
9.247LeuThr: 9.247 ± 2.201
6.164LeuVal: 6.164 ± 0.185
1.199LeuTrp: 1.199 ± 0.703
1.884LeuTyr: 1.884 ± 0.438
0.0LeuXaa: 0.0 ± 0.0
Met
1.37MetAla: 1.37 ± 0.656
0.856MetCys: 0.856 ± 0.228
1.199MetAsp: 1.199 ± 0.703
1.884MetGlu: 1.884 ± 1.364
0.856MetPhe: 0.856 ± 0.481
1.027MetGly: 1.027 ± 0.577
0.685MetHis: 0.685 ± 0.528
1.37MetIle: 1.37 ± 0.769
1.541MetLys: 1.541 ± 0.593
2.226MetLeu: 2.226 ± 0.608
0.685MetMet: 0.685 ± 0.328
1.027MetAsn: 1.027 ± 0.34
0.685MetPro: 0.685 ± 0.328
0.171MetGln: 0.171 ± 0.096
0.685MetArg: 0.685 ± 0.584
2.568MetSer: 2.568 ± 1.248
1.199MetThr: 1.199 ± 0.215
1.541MetVal: 1.541 ± 0.1
0.171MetTrp: 0.171 ± 0.403
0.342MetTyr: 0.342 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
3.253AsnAla: 3.253 ± 0.599
1.884AsnCys: 1.884 ± 0.686
2.568AsnAsp: 2.568 ± 0.144
2.74AsnGlu: 2.74 ± 0.641
1.884AsnPhe: 1.884 ± 0.485
1.541AsnGly: 1.541 ± 0.624
1.37AsnHis: 1.37 ± 0.307
1.884AsnIle: 1.884 ± 0.133
3.938AsnLys: 3.938 ± 0.976
5.479AsnLeu: 5.479 ± 0.845
0.514AsnMet: 0.514 ± 0.328
2.568AsnAsn: 2.568 ± 0.144
1.541AsnPro: 1.541 ± 0.56
1.712AsnGln: 1.712 ± 0.365
2.397AsnArg: 2.397 ± 0.946
4.966AsnSer: 4.966 ± 0.699
3.253AsnThr: 3.253 ± 0.274
5.137AsnVal: 5.137 ± 0.527
0.685AsnTrp: 0.685 ± 0.297
1.37AsnTyr: 1.37 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
2.055ProAla: 2.055 ± 0.942
0.514ProCys: 0.514 ± 0.289
1.884ProAsp: 1.884 ± 0.133
2.74ProGlu: 2.74 ± 0.155
1.027ProPhe: 1.027 ± 0.212
1.541ProGly: 1.541 ± 0.624
0.0ProHis: 0.0 ± 0.0
2.055ProIle: 2.055 ± 0.63
1.541ProLys: 1.541 ± 0.1
2.568ProLeu: 2.568 ± 0.706
0.514ProMet: 0.514 ± 0.289
2.226ProAsn: 2.226 ± 0.358
1.027ProPro: 1.027 ± 0.653
1.027ProGln: 1.027 ± 0.315
1.541ProArg: 1.541 ± 0.593
3.253ProSer: 3.253 ± 1.035
3.253ProThr: 3.253 ± 0.928
2.226ProVal: 2.226 ± 0.588
0.685ProTrp: 0.685 ± 0.781
1.027ProTyr: 1.027 ± 0.728
0.0ProXaa: 0.0 ± 0.0
Gln
1.884GlnAla: 1.884 ± 0.454
1.712GlnCys: 1.712 ± 0.365
2.911GlnAsp: 2.911 ± 0.573
2.226GlnGlu: 2.226 ± 0.72
1.884GlnPhe: 1.884 ± 0.133
1.712GlnGly: 1.712 ± 0.228
1.027GlnHis: 1.027 ± 0.233
1.199GlnIle: 1.199 ± 0.392
2.568GlnLys: 2.568 ± 0.685
4.795GlnLeu: 4.795 ± 1.791
1.199GlnMet: 1.199 ± 0.215
1.884GlnAsn: 1.884 ± 1.199
1.37GlnPro: 1.37 ± 0.593
1.712GlnGln: 1.712 ± 0.705
1.37GlnArg: 1.37 ± 0.499
2.226GlnSer: 2.226 ± 0.846
1.541GlnThr: 1.541 ± 0.593
2.226GlnVal: 2.226 ± 0.264
0.0GlnTrp: 0.0 ± 0.0
1.027GlnTyr: 1.027 ± 0.445
0.0GlnXaa: 0.0 ± 0.0
Arg
1.712ArgAla: 1.712 ± 0.501
1.027ArgCys: 1.027 ± 0.577
2.226ArgAsp: 2.226 ± 0.664
2.911ArgGlu: 2.911 ± 0.663
3.082ArgPhe: 3.082 ± 0.854
3.082ArgGly: 3.082 ± 1.276
1.712ArgHis: 1.712 ± 0.501
2.911ArgIle: 2.911 ± 0.743
2.397ArgLys: 2.397 ± 0.51
7.021ArgLeu: 7.021 ± 1.464
0.856ArgMet: 0.856 ± 0.249
2.911ArgAsn: 2.911 ± 0.253
0.856ArgPro: 0.856 ± 0.433
1.541ArgGln: 1.541 ± 0.1
2.911ArgArg: 2.911 ± 2.259
3.253ArgSer: 3.253 ± 0.599
2.911ArgThr: 2.911 ± 1.341
3.082ArgVal: 3.082 ± 0.754
0.342ArgTrp: 0.342 ± 0.148
1.199ArgTyr: 1.199 ± 0.215
0.0ArgXaa: 0.0 ± 0.0
Ser
3.767SerAla: 3.767 ± 0.071
3.253SerCys: 3.253 ± 0.552
2.397SerAsp: 2.397 ± 0.51
5.137SerGlu: 5.137 ± 1.092
3.253SerPhe: 3.253 ± 1.058
4.795SerGly: 4.795 ± 1.227
1.37SerHis: 1.37 ± 0.307
6.507SerIle: 6.507 ± 0.273
5.651SerLys: 5.651 ± 0.237
9.418SerLeu: 9.418 ± 0.796
2.226SerMet: 2.226 ± 0.066
4.623SerAsn: 4.623 ± 0.68
2.74SerPro: 2.74 ± 0.942
2.397SerGln: 2.397 ± 0.812
3.938SerArg: 3.938 ± 1.403
9.075SerSer: 9.075 ± 1.076
5.308SerThr: 5.308 ± 1.168
7.705SerVal: 7.705 ± 1.224
1.199SerTrp: 1.199 ± 0.516
3.767SerTyr: 3.767 ± 0.381
0.0SerXaa: 0.0 ± 0.0
Thr
2.397ThrAla: 2.397 ± 0.16
2.397ThrCys: 2.397 ± 0.789
3.596ThrAsp: 3.596 ± 1.469
4.11ThrGlu: 4.11 ± 0.847
3.253ThrPhe: 3.253 ± 0.625
4.281ThrGly: 4.281 ± 1.718
1.541ThrHis: 1.541 ± 0.539
3.938ThrIle: 3.938 ± 1.311
3.596ThrLys: 3.596 ± 0.367
6.164ThrLeu: 6.164 ± 0.965
1.541ThrMet: 1.541 ± 0.22
2.397ThrAsn: 2.397 ± 0.497
2.055ThrPro: 2.055 ± 0.461
2.911ThrGln: 2.911 ± 0.681
2.055ThrArg: 2.055 ± 0.941
6.678ThrSer: 6.678 ± 0.923
4.11ThrThr: 4.11 ± 1.499
4.795ThrVal: 4.795 ± 1.273
0.856ThrTrp: 0.856 ± 0.228
2.055ThrTyr: 2.055 ± 0.573
0.0ThrXaa: 0.0 ± 0.0
Val
3.767ValAla: 3.767 ± 1.215
1.884ValCys: 1.884 ± 1.238
3.253ValAsp: 3.253 ± 0.832
5.308ValGlu: 5.308 ± 1.033
1.541ValPhe: 1.541 ± 0.54
3.938ValGly: 3.938 ± 0.447
0.685ValHis: 0.685 ± 0.154
4.966ValIle: 4.966 ± 0.414
6.164ValLys: 6.164 ± 0.739
7.534ValLeu: 7.534 ± 0.984
1.027ValMet: 1.027 ± 0.233
2.911ValAsn: 2.911 ± 0.888
3.253ValPro: 3.253 ± 0.718
2.055ValGln: 2.055 ± 0.568
3.596ValArg: 3.596 ± 0.765
7.534ValSer: 7.534 ± 1.467
5.479ValThr: 5.479 ± 1.03
4.966ValVal: 4.966 ± 1.063
0.171ValTrp: 0.171 ± 0.096
1.027ValTyr: 1.027 ± 0.445
0.0ValXaa: 0.0 ± 0.0
Trp
0.342TrpAla: 0.342 ± 0.442
0.514TrpCys: 0.514 ± 0.116
0.685TrpAsp: 0.685 ± 1.155
0.856TrpGlu: 0.856 ± 0.249
0.342TrpPhe: 0.342 ± 0.476
1.027TrpGly: 1.027 ± 0.653
0.171TrpHis: 0.171 ± 0.221
0.856TrpIle: 0.856 ± 0.249
1.541TrpLys: 1.541 ± 0.677
1.199TrpLeu: 1.199 ± 0.603
0.856TrpMet: 0.856 ± 0.901
0.856TrpAsn: 0.856 ± 0.25
0.514TrpPro: 0.514 ± 0.116
0.171TrpGln: 0.171 ± 0.221
0.171TrpArg: 0.171 ± 0.096
0.856TrpSer: 0.856 ± 0.249
0.856TrpThr: 0.856 ± 0.249
0.514TrpVal: 0.514 ± 0.116
0.171TrpTrp: 0.171 ± 0.096
0.685TrpTyr: 0.685 ± 0.314
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.685TyrAla: 0.685 ± 0.328
1.37TyrCys: 1.37 ± 0.874
1.541TyrAsp: 1.541 ± 0.349
1.199TyrGlu: 1.199 ± 0.215
0.856TyrPhe: 0.856 ± 0.249
1.027TyrGly: 1.027 ± 0.233
1.199TyrHis: 1.199 ± 0.657
1.712TyrIle: 1.712 ± 0.687
1.884TyrLys: 1.884 ± 0.454
2.911TyrLeu: 2.911 ± 0.285
0.685TyrMet: 0.685 ± 0.385
2.74TyrAsn: 2.74 ± 0.3
0.685TyrPro: 0.685 ± 0.297
1.541TyrGln: 1.541 ± 0.324
1.199TyrArg: 1.199 ± 0.215
2.911TyrSer: 2.911 ± 0.619
1.199TyrThr: 1.199 ± 1.114
0.685TyrVal: 0.685 ± 0.154
1.027TyrTrp: 1.027 ± 0.404
1.541TyrTyr: 1.541 ± 0.349
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5841 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski