Amino acid dipepetide frequency for Great Saltee virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.541AlaAla: 1.541 ± 2.242
1.37AlaCys: 1.37 ± 0.185
1.712AlaAsp: 1.712 ± 0.122
3.596AlaGlu: 3.596 ± 1.745
2.74AlaPhe: 2.74 ± 0.847
2.568AlaGly: 2.568 ± 0.668
1.027AlaHis: 1.027 ± 0.355
3.425AlaIle: 3.425 ± 2.294
3.425AlaLys: 3.425 ± 0.489
5.651AlaLeu: 5.651 ± 2.23
1.199AlaMet: 1.199 ± 0.743
2.568AlaAsn: 2.568 ± 0.549
1.37AlaPro: 1.37 ± 0.448
1.541AlaGln: 1.541 ± 0.425
1.712AlaArg: 1.712 ± 0.498
4.795AlaSer: 4.795 ± 1.265
3.082AlaThr: 3.082 ± 0.849
3.767AlaVal: 3.767 ± 0.714
0.342AlaTrp: 0.342 ± 0.441
1.712AlaTyr: 1.712 ± 1.147
0.0AlaXaa: 0.0 ± 0.0
Cys
1.37CysAla: 1.37 ± 1.225
1.199CysCys: 1.199 ± 0.317
0.856CysAsp: 0.856 ± 0.567
1.541CysGlu: 1.541 ± 0.444
1.027CysPhe: 1.027 ± 0.355
0.685CysGly: 0.685 ± 0.805
0.514CysHis: 0.514 ± 0.286
1.199CysIle: 1.199 ± 0.418
1.37CysLys: 1.37 ± 0.448
1.884CysLeu: 1.884 ± 0.81
0.0CysMet: 0.0 ± 0.0
1.712CysAsn: 1.712 ± 0.54
1.884CysPro: 1.884 ± 1.076
1.712CysGln: 1.712 ± 0.46
1.37CysArg: 1.37 ± 0.448
2.397CysSer: 2.397 ± 0.521
1.712CysThr: 1.712 ± 1.133
1.37CysVal: 1.37 ± 0.448
1.199CysTrp: 1.199 ± 0.472
0.685CysTyr: 0.685 ± 0.342
0.0CysXaa: 0.0 ± 0.0
Asp
3.938AspAla: 3.938 ± 1.097
1.027AspCys: 1.027 ± 0.33
3.938AspAsp: 3.938 ± 0.189
3.938AspGlu: 3.938 ± 1.009
2.397AspPhe: 2.397 ± 0.442
3.082AspGly: 3.082 ± 0.533
0.514AspHis: 0.514 ± 0.286
4.795AspIle: 4.795 ± 1.389
3.253AspLys: 3.253 ± 0.911
5.308AspLeu: 5.308 ± 0.927
1.199AspMet: 1.199 ± 0.819
2.911AspAsn: 2.911 ± 0.907
0.685AspPro: 0.685 ± 0.342
1.37AspGln: 1.37 ± 0.578
1.712AspArg: 1.712 ± 0.498
2.397AspSer: 2.397 ± 0.154
1.884AspThr: 1.884 ± 0.593
3.938AspVal: 3.938 ± 1.086
0.514AspTrp: 0.514 ± 0.148
2.226AspTyr: 2.226 ± 0.981
0.0AspXaa: 0.0 ± 0.0
Glu
2.397GluAla: 2.397 ± 1.216
1.199GluCys: 1.199 ± 0.472
4.452GluAsp: 4.452 ± 0.129
5.479GluGlu: 5.479 ± 0.459
3.425GluPhe: 3.425 ± 0.995
3.082GluGly: 3.082 ± 1.065
1.884GluHis: 1.884 ± 0.764
3.082GluIle: 3.082 ± 0.816
6.164GluLys: 6.164 ± 1.335
6.678GluLeu: 6.678 ± 1.917
3.253GluMet: 3.253 ± 0.857
3.596GluAsn: 3.596 ± 0.679
2.397GluPro: 2.397 ± 0.672
2.226GluGln: 2.226 ± 0.975
2.397GluArg: 2.397 ± 0.836
4.623GluSer: 4.623 ± 0.792
5.479GluThr: 5.479 ± 1.464
5.651GluVal: 5.651 ± 0.965
1.027GluTrp: 1.027 ± 0.355
1.884GluTyr: 1.884 ± 0.577
0.0GluXaa: 0.0 ± 0.0
Phe
2.568PheAla: 2.568 ± 0.911
1.37PheCys: 1.37 ± 0.722
2.226PheAsp: 2.226 ± 0.064
3.082PheGlu: 3.082 ± 0.657
2.568PhePhe: 2.568 ± 0.926
2.055PheGly: 2.055 ± 0.366
0.514PheHis: 0.514 ± 0.398
2.226PheIle: 2.226 ± 0.481
3.767PheLys: 3.767 ± 1.142
5.479PheLeu: 5.479 ± 0.472
1.027PheMet: 1.027 ± 0.572
2.397PheAsn: 2.397 ± 1.069
1.199PhePro: 1.199 ± 0.266
2.397PheGln: 2.397 ± 0.528
1.712PheArg: 1.712 ± 0.853
4.623PheSer: 4.623 ± 0.736
2.397PheThr: 2.397 ± 0.154
2.568PheVal: 2.568 ± 1.417
0.171PheTrp: 0.171 ± 0.095
2.568PheTyr: 2.568 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
2.055GlyAla: 2.055 ± 0.366
2.055GlyCys: 2.055 ± 1.816
3.767GlyAsp: 3.767 ± 0.714
2.74GlyGlu: 2.74 ± 0.392
1.884GlyPhe: 1.884 ± 0.801
2.74GlyGly: 2.74 ± 1.489
0.856GlyHis: 0.856 ± 0.249
4.795GlyIle: 4.795 ± 0.73
5.308GlyLys: 5.308 ± 1.219
4.966GlyLeu: 4.966 ± 1.498
0.514GlyMet: 0.514 ± 0.286
2.911GlyAsn: 2.911 ± 0.961
1.027GlyPro: 1.027 ± 0.614
1.37GlyGln: 1.37 ± 0.707
2.397GlyArg: 2.397 ± 0.763
2.74GlySer: 2.74 ± 0.343
3.253GlyThr: 3.253 ± 0.788
2.74GlyVal: 2.74 ± 0.392
0.514GlyTrp: 0.514 ± 0.398
1.712GlyTyr: 1.712 ± 0.122
0.0GlyXaa: 0.0 ± 0.0
His
1.37HisAla: 1.37 ± 0.185
0.856HisCys: 0.856 ± 0.305
0.685HisAsp: 0.685 ± 0.181
0.342HisGlu: 0.342 ± 0.191
1.199HisPhe: 1.199 ± 0.418
1.541HisGly: 1.541 ± 0.641
1.199HisHis: 1.199 ± 0.736
1.027HisIle: 1.027 ± 0.908
0.856HisLys: 0.856 ± 0.304
2.397HisLeu: 2.397 ± 0.221
1.541HisMet: 1.541 ± 1.244
0.514HisAsn: 0.514 ± 0.398
1.541HisPro: 1.541 ± 0.661
0.856HisGln: 0.856 ± 0.304
0.856HisArg: 0.856 ± 0.249
0.856HisSer: 0.856 ± 0.305
1.541HisThr: 1.541 ± 0.906
1.199HisVal: 1.199 ± 0.472
0.171HisTrp: 0.171 ± 0.234
0.514HisTyr: 0.514 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
2.568IleAla: 2.568 ± 0.668
1.541IleCys: 1.541 ± 0.444
3.082IleAsp: 3.082 ± 1.261
5.993IleGlu: 5.993 ± 0.97
2.911IlePhe: 2.911 ± 0.35
2.226IleGly: 2.226 ± 0.858
1.541IleHis: 1.541 ± 0.641
5.308IleIle: 5.308 ± 0.76
8.048IleLys: 8.048 ± 2.065
5.308IleLeu: 5.308 ± 0.377
1.37IleMet: 1.37 ± 0.508
3.253IleAsn: 3.253 ± 0.731
3.596IlePro: 3.596 ± 0.477
3.253IleGln: 3.253 ± 0.788
3.082IleArg: 3.082 ± 0.692
5.651IleSer: 5.651 ± 0.961
4.966IleThr: 4.966 ± 0.844
4.11IleVal: 4.11 ± 0.524
0.514IleTrp: 0.514 ± 0.433
1.541IleTyr: 1.541 ± 0.6
0.0IleXaa: 0.0 ± 0.0
Lys
3.767LysAla: 3.767 ± 0.641
1.712LysCys: 1.712 ± 1.426
4.281LysAsp: 4.281 ± 0.725
8.39LysGlu: 8.39 ± 0.822
4.11LysPhe: 4.11 ± 0.838
5.479LysGly: 5.479 ± 1.412
2.226LysHis: 2.226 ± 0.064
4.966LysIle: 4.966 ± 0.677
6.336LysLys: 6.336 ± 0.703
7.534LysLeu: 7.534 ± 0.92
1.37LysMet: 1.37 ± 0.424
3.938LysAsn: 3.938 ± 0.632
3.253LysPro: 3.253 ± 0.463
1.541LysGln: 1.541 ± 0.6
3.938LysArg: 3.938 ± 0.189
5.137LysSer: 5.137 ± 1.17
4.795LysThr: 4.795 ± 0.73
6.507LysVal: 6.507 ± 1.256
1.199LysTrp: 1.199 ± 0.266
2.397LysTyr: 2.397 ± 0.221
0.0LysXaa: 0.0 ± 0.0
Leu
4.11LeuAla: 4.11 ± 0.642
2.568LeuCys: 2.568 ± 0.248
4.281LeuAsp: 4.281 ± 0.991
5.822LeuGlu: 5.822 ± 0.513
2.911LeuPhe: 2.911 ± 0.722
5.479LeuGly: 5.479 ± 1.367
1.541LeuHis: 1.541 ± 0.661
7.705LeuIle: 7.705 ± 0.716
7.877LeuLys: 7.877 ± 2.475
9.075LeuLeu: 9.075 ± 2.586
2.055LeuMet: 2.055 ± 0.741
4.795LeuAsn: 4.795 ± 0.537
3.425LeuPro: 3.425 ± 0.525
5.137LeuGln: 5.137 ± 1.493
4.452LeuArg: 4.452 ± 1.034
10.445LeuSer: 10.445 ± 2.54
6.849LeuThr: 6.849 ± 0.489
5.822LeuVal: 5.822 ± 0.634
0.685LeuTrp: 0.685 ± 0.63
3.425LeuTyr: 3.425 ± 1.386
0.0LeuXaa: 0.0 ± 0.0
Met
1.199MetAla: 1.199 ± 0.264
0.171MetCys: 0.171 ± 0.095
1.37MetAsp: 1.37 ± 0.707
1.541MetGlu: 1.541 ± 0.346
0.685MetPhe: 0.685 ± 0.382
1.027MetGly: 1.027 ± 0.463
0.685MetHis: 0.685 ± 0.539
1.541MetIle: 1.541 ± 0.683
2.055MetLys: 2.055 ± 0.262
4.623MetLeu: 4.623 ± 1.335
0.685MetMet: 0.685 ± 0.181
1.712MetAsn: 1.712 ± 0.415
0.342MetPro: 0.342 ± 0.171
0.342MetGln: 0.342 ± 0.191
0.685MetArg: 0.685 ± 0.882
1.541MetSer: 1.541 ± 0.329
1.199MetThr: 1.199 ± 0.266
1.199MetVal: 1.199 ± 0.815
0.0MetTrp: 0.0 ± 0.0
0.685MetTyr: 0.685 ± 0.342
0.0MetXaa: 0.0 ± 0.0
Asn
2.74AsnAla: 2.74 ± 1.969
1.37AsnCys: 1.37 ± 0.683
1.884AsnAsp: 1.884 ± 0.787
3.938AsnGlu: 3.938 ± 0.551
3.253AsnPhe: 3.253 ± 1.338
2.226AsnGly: 2.226 ± 0.064
0.685AsnHis: 0.685 ± 0.181
3.938AsnIle: 3.938 ± 0.189
4.452AsnLys: 4.452 ± 0.961
4.452AsnLeu: 4.452 ± 1.207
1.37AsnMet: 1.37 ± 1.055
2.226AsnAsn: 2.226 ± 1.006
1.712AsnPro: 1.712 ± 0.498
1.541AsnGln: 1.541 ± 0.683
2.055AsnArg: 2.055 ± 0.053
3.596AsnSer: 3.596 ± 0.688
1.884AsnThr: 1.884 ± 0.14
4.11AsnVal: 4.11 ± 1.092
1.199AsnTrp: 1.199 ± 0.266
1.541AsnTyr: 1.541 ± 0.346
0.0AsnXaa: 0.0 ± 0.0
Pro
2.226ProAla: 2.226 ± 1.529
0.171ProCys: 0.171 ± 0.095
2.568ProAsp: 2.568 ± 0.549
2.226ProGlu: 2.226 ± 0.975
1.712ProPhe: 1.712 ± 0.61
0.856ProGly: 0.856 ± 0.305
0.171ProHis: 0.171 ± 0.234
2.74ProIle: 2.74 ± 0.752
4.623ProLys: 4.623 ± 1.199
1.541ProLeu: 1.541 ± 0.444
0.342ProMet: 0.342 ± 0.441
0.514ProAsn: 0.514 ± 0.286
0.0ProPro: 0.0 ± 0.0
1.37ProGln: 1.37 ± 0.424
0.856ProArg: 0.856 ± 0.304
2.568ProSer: 2.568 ± 0.674
2.74ProThr: 2.74 ± 1.112
2.055ProVal: 2.055 ± 0.366
0.685ProTrp: 0.685 ± 0.362
1.027ProTyr: 1.027 ± 0.296
0.0ProXaa: 0.0 ± 0.0
Gln
2.397GlnAla: 2.397 ± 0.521
1.027GlnCys: 1.027 ± 0.296
1.37GlnAsp: 1.37 ± 0.508
2.397GlnGlu: 2.397 ± 0.751
1.541GlnPhe: 1.541 ± 0.649
1.541GlnGly: 1.541 ± 0.661
0.685GlnHis: 0.685 ± 0.181
1.541GlnIle: 1.541 ± 0.444
3.253GlnLys: 3.253 ± 0.395
5.308GlnLeu: 5.308 ± 1.772
1.884GlnMet: 1.884 ± 0.494
1.541GlnAsn: 1.541 ± 0.985
0.856GlnPro: 0.856 ± 0.249
2.568GlnGln: 2.568 ± 0.391
2.568GlnArg: 2.568 ± 0.668
2.055GlnSer: 2.055 ± 0.536
1.712GlnThr: 1.712 ± 0.853
1.712GlnVal: 1.712 ± 0.851
0.856GlnTrp: 0.856 ± 0.842
1.199GlnTyr: 1.199 ± 0.515
0.0GlnXaa: 0.0 ± 0.0
Arg
2.226ArgAla: 2.226 ± 0.62
1.199ArgCys: 1.199 ± 0.418
2.226ArgAsp: 2.226 ± 0.975
1.884ArgGlu: 1.884 ± 0.673
2.055ArgPhe: 2.055 ± 0.545
1.37ArgGly: 1.37 ± 0.424
1.712ArgHis: 1.712 ± 0.234
2.568ArgIle: 2.568 ± 0.841
3.253ArgLys: 3.253 ± 0.862
4.795ArgLeu: 4.795 ± 1.894
1.541ArgMet: 1.541 ± 0.346
2.568ArgAsn: 2.568 ± 0.248
1.199ArgPro: 1.199 ± 0.472
1.37ArgGln: 1.37 ± 0.82
1.712ArgArg: 1.712 ± 1.145
3.767ArgSer: 3.767 ± 0.662
3.082ArgThr: 3.082 ± 0.533
1.884ArgVal: 1.884 ± 0.565
0.514ArgTrp: 0.514 ± 0.433
1.541ArgTyr: 1.541 ± 0.6
0.0ArgXaa: 0.0 ± 0.0
Ser
5.308SerAla: 5.308 ± 1.452
1.884SerCys: 1.884 ± 0.494
3.938SerAsp: 3.938 ± 1.034
5.479SerGlu: 5.479 ± 1.142
4.11SerPhe: 4.11 ± 1.75
5.651SerGly: 5.651 ± 0.421
1.884SerHis: 1.884 ± 0.181
7.021SerIle: 7.021 ± 1.327
5.822SerLys: 5.822 ± 0.634
6.849SerLeu: 6.849 ± 1.337
1.37SerMet: 1.37 ± 0.508
1.884SerAsn: 1.884 ± 0.181
1.37SerPro: 1.37 ± 0.448
2.397SerGln: 2.397 ± 0.755
3.082SerArg: 3.082 ± 0.692
10.103SerSer: 10.103 ± 1.225
4.452SerThr: 4.452 ± 1.046
4.452SerVal: 4.452 ± 1.207
1.37SerTrp: 1.37 ± 0.964
1.884SerTyr: 1.884 ± 0.577
0.0SerXaa: 0.0 ± 0.0
Thr
2.397ThrAla: 2.397 ± 0.755
2.226ThrCys: 2.226 ± 1.224
3.082ThrAsp: 3.082 ± 0.533
4.795ThrGlu: 4.795 ± 0.537
3.253ThrPhe: 3.253 ± 1.69
4.11ThrGly: 4.11 ± 1.091
1.199ThrHis: 1.199 ± 0.519
4.281ThrIle: 4.281 ± 0.069
4.11ThrLys: 4.11 ± 0.524
5.651ThrLeu: 5.651 ± 0.968
0.856ThrMet: 0.856 ± 0.304
4.11ThrAsn: 4.11 ± 0.554
1.541ThrPro: 1.541 ± 0.124
1.884ThrGln: 1.884 ± 1.076
2.568ThrArg: 2.568 ± 0.534
5.308ThrSer: 5.308 ± 1.056
4.11ThrThr: 4.11 ± 1.091
3.253ThrVal: 3.253 ± 0.227
0.514ThrTrp: 0.514 ± 0.148
2.055ThrTyr: 2.055 ± 0.776
0.0ThrXaa: 0.0 ± 0.0
Val
3.596ValAla: 3.596 ± 1.745
1.37ValCys: 1.37 ± 0.683
2.74ValAsp: 2.74 ± 1.258
5.137ValGlu: 5.137 ± 1.17
2.911ValPhe: 2.911 ± 0.722
2.055ValGly: 2.055 ± 0.679
1.712ValHis: 1.712 ± 0.234
4.795ValIle: 4.795 ± 0.779
4.281ValLys: 4.281 ± 0.924
6.849ValLeu: 6.849 ± 0.809
0.514ValMet: 0.514 ± 0.148
3.082ValAsn: 3.082 ± 0.955
2.568ValPro: 2.568 ± 0.448
2.568ValGln: 2.568 ± 0.942
3.253ValArg: 3.253 ± 0.911
4.452ValSer: 4.452 ± 1.504
3.938ValThr: 3.938 ± 1.443
4.11ValVal: 4.11 ± 0.625
1.027ValTrp: 1.027 ± 0.463
1.712ValTyr: 1.712 ± 0.61
0.0ValXaa: 0.0 ± 0.0
Trp
0.514TrpAla: 0.514 ± 0.148
0.342TrpCys: 0.342 ± 0.171
0.685TrpAsp: 0.685 ± 0.362
0.514TrpGlu: 0.514 ± 0.398
0.342TrpPhe: 0.342 ± 0.513
1.027TrpGly: 1.027 ± 0.355
0.171TrpHis: 0.171 ± 0.234
0.685TrpIle: 0.685 ± 0.342
1.884TrpLys: 1.884 ± 0.451
1.541TrpLeu: 1.541 ± 0.346
0.514TrpMet: 0.514 ± 0.398
1.37TrpAsn: 1.37 ± 0.724
0.514TrpPro: 0.514 ± 0.398
0.342TrpGln: 0.342 ± 0.171
0.856TrpArg: 0.856 ± 0.842
0.514TrpSer: 0.514 ± 0.415
0.856TrpThr: 0.856 ± 0.249
0.342TrpVal: 0.342 ± 0.513
0.514TrpTrp: 0.514 ± 0.433
0.171TrpTyr: 0.171 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.856TyrAla: 0.856 ± 0.249
1.027TyrCys: 1.027 ± 0.268
1.884TyrAsp: 1.884 ± 0.492
1.541TyrGlu: 1.541 ± 0.124
1.884TyrPhe: 1.884 ± 0.81
1.541TyrGly: 1.541 ± 0.425
0.342TyrHis: 0.342 ± 0.171
2.226TyrIle: 2.226 ± 0.605
2.74TyrLys: 2.74 ± 0.752
2.568TyrLeu: 2.568 ± 0.926
0.514TyrMet: 0.514 ± 0.148
2.74TyrAsn: 2.74 ± 1.017
0.514TyrPro: 0.514 ± 0.415
2.226TyrGln: 2.226 ± 0.598
1.199TyrArg: 1.199 ± 0.418
2.74TyrSer: 2.74 ± 0.825
1.37TyrThr: 1.37 ± 0.362
1.884TyrVal: 1.884 ± 0.618
0.685TyrTrp: 0.685 ± 0.41
0.856TyrTyr: 0.856 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5841 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski