Amino acid dipeptide frequency for Saesbyeol virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
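The counting described above can be sketched in Python. This is only an illustration under stated assumptions: the page does not say exactly how the pairs are counted, so the sketch assumes overlapping pairs that never span two proteins, uses one-letter codes instead of the three-letter codes of the table, and the names `dipeptide_permille`, `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Assumption: overlapping windows of length 2, counted within each
    protein only (no pair spans two proteins).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(STANDARD, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input (not the Saesbyeol virus proteome): 8 overlapping pairs in total.
freqs = dipeptide_permille(["MKKLLS", "SSLK"])
print(freqs["KK"], classify(freqs["KK"]))
print(freqs["AA"], classify(freqs["AA"]))
```

With real data the input would be the protein sequences of the proteome, and "over" and "under" would correspond to the red and blue markings of the table.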

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
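As a small worked example of the unit conversion, the following Python sketch turns a table value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰ (1/400). The helper names are hypothetical; the SerSer value of 9.274‰ is taken from the table below.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25%


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_over_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PERMILLE


ser_ser = 9.274  # SerSer frequency in per mille, from the table
fraction = permille_to_fraction(ser_ser)
print(f"fraction = {fraction:.6f}, percent = {fraction * 100:.4f}%")
print(f"fold over uniform = {fold_over_uniform(ser_ser):.2f}")
```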

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.942AlaAla: 3.942 ± 1.652
0.927AlaCys: 0.927 ± 0.375
1.623AlaAsp: 1.623 ± 0.228
2.319AlaGlu: 2.319 ± 0.368
1.159AlaPhe: 1.159 ± 0.3
4.637AlaGly: 4.637 ± 1.048
1.855AlaHis: 1.855 ± 0.332
3.014AlaIle: 3.014 ± 0.562
4.173AlaLys: 4.173 ± 1.12
3.942AlaLeu: 3.942 ± 0.542
1.623AlaMet: 1.623 ± 0.412
3.014AlaAsn: 3.014 ± 0.912
1.391AlaPro: 1.391 ± 0.8
1.391AlaGln: 1.391 ± 0.627
3.478AlaArg: 3.478 ± 1.027
4.869AlaSer: 4.869 ± 1.11
3.246AlaThr: 3.246 ± 0.895
2.782AlaVal: 2.782 ± 1.275
0.232AlaTrp: 0.232 ± 0.193
2.087AlaTyr: 2.087 ± 0.54
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.464CysCys: 0.464 ± 0.071
0.927CysAsp: 0.927 ± 0.375
1.855CysGlu: 1.855 ± 0.283
0.927CysPhe: 0.927 ± 0.771
0.696CysGly: 0.696 ± 0.157
0.232CysHis: 0.232 ± 0.193
1.855CysIle: 1.855 ± 0.332
2.782CysLys: 2.782 ± 0.662
1.855CysLeu: 1.855 ± 0.541
1.623CysMet: 1.623 ± 0.504
1.159CysAsn: 1.159 ± 0.3
1.391CysPro: 1.391 ± 0.662
0.464CysGln: 0.464 ± 0.374
0.927CysArg: 0.927 ± 0.142
2.782CysSer: 2.782 ± 1.632
2.319CysThr: 2.319 ± 0.915
0.927CysVal: 0.927 ± 0.375
0.0CysTrp: 0.0 ± 0.0
0.696CysTyr: 0.696 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
3.71AspAla: 3.71 ± 2.346
1.391AspCys: 1.391 ± 0.662
3.942AspAsp: 3.942 ± 0.587
1.855AspGlu: 1.855 ± 0.332
2.087AspPhe: 2.087 ± 0.235
5.101AspGly: 5.101 ± 1.07
0.927AspHis: 0.927 ± 0.375
6.956AspIle: 6.956 ± 2.861
1.623AspLys: 1.623 ± 0.299
4.405AspLeu: 4.405 ± 1.504
0.927AspMet: 0.927 ± 0.142
3.478AspAsn: 3.478 ± 0.442
2.319AspPro: 2.319 ± 0.364
2.55AspGln: 2.55 ± 0.833
2.55AspArg: 2.55 ± 0.177
4.869AspSer: 4.869 ± 0.838
3.942AspThr: 3.942 ± 1.19
2.087AspVal: 2.087 ± 0.284
0.464AspTrp: 0.464 ± 0.071
1.855AspTyr: 1.855 ± 0.618
0.0AspXaa: 0.0 ± 0.0
Glu
2.319GluAla: 2.319 ± 0.368
1.391GluCys: 1.391 ± 0.212
4.637GluAsp: 4.637 ± 0.581
3.71GluGlu: 3.71 ± 0.612
2.319GluPhe: 2.319 ± 0.726
3.246GluGly: 3.246 ± 0.638
1.391GluHis: 1.391 ± 0.484
4.637GluIle: 4.637 ± 0.627
3.014GluLys: 3.014 ± 0.774
5.796GluLeu: 5.796 ± 1.498
2.087GluMet: 2.087 ± 0.284
3.014GluAsn: 3.014 ± 0.84
2.087GluPro: 2.087 ± 0.198
1.855GluGln: 1.855 ± 0.332
2.087GluArg: 2.087 ± 0.902
3.942GluSer: 3.942 ± 1.394
4.869GluThr: 4.869 ± 0.684
3.246GluVal: 3.246 ± 0.642
0.232GluTrp: 0.232 ± 0.193
3.246GluTyr: 3.246 ± 0.926
0.0GluXaa: 0.0 ± 0.0
Phe
3.246PheAla: 3.246 ± 1.1
0.464PheCys: 0.464 ± 0.071
2.087PheAsp: 2.087 ± 0.284
3.014PheGlu: 3.014 ± 0.41
1.159PhePhe: 1.159 ± 0.624
2.319PheGly: 2.319 ± 0.354
1.159PheHis: 1.159 ± 0.502
3.246PheIle: 3.246 ± 0.291
2.55PheLys: 2.55 ± 0.346
1.391PheLeu: 1.391 ± 0.354
1.391PheMet: 1.391 ± 0.212
3.942PheAsn: 3.942 ± 1.312
0.927PhePro: 0.927 ± 0.432
0.464PheGln: 0.464 ± 0.374
1.623PheArg: 1.623 ± 0.463
3.014PheSer: 3.014 ± 0.132
2.55PheThr: 2.55 ± 0.598
0.696PheVal: 0.696 ± 0.242
0.464PheTrp: 0.464 ± 0.321
2.087PheTyr: 2.087 ± 1.362
0.0PheXaa: 0.0 ± 0.0
Gly
3.71GlyAla: 3.71 ± 0.856
1.391GlyCys: 1.391 ± 0.212
2.087GlyAsp: 2.087 ± 0.969
2.55GlyGlu: 2.55 ± 0.908
2.087GlyPhe: 2.087 ± 0.856
2.782GlyGly: 2.782 ± 0.043
0.927GlyHis: 0.927 ± 0.142
2.782GlyIle: 2.782 ± 0.043
3.014GlyLys: 3.014 ± 0.84
6.492GlyLeu: 6.492 ± 0.357
1.623GlyMet: 1.623 ± 0.622
2.782GlyAsn: 2.782 ± 1.254
2.087GlyPro: 2.087 ± 0.515
1.623GlyGln: 1.623 ± 0.225
2.087GlyArg: 2.087 ± 0.902
5.565GlySer: 5.565 ± 0.98
3.942GlyThr: 3.942 ± 0.7
4.173GlyVal: 4.173 ± 1.178
1.391GlyTrp: 1.391 ± 0.313
3.014GlyTyr: 3.014 ± 0.84
0.0GlyXaa: 0.0 ± 0.0
His
0.927HisAla: 0.927 ± 0.309
0.927HisCys: 0.927 ± 0.432
1.159HisAsp: 1.159 ± 0.467
1.623HisGlu: 1.623 ± 0.609
0.696HisPhe: 0.696 ± 0.579
0.927HisGly: 0.927 ± 0.375
0.464HisHis: 0.464 ± 0.321
2.782HisIle: 2.782 ± 0.662
0.927HisLys: 0.927 ± 0.432
0.927HisLeu: 0.927 ± 0.142
0.464HisMet: 0.464 ± 0.071
0.232HisAsn: 0.232 ± 0.161
0.464HisPro: 0.464 ± 0.071
0.927HisGln: 0.927 ± 0.304
1.391HisArg: 1.391 ± 0.234
1.623HisSer: 1.623 ± 0.673
1.159HisThr: 1.159 ± 0.3
0.464HisVal: 0.464 ± 0.071
0.232HisTrp: 0.232 ± 0.161
1.159HisTyr: 1.159 ± 0.467
0.0HisXaa: 0.0 ± 0.0
Ile
4.869IleAla: 4.869 ± 0.544
1.159IleCys: 1.159 ± 0.624
5.796IleAsp: 5.796 ± 0.591
3.478IleGlu: 3.478 ± 0.547
3.478IlePhe: 3.478 ± 0.552
5.101IleGly: 5.101 ± 0.389
0.927IleHis: 0.927 ± 0.309
6.028IleIle: 6.028 ± 0.492
7.419IleLys: 7.419 ± 0.835
5.101IleLeu: 5.101 ± 0.899
3.014IleMet: 3.014 ± 0.774
4.869IleAsn: 4.869 ± 1.096
3.71IlePro: 3.71 ± 1.138
2.319IleGln: 2.319 ± 0.709
2.55IleArg: 2.55 ± 0.346
6.028IleSer: 6.028 ± 1.128
6.26IleThr: 6.26 ± 0.448
4.637IleVal: 4.637 ± 0.194
0.464IleTrp: 0.464 ± 0.321
2.319IleTyr: 2.319 ± 0.364
0.0IleXaa: 0.0 ± 0.0
Lys
2.55LysAla: 2.55 ± 0.854
2.087LysCys: 2.087 ± 0.47
4.405LysAsp: 4.405 ± 0.596
3.478LysGlu: 3.478 ± 0.116
2.782LysPhe: 2.782 ± 0.671
2.782LysGly: 2.782 ± 0.626
0.927LysHis: 0.927 ± 0.432
7.651LysIle: 7.651 ± 1.261
6.028LysLys: 6.028 ± 0.738
6.26LysLeu: 6.26 ± 0.422
2.087LysMet: 2.087 ± 0.726
2.55LysAsn: 2.55 ± 0.195
2.319LysPro: 2.319 ± 0.538
2.087LysGln: 2.087 ± 0.284
3.014LysArg: 3.014 ± 0.216
5.796LysSer: 5.796 ± 1.379
5.101LysThr: 5.101 ± 0.576
4.637LysVal: 4.637 ± 1.508
0.464LysTrp: 0.464 ± 0.321
3.246LysTyr: 3.246 ± 0.725
0.0LysXaa: 0.0 ± 0.0
Leu
4.637LeuAla: 4.637 ± 0.41
0.927LeuCys: 0.927 ± 0.142
4.869LeuAsp: 4.869 ± 1.918
6.26LeuGlu: 6.26 ± 1.7
3.246LeuPhe: 3.246 ± 0.449
2.782LeuGly: 2.782 ± 0.928
1.623LeuHis: 1.623 ± 0.228
3.942LeuIle: 3.942 ± 0.542
6.724LeuLys: 6.724 ± 0.385
7.188LeuLeu: 7.188 ± 1.711
3.246LeuMet: 3.246 ± 0.398
3.71LeuAsn: 3.71 ± 1.167
2.087LeuPro: 2.087 ± 0.428
0.464LeuGln: 0.464 ± 0.071
4.869LeuArg: 4.869 ± 0.24
7.651LeuSer: 7.651 ± 1.55
5.101LeuThr: 5.101 ± 0.768
4.173LeuVal: 4.173 ± 0.834
0.927LeuTrp: 0.927 ± 0.771
3.71LeuTyr: 3.71 ± 0.79
0.0LeuXaa: 0.0 ± 0.0
Met
2.087MetAla: 2.087 ± 0.47
0.927MetCys: 0.927 ± 0.304
2.087MetAsp: 2.087 ± 0.653
2.087MetGlu: 2.087 ± 0.428
0.464MetPhe: 0.464 ± 0.071
1.391MetGly: 1.391 ± 0.212
0.927MetHis: 0.927 ± 0.142
2.087MetIle: 2.087 ± 0.47
2.319MetLys: 2.319 ± 0.6
3.71MetLeu: 3.71 ± 0.533
0.696MetMet: 0.696 ± 0.242
1.159MetAsn: 1.159 ± 0.3
1.391MetPro: 1.391 ± 0.313
0.927MetGln: 0.927 ± 0.309
1.623MetArg: 1.623 ± 0.299
2.782MetSer: 2.782 ± 0.708
1.855MetThr: 1.855 ± 0.332
1.855MetVal: 1.855 ± 1.017
0.464MetTrp: 0.464 ± 0.386
1.623MetTyr: 1.623 ± 0.363
0.0MetXaa: 0.0 ± 0.0
Asn
2.55AsnAla: 2.55 ± 0.195
1.391AsnCys: 1.391 ± 0.816
3.478AsnAsp: 3.478 ± 1.492
3.246AsnGlu: 3.246 ± 0.926
2.319AsnPhe: 2.319 ± 0.097
2.319AsnGly: 2.319 ± 0.915
0.927AsnHis: 0.927 ± 0.142
6.956AsnIle: 6.956 ± 0.192
5.101AsnLys: 5.101 ± 0.989
2.782AsnLeu: 2.782 ± 0.324
3.246AsnMet: 3.246 ± 0.065
3.478AsnAsn: 3.478 ± 0.804
1.623AsnPro: 1.623 ± 0.228
0.927AsnGln: 0.927 ± 0.142
2.782AsnArg: 2.782 ± 0.626
5.333AsnSer: 5.333 ± 0.832
3.478AsnThr: 3.478 ± 0.88
2.782AsnVal: 2.782 ± 0.928
0.464AsnTrp: 0.464 ± 0.071
1.623AsnTyr: 1.623 ± 0.609
0.0AsnXaa: 0.0 ± 0.0
Pro
1.623ProAla: 1.623 ± 1.008
0.464ProCys: 0.464 ± 0.321
1.391ProAsp: 1.391 ± 0.354
2.782ProGlu: 2.782 ± 0.912
2.087ProPhe: 2.087 ± 0.54
1.855ProGly: 1.855 ± 0.752
0.696ProHis: 0.696 ± 0.482
2.319ProIle: 2.319 ± 0.354
2.782ProLys: 2.782 ± 0.406
3.246ProLeu: 3.246 ± 1.007
0.464ProMet: 0.464 ± 0.386
2.087ProAsn: 2.087 ± 1.055
1.159ProPro: 1.159 ± 0.182
0.696ProGln: 0.696 ± 0.242
1.391ProArg: 1.391 ± 0.212
4.869ProSer: 4.869 ± 0.544
1.855ProThr: 1.855 ± 0.675
1.855ProVal: 1.855 ± 0.165
0.232ProTrp: 0.232 ± 0.193
1.391ProTyr: 1.391 ± 0.816
0.0ProXaa: 0.0 ± 0.0
Gln
1.159GlnAla: 1.159 ± 0.268
0.464GlnCys: 0.464 ± 0.071
1.855GlnAsp: 1.855 ± 0.347
1.855GlnGlu: 1.855 ± 0.541
1.623GlnPhe: 1.623 ± 0.604
2.319GlnGly: 2.319 ± 0.726
0.0GlnHis: 0.0 ± 0.0
1.855GlnIle: 1.855 ± 0.752
1.623GlnLys: 1.623 ± 0.945
1.855GlnLeu: 1.855 ± 0.618
0.232GlnMet: 0.232 ± 0.298
1.855GlnAsn: 1.855 ± 0.618
0.927GlnPro: 0.927 ± 0.304
0.696GlnGln: 0.696 ± 0.157
0.464GlnArg: 0.464 ± 0.071
2.782GlnSer: 2.782 ± 0.928
2.087GlnThr: 2.087 ± 0.235
0.927GlnVal: 0.927 ± 0.142
0.0GlnTrp: 0.0 ± 0.0
1.623GlnTyr: 1.623 ± 0.463
0.0GlnXaa: 0.0 ± 0.0
Arg
2.319ArgAla: 2.319 ± 0.538
1.623ArgCys: 1.623 ± 1.008
1.855ArgAsp: 1.855 ± 1.286
3.71ArgGlu: 3.71 ± 0.93
1.623ArgPhe: 1.623 ± 0.463
1.855ArgGly: 1.855 ± 0.165
1.391ArgHis: 1.391 ± 0.627
2.55ArgIle: 2.55 ± 0.772
2.782ArgLys: 2.782 ± 0.664
2.782ArgLeu: 2.782 ± 0.671
2.087ArgMet: 2.087 ± 0.198
3.478ArgAsn: 3.478 ± 1.027
1.623ArgPro: 1.623 ± 0.225
1.159ArgGln: 1.159 ± 0.268
3.246ArgArg: 3.246 ± 0.291
3.478ArgSer: 3.478 ± 1.081
4.173ArgThr: 4.173 ± 0.694
2.782ArgVal: 2.782 ± 0.043
0.464ArgTrp: 0.464 ± 0.386
2.55ArgTyr: 2.55 ± 0.582
0.0ArgXaa: 0.0 ± 0.0
Ser
3.246SerAla: 3.246 ± 0.449
1.855SerCys: 1.855 ± 0.332
6.028SerAsp: 6.028 ± 0.084
5.796SerGlu: 5.796 ± 1.846
3.246SerPhe: 3.246 ± 1.217
5.101SerGly: 5.101 ± 1.564
1.391SerHis: 1.391 ± 0.678
6.956SerIle: 6.956 ± 0.512
6.724SerLys: 6.724 ± 1.118
7.883SerLeu: 7.883 ± 1.101
2.55SerMet: 2.55 ± 0.772
5.101SerAsn: 5.101 ± 1.242
3.014SerPro: 3.014 ± 0.854
2.319SerGln: 2.319 ± 1.01
4.173SerArg: 4.173 ± 1.461
9.274SerSer: 9.274 ± 0.823
6.956SerThr: 6.956 ± 0.192
3.71SerVal: 3.71 ± 0.329
1.159SerTrp: 1.159 ± 0.624
3.014SerTyr: 3.014 ± 0.132
0.0SerXaa: 0.0 ± 0.0
Thr
3.014ThrAla: 3.014 ± 1.237
2.087ThrCys: 2.087 ± 1.201
4.637ThrAsp: 4.637 ± 0.549
4.637ThrGlu: 4.637 ± 0.708
2.087ThrPhe: 2.087 ± 0.47
4.637ThrGly: 4.637 ± 0.709
0.927ThrHis: 0.927 ± 0.432
5.565ThrIle: 5.565 ± 1.445
4.173ThrLys: 4.173 ± 1.369
4.637ThrLeu: 4.637 ± 1.869
2.55ThrMet: 2.55 ± 0.882
3.014ThrAsn: 3.014 ± 0.48
3.014ThrPro: 3.014 ± 0.564
2.55ThrGln: 2.55 ± 0.486
3.478ThrArg: 3.478 ± 0.797
5.796ThrSer: 5.796 ± 0.455
5.101ThrThr: 5.101 ± 1.353
7.188ThrVal: 7.188 ± 1.241
0.464ThrTrp: 0.464 ± 0.071
1.391ThrTyr: 1.391 ± 0.484
0.0ThrXaa: 0.0 ± 0.0
Val
3.246ValAla: 3.246 ± 0.496
1.391ValCys: 1.391 ± 1.157
3.246ValAsp: 3.246 ± 0.407
2.782ValGlu: 2.782 ± 0.425
2.087ValPhe: 2.087 ± 0.428
3.246ValGly: 3.246 ± 1.763
1.159ValHis: 1.159 ± 0.3
3.71ValIle: 3.71 ± 1.016
4.173ValLys: 4.173 ± 0.471
3.246ValLeu: 3.246 ± 0.456
1.391ValMet: 1.391 ± 0.475
4.405ValAsn: 4.405 ± 1.027
2.319ValPro: 2.319 ± 0.6
1.623ValGln: 1.623 ± 0.225
3.246ValArg: 3.246 ± 0.926
4.637ValSer: 4.637 ± 1.016
3.014ValThr: 3.014 ± 0.512
4.637ValVal: 4.637 ± 1.048
0.464ValTrp: 0.464 ± 0.321
2.55ValTyr: 2.55 ± 0.195
0.0ValXaa: 0.0 ± 0.0
Trp
0.927TrpAla: 0.927 ± 0.309
0.232TrpCys: 0.232 ± 0.193
0.232TrpAsp: 0.232 ± 0.193
0.464TrpGlu: 0.464 ± 0.321
0.464TrpPhe: 0.464 ± 0.071
0.696TrpGly: 0.696 ± 0.242
0.0TrpHis: 0.0 ± 0.0
1.391TrpIle: 1.391 ± 0.313
0.0TrpLys: 0.0 ± 0.0
0.464TrpLeu: 0.464 ± 0.071
0.0TrpMet: 0.0 ± 0.0
0.464TrpAsn: 0.464 ± 0.071
0.696TrpPro: 0.696 ± 0.157
0.232TrpGln: 0.232 ± 0.193
0.464TrpArg: 0.464 ± 0.386
0.232TrpSer: 0.232 ± 0.193
0.696TrpThr: 0.696 ± 0.242
0.464TrpVal: 0.464 ± 0.071
0.232TrpTrp: 0.232 ± 0.193
0.696TrpTyr: 0.696 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.623TyrAla: 1.623 ± 0.299
2.319TyrCys: 2.319 ± 0.354
0.696TyrAsp: 0.696 ± 0.157
1.623TyrGlu: 1.623 ± 0.363
1.855TyrPhe: 1.855 ± 0.947
2.319TyrGly: 2.319 ± 0.709
1.623TyrHis: 1.623 ± 0.363
3.246TyrIle: 3.246 ± 0.642
2.319TyrLys: 2.319 ± 0.538
4.173TyrLeu: 4.173 ± 0.568
0.927TyrMet: 0.927 ± 0.304
3.014TyrAsn: 3.014 ± 0.849
0.696TyrPro: 0.696 ± 0.242
1.159TyrGln: 1.159 ± 0.3
2.087TyrArg: 2.087 ± 0.198
4.173TyrSer: 4.173 ± 0.261
3.246TyrThr: 3.246 ± 1.346
2.55TyrVal: 2.55 ± 0.486
0.232TyrTrp: 0.232 ± 0.161
2.782TyrTyr: 2.782 ± 0.375
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4314 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
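A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure, which is not described on this page: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed for every resample, and that the error is the standard deviation of those estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping within-protein pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap rounds.

    Each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not the Saesbyeol virus proteome).
toy = ["MKKLLS", "SSLK", "KKKK"]
mean_kk, err_kk = bootstrap_error(toy, "KK")
print(f"KK: {mean_kk:.3f} ± {err_kk:.3f}")
print(bootstrap_error(toy, "WW"))
```

In this sketch a dipeptide that is absent from every protein yields zero in every resample, so both its mean and its error are zero, which is the pattern shown by the 0.0 ± 0.0 entries of the table. With only a few proteins the resamples differ little from one another, so the error estimate is coarse.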

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski