Amino acid dipeptide frequency for Merida virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
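As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences, counts overlapping dipeptides within each protein only, and maps any non-standard letter to X (Xaa). The function names and the toy sequences are made up for the example; this is not the code used to build the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones becomes X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Hypothetical toy sequences, not taken from the Merida virus proteome.
toy = ["MKLLSA", "MSLLK"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
expected = 1000.0 / 400
print(f"LL: {freqs['LL']:.3f} per mille, uniform expectation: {expected} per mille")
```

Counting within each protein avoids creating artificial dipeptides at the junction between two sequences; whether the database does the same is not stated here.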

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
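The unit conversion and the comparison with the uniform expectation can be sketched as follows. The three values are copied from the table; the helper names are hypothetical, and the plain comparison against 2.5‰ is an assumption about how over- and underrepresentation is decided, not a description of the page's actual colouring rule.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25%


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table, in per mille.
table_values = {"SerLeu": 11.869, "AlaAla": 2.158, "TrpTrp": 0.0}

for name, value in table_values.items():
    print(name, permille_to_fraction(value), classify(value))
```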

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.158 ± 0.461
AlaCys: 0.809 ± 0.8
AlaAsp: 3.507 ± 1.338
AlaGlu: 2.967 ± 0.775
AlaPhe: 2.428 ± 0.575
AlaGly: 3.507 ± 0.836
AlaHis: 1.079 ± 0.993
AlaIle: 4.316 ± 1.0
AlaLys: 2.967 ± 0.84
AlaLeu: 6.204 ± 1.133
AlaMet: 0.809 ± 0.491
AlaAsn: 1.079 ± 0.655
AlaPro: 1.888 ± 0.478
AlaGln: 1.888 ± 0.624
AlaArg: 4.586 ± 1.055
AlaSer: 5.125 ± 1.15
AlaThr: 3.237 ± 0.734
AlaVal: 2.698 ± 0.456
AlaTrp: 0.27 ± 0.164
AlaTyr: 2.698 ± 1.227
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.809 ± 0.387
CysCys: 0.0 ± 0.0
CysAsp: 1.349 ± 0.525
CysGlu: 0.27 ± 0.512
CysPhe: 1.349 ± 0.494
CysGly: 0.54 ± 0.363
CysHis: 0.54 ± 0.363
CysIle: 0.27 ± 0.164
CysLys: 1.349 ± 0.956
CysLeu: 2.158 ± 0.728
CysMet: 0.27 ± 0.449
CysAsn: 0.27 ± 0.164
CysPro: 1.079 ± 0.685
CysGln: 0.27 ± 0.164
CysArg: 0.54 ± 0.898
CysSer: 1.619 ± 1.211
CysThr: 1.619 ± 0.773
CysVal: 0.809 ± 0.34
CysTrp: 0.54 ± 0.328
CysTyr: 0.27 ± 0.449
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.079 ± 0.553
AspCys: 1.619 ± 0.599
AspAsp: 2.158 ± 0.656
AspGlu: 4.856 ± 1.956
AspPhe: 2.158 ± 1.05
AspGly: 4.046 ± 1.143
AspHis: 1.888 ± 0.61
AspIle: 4.046 ± 0.674
AspLys: 2.967 ± 0.89
AspLeu: 8.093 ± 2.063
AspMet: 2.158 ± 0.811
AspAsn: 1.349 ± 0.819
AspPro: 2.967 ± 0.529
AspGln: 1.888 ± 0.477
AspArg: 2.698 ± 0.601
AspSer: 3.507 ± 0.688
AspThr: 2.698 ± 1.072
AspVal: 1.619 ± 0.906
AspTrp: 1.619 ± 0.773
AspTyr: 2.158 ± 0.962
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.046 ± 0.515
GluCys: 1.079 ± 0.979
GluAsp: 3.777 ± 1.615
GluGlu: 5.395 ± 2.487
GluPhe: 3.237 ± 1.032
GluGly: 6.474 ± 1.555
GluHis: 1.349 ± 0.684
GluIle: 3.777 ± 0.722
GluLys: 2.698 ± 1.319
GluLeu: 4.856 ± 1.087
GluMet: 0.809 ± 0.355
GluAsn: 2.158 ± 0.783
GluPro: 3.237 ± 0.977
GluGln: 1.619 ± 0.843
GluArg: 2.967 ± 0.894
GluSer: 5.935 ± 2.021
GluThr: 2.158 ± 0.642
GluVal: 3.507 ± 0.795
GluTrp: 0.809 ± 0.8
GluTyr: 3.237 ± 1.27
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.54 ± 0.328
PheCys: 0.54 ± 0.328
PheAsp: 2.428 ± 1.206
PheGlu: 0.809 ± 0.485
PhePhe: 2.428 ± 1.02
PheGly: 1.619 ± 0.624
PheHis: 1.888 ± 1.146
PheIle: 2.698 ± 1.638
PheLys: 2.428 ± 0.592
PheLeu: 4.316 ± 0.822
PheMet: 1.349 ± 0.698
PheAsn: 1.619 ± 0.68
PhePro: 2.698 ± 0.706
PheGln: 1.888 ± 0.938
PheArg: 3.237 ± 0.516
PheSer: 5.125 ± 1.423
PheThr: 1.888 ± 0.908
PheVal: 2.967 ± 1.349
PheTrp: 1.079 ± 0.328
PheTyr: 1.888 ± 0.775
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.428 ± 0.41
GlyCys: 0.54 ± 0.346
GlyAsp: 4.316 ± 1.445
GlyGlu: 5.125 ± 1.583
GlyPhe: 2.158 ± 0.248
GlyGly: 5.935 ± 0.943
GlyHis: 1.619 ± 0.523
GlyIle: 3.237 ± 0.909
GlyLys: 3.777 ± 1.001
GlyLeu: 8.632 ± 1.082
GlyMet: 1.349 ± 0.766
GlyAsn: 2.428 ± 0.566
GlyPro: 2.428 ± 1.531
GlyGln: 3.507 ± 0.746
GlyArg: 4.316 ± 1.411
GlySer: 6.474 ± 1.729
GlyThr: 4.586 ± 1.392
GlyVal: 4.046 ± 1.853
GlyTrp: 1.079 ± 0.693
GlyTyr: 2.158 ± 1.448
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.54 ± 0.363
HisCys: 0.27 ± 0.379
HisAsp: 0.809 ± 0.485
HisGlu: 0.809 ± 0.355
HisPhe: 0.809 ± 0.491
HisGly: 1.619 ± 0.599
HisHis: 0.54 ± 0.363
HisIle: 1.619 ± 0.624
HisLys: 0.54 ± 0.543
HisLeu: 3.507 ± 0.838
HisMet: 1.349 ± 1.116
HisAsn: 0.54 ± 0.328
HisPro: 1.619 ± 0.697
HisGln: 0.809 ± 0.491
HisArg: 1.619 ± 0.748
HisSer: 1.349 ± 0.417
HisThr: 0.809 ± 0.453
HisVal: 1.079 ± 0.553
HisTrp: 0.27 ± 0.449
HisTyr: 0.809 ± 0.355
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.158 ± 0.698
IleCys: 0.809 ± 0.418
IleAsp: 2.967 ± 0.782
IleGlu: 2.967 ± 0.677
IlePhe: 2.698 ± 0.31
IleGly: 4.046 ± 0.921
IleHis: 0.809 ± 0.418
IleIle: 2.428 ± 1.07
IleLys: 4.586 ± 0.818
IleLeu: 6.204 ± 1.467
IleMet: 0.809 ± 0.453
IleAsn: 1.349 ± 0.307
IlePro: 2.967 ± 1.741
IleGln: 3.237 ± 0.366
IleArg: 4.856 ± 1.238
IleSer: 6.474 ± 1.435
IleThr: 2.967 ± 0.753
IleVal: 3.507 ± 0.867
IleTrp: 0.54 ± 0.346
IleTyr: 1.349 ± 0.551
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.856 ± 1.52
LysCys: 1.079 ± 0.52
LysAsp: 2.428 ± 0.539
LysGlu: 3.237 ± 1.555
LysPhe: 1.349 ± 0.684
LysGly: 3.507 ± 1.798
LysHis: 0.0 ± 0.0
LysIle: 4.046 ± 1.15
LysLys: 3.237 ± 1.509
LysLeu: 4.046 ± 0.726
LysMet: 1.349 ± 0.65
LysAsn: 2.158 ± 0.775
LysPro: 2.698 ± 1.241
LysGln: 1.888 ± 0.68
LysArg: 3.777 ± 1.885
LysSer: 4.856 ± 1.264
LysThr: 4.046 ± 0.722
LysVal: 3.507 ± 0.778
LysTrp: 1.619 ± 0.983
LysTyr: 1.888 ± 1.113
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.014 ± 1.267
LeuCys: 1.349 ± 0.494
LeuAsp: 6.474 ± 1.892
LeuGlu: 7.284 ± 1.35
LeuPhe: 3.777 ± 1.147
LeuGly: 7.553 ± 1.472
LeuHis: 1.349 ± 0.539
LeuIle: 7.553 ± 1.884
LeuLys: 7.284 ± 2.503
LeuLeu: 7.284 ± 1.148
LeuMet: 3.237 ± 1.314
LeuAsn: 2.158 ± 0.971
LeuPro: 4.046 ± 0.994
LeuGln: 2.698 ± 1.086
LeuArg: 8.093 ± 1.709
LeuSer: 8.363 ± 0.784
LeuThr: 6.474 ± 1.984
LeuVal: 4.586 ± 1.009
LeuTrp: 1.349 ± 0.539
LeuTyr: 2.698 ± 0.85
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.079 ± 0.693
MetCys: 0.27 ± 0.403
MetAsp: 1.619 ± 0.523
MetGlu: 1.079 ± 0.55
MetPhe: 1.349 ± 0.307
MetGly: 2.428 ± 1.085
MetHis: 0.54 ± 0.328
MetIle: 0.809 ± 0.555
MetLys: 1.888 ± 0.68
MetLeu: 1.349 ± 1.39
MetMet: 0.54 ± 0.363
MetAsn: 0.27 ± 0.164
MetPro: 0.27 ± 0.164
MetGln: 0.27 ± 0.164
MetArg: 0.27 ± 0.164
MetSer: 2.967 ± 1.483
MetThr: 1.619 ± 0.489
MetVal: 1.888 ± 0.61
MetTrp: 1.079 ± 1.082
MetTyr: 1.349 ± 0.536
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.079 ± 0.693
AsnCys: 0.809 ± 0.491
AsnAsp: 0.809 ± 0.34
AsnGlu: 2.158 ± 0.595
AsnPhe: 1.888 ± 0.693
AsnGly: 1.888 ± 0.761
AsnHis: 1.619 ± 0.367
AsnIle: 2.158 ± 0.248
AsnLys: 1.619 ± 0.748
AsnLeu: 4.046 ± 1.654
AsnMet: 0.27 ± 0.449
AsnAsn: 1.619 ± 0.748
AsnPro: 2.698 ± 0.77
AsnGln: 1.349 ± 0.417
AsnArg: 1.349 ± 0.684
AsnSer: 2.158 ± 0.824
AsnThr: 1.619 ± 0.599
AsnVal: 1.619 ± 0.748
AsnTrp: 1.079 ± 0.391
AsnTyr: 0.27 ± 0.164
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.237 ± 1.248
ProCys: 0.809 ± 0.34
ProAsp: 4.316 ± 0.705
ProGlu: 3.237 ± 1.809
ProPhe: 2.428 ± 0.888
ProGly: 2.698 ± 0.658
ProHis: 2.158 ± 0.493
ProIle: 2.158 ± 0.625
ProLys: 1.619 ± 0.656
ProLeu: 4.316 ± 1.075
ProMet: 0.54 ± 0.328
ProAsn: 1.888 ± 0.478
ProPro: 1.619 ± 0.906
ProGln: 1.079 ± 0.685
ProArg: 2.428 ± 0.88
ProSer: 4.856 ± 2.235
ProThr: 3.777 ± 1.452
ProVal: 3.507 ± 2.165
ProTrp: 1.349 ± 0.395
ProTyr: 2.698 ± 1.681
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.158 ± 1.032
GlnCys: 0.0 ± 0.0
GlnAsp: 1.079 ± 0.547
GlnGlu: 2.967 ± 1.363
GlnPhe: 1.079 ± 0.655
GlnGly: 2.967 ± 1.078
GlnHis: 0.54 ± 0.363
GlnIle: 1.888 ± 1.056
GlnLys: 2.158 ± 0.595
GlnLeu: 2.428 ± 0.772
GlnMet: 1.079 ± 0.403
GlnAsn: 0.809 ± 0.491
GlnPro: 1.079 ± 0.391
GlnGln: 1.349 ± 0.417
GlnArg: 1.888 ± 0.199
GlnSer: 4.316 ± 1.29
GlnThr: 2.428 ± 1.785
GlnVal: 1.888 ± 0.91
GlnTrp: 0.27 ± 0.164
GlnTyr: 1.079 ± 0.693
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.316 ± 1.335
ArgCys: 0.27 ± 0.164
ArgAsp: 3.507 ± 1.043
ArgGlu: 3.237 ± 0.516
ArgPhe: 2.967 ± 0.909
ArgGly: 4.316 ± 1.388
ArgHis: 0.54 ± 0.328
ArgIle: 2.967 ± 0.654
ArgLys: 2.158 ± 0.248
ArgLeu: 5.125 ± 1.497
ArgMet: 1.888 ± 0.673
ArgAsn: 1.349 ± 0.623
ArgPro: 3.237 ± 0.909
ArgGln: 2.428 ± 1.085
ArgArg: 3.777 ± 1.545
ArgSer: 7.553 ± 1.241
ArgThr: 3.777 ± 1.198
ArgVal: 2.428 ± 0.869
ArgTrp: 1.349 ± 0.307
ArgTyr: 2.158 ± 0.883
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.474 ± 0.972
SerCys: 1.349 ± 0.678
SerAsp: 4.316 ± 0.926
SerGlu: 5.665 ± 1.518
SerPhe: 4.316 ± 1.229
SerGly: 6.744 ± 0.83
SerHis: 2.158 ± 0.987
SerIle: 5.125 ± 1.574
SerLys: 2.698 ± 0.901
SerLeu: 11.869 ± 3.286
SerMet: 1.888 ± 1.469
SerAsn: 3.777 ± 0.397
SerPro: 5.395 ± 2.198
SerGln: 2.967 ± 1.585
SerArg: 4.586 ± 0.758
SerSer: 10.251 ± 3.239
SerThr: 4.856 ± 1.579
SerVal: 8.093 ± 2.096
SerTrp: 1.888 ± 0.477
SerTyr: 2.967 ± 1.552
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.507 ± 1.206
ThrCys: 1.079 ± 1.246
ThrAsp: 2.967 ± 0.932
ThrGlu: 2.967 ± 0.901
ThrPhe: 2.428 ± 0.654
ThrGly: 4.586 ± 0.872
ThrHis: 1.888 ± 0.199
ThrIle: 3.507 ± 0.688
ThrLys: 3.777 ± 0.704
ThrLeu: 4.316 ± 0.786
ThrMet: 1.619 ± 0.621
ThrAsn: 2.428 ± 0.468
ThrPro: 4.316 ± 2.836
ThrGln: 0.809 ± 0.418
ThrArg: 2.698 ± 0.766
ThrSer: 5.125 ± 1.233
ThrThr: 3.237 ± 0.969
ThrVal: 4.586 ± 1.198
ThrTrp: 1.349 ± 0.623
ThrTyr: 0.54 ± 0.342
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.586 ± 0.977
ValCys: 1.349 ± 0.307
ValAsp: 4.856 ± 0.555
ValGlu: 4.856 ± 0.821
ValPhe: 2.698 ± 0.516
ValGly: 2.698 ± 0.77
ValHis: 0.0 ± 0.0
ValIle: 2.698 ± 0.777
ValLys: 3.507 ± 0.431
ValLeu: 5.395 ± 0.291
ValMet: 1.079 ± 0.943
ValAsn: 2.158 ± 0.769
ValPro: 4.586 ± 1.342
ValGln: 1.888 ± 0.882
ValArg: 2.428 ± 0.838
ValSer: 4.586 ± 1.407
ValThr: 2.428 ± 0.922
ValVal: 2.698 ± 0.77
ValTrp: 0.54 ± 0.543
ValTyr: 1.888 ± 0.478
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.809 ± 0.708
TrpCys: 0.54 ± 0.346
TrpAsp: 0.27 ± 0.379
TrpGlu: 2.158 ± 0.656
TrpPhe: 1.349 ± 0.684
TrpGly: 1.619 ± 0.669
TrpHis: 0.809 ± 0.387
TrpIle: 1.619 ± 0.569
TrpLys: 1.888 ± 0.478
TrpLeu: 1.888 ± 0.478
TrpMet: 0.0 ± 0.0
TrpAsn: 0.54 ± 0.328
TrpPro: 0.809 ± 0.508
TrpGln: 0.54 ± 0.346
TrpArg: 1.349 ± 0.882
TrpSer: 1.888 ± 0.478
TrpThr: 0.809 ± 0.418
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.349 ± 0.494
TyrCys: 1.349 ± 0.368
TyrAsp: 1.619 ± 1.03
TyrGlu: 1.079 ± 0.652
TyrPhe: 0.54 ± 0.328
TyrGly: 1.349 ± 0.698
TyrHis: 0.0 ± 0.0
TyrIle: 0.809 ± 0.782
TyrLys: 2.428 ± 0.942
TyrLeu: 4.856 ± 1.811
TyrMet: 0.27 ± 0.379
TyrAsn: 2.158 ± 0.493
TyrPro: 1.079 ± 0.391
TyrGln: 1.079 ± 0.655
TyrArg: 1.888 ± 0.478
TyrSer: 4.856 ± 1.158
TyrThr: 2.698 ± 0.841
TyrVal: 1.888 ± 0.598
TyrTrp: 0.809 ± 0.418
TyrTyr: 0.809 ± 0.491
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3708 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
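A protein-level bootstrap of this kind can be sketched in Python as below. The sketch assumes that whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation over 100 replicates is reported as the error. The exact procedure used by the database is not described here, so the function names, the seed, and the toy sequences are all illustrative assumptions.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, counted within each protein."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(1 for seq in proteins
               for i in range(len(seq) - 1)
               if seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total


def bootstrap_sd(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (same number as the original set).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the Merida virus sequences.
toy = ["MKLLSA", "MSLLK", "MAAAK"]
print(dipeptide_permille(toy, "LL"), "±", bootstrap_sd(toy, "LL"))
```

With only a handful of proteins, as in this proteome, the replicates differ mainly in which proteins are drawn more than once, so the resulting errors can be large relative to the values themselves.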

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski