Amino acid dipepetide frequency for Merida-like virus KE-2017a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.698AlaAla: 2.698 ± 1.143
0.809AlaCys: 0.809 ± 0.676
4.046AlaAsp: 4.046 ± 1.016
3.237AlaGlu: 3.237 ± 0.782
2.158AlaPhe: 2.158 ± 0.584
2.967AlaGly: 2.967 ± 1.163
0.54AlaHis: 0.54 ± 0.556
4.316AlaIle: 4.316 ± 1.257
3.237AlaLys: 3.237 ± 1.116
5.935AlaLeu: 5.935 ± 1.087
0.27AlaMet: 0.27 ± 0.157
1.079AlaAsn: 1.079 ± 0.355
2.428AlaPro: 2.428 ± 0.339
1.619AlaGln: 1.619 ± 0.312
4.586AlaArg: 4.586 ± 0.884
4.586AlaSer: 4.586 ± 1.033
3.777AlaThr: 3.777 ± 0.941
3.777AlaVal: 3.777 ± 1.046
0.27AlaTrp: 0.27 ± 0.376
2.967AlaTyr: 2.967 ± 0.967
0.0AlaXaa: 0.0 ± 0.0
Cys
0.809CysAla: 0.809 ± 0.296
0.0CysCys: 0.0 ± 0.0
1.079CysAsp: 1.079 ± 0.378
0.54CysGlu: 0.54 ± 0.32
1.349CysPhe: 1.349 ± 0.517
0.54CysGly: 0.54 ± 0.315
0.54CysHis: 0.54 ± 0.315
0.27CysIle: 0.27 ± 0.157
1.619CysLys: 1.619 ± 0.855
2.428CysLeu: 2.428 ± 0.345
0.0CysMet: 0.0 ± 0.0
0.27CysAsn: 0.27 ± 0.157
1.349CysPro: 1.349 ± 0.919
0.27CysGln: 0.27 ± 0.157
0.54CysArg: 0.54 ± 0.752
1.349CysSer: 1.349 ± 0.986
1.619CysThr: 1.619 ± 0.942
0.809CysVal: 0.809 ± 0.326
0.54CysTrp: 0.54 ± 0.313
0.27CysTyr: 0.27 ± 0.376
0.0CysXaa: 0.0 ± 0.0
Asp
1.079AspAla: 1.079 ± 0.378
2.158AspCys: 2.158 ± 0.858
2.698AspAsp: 2.698 ± 0.684
3.777AspGlu: 3.777 ± 1.695
2.428AspPhe: 2.428 ± 0.742
4.586AspGly: 4.586 ± 1.035
1.079AspHis: 1.079 ± 0.403
3.507AspIle: 3.507 ± 0.306
3.237AspLys: 3.237 ± 1.024
8.093AspLeu: 8.093 ± 1.53
2.428AspMet: 2.428 ± 0.742
1.079AspAsn: 1.079 ± 0.442
3.237AspPro: 3.237 ± 0.605
2.158AspGln: 2.158 ± 0.66
2.428AspArg: 2.428 ± 0.339
3.507AspSer: 3.507 ± 0.662
2.428AspThr: 2.428 ± 0.874
2.158AspVal: 2.158 ± 0.708
1.349AspTrp: 1.349 ± 0.59
2.428AspTyr: 2.428 ± 0.718
0.0AspXaa: 0.0 ± 0.0
Glu
4.316GluAla: 4.316 ± 0.926
0.809GluCys: 0.809 ± 0.676
2.428GluAsp: 2.428 ± 0.758
4.316GluGlu: 4.316 ± 2.601
3.507GluPhe: 3.507 ± 0.944
5.395GluGly: 5.395 ± 0.816
1.619GluHis: 1.619 ± 0.557
4.316GluIle: 4.316 ± 0.856
2.428GluLys: 2.428 ± 0.678
4.046GluLeu: 4.046 ± 1.06
0.809GluMet: 0.809 ± 0.346
2.967GluAsn: 2.967 ± 0.756
3.237GluPro: 3.237 ± 0.856
1.349GluGln: 1.349 ± 0.551
2.967GluArg: 2.967 ± 0.731
5.665GluSer: 5.665 ± 1.283
2.967GluThr: 2.967 ± 1.23
3.237GluVal: 3.237 ± 0.908
0.809GluTrp: 0.809 ± 0.676
3.237GluTyr: 3.237 ± 1.073
0.0GluXaa: 0.0 ± 0.0
Phe
0.809PheAla: 0.809 ± 0.346
0.54PheCys: 0.54 ± 0.313
2.158PheAsp: 2.158 ± 0.876
1.079PheGlu: 1.079 ± 0.451
2.428PhePhe: 2.428 ± 0.979
1.888PheGly: 1.888 ± 0.79
1.888PheHis: 1.888 ± 1.096
2.698PheIle: 2.698 ± 1.565
1.888PheLys: 1.888 ± 0.409
4.316PheLeu: 4.316 ± 0.628
1.349PheMet: 1.349 ± 0.67
1.619PheAsn: 1.619 ± 0.652
2.698PhePro: 2.698 ± 0.757
1.619PheGln: 1.619 ± 0.417
4.046PheArg: 4.046 ± 0.565
5.665PheSer: 5.665 ± 1.701
2.158PheThr: 2.158 ± 0.737
3.237PheVal: 3.237 ± 0.848
1.079PheTrp: 1.079 ± 0.325
1.619PheTyr: 1.619 ± 0.704
0.0PheXaa: 0.0 ± 0.0
Gly
2.428GlyAla: 2.428 ± 0.566
0.54GlyCys: 0.54 ± 0.314
4.046GlyAsp: 4.046 ± 1.319
4.856GlyGlu: 4.856 ± 1.137
1.888GlyPhe: 1.888 ± 0.79
6.474GlyGly: 6.474 ± 0.665
1.349GlyHis: 1.349 ± 0.551
3.777GlyIle: 3.777 ± 0.796
4.046GlyLys: 4.046 ± 0.814
9.711GlyLeu: 9.711 ± 0.588
1.349GlyMet: 1.349 ± 0.605
2.158GlyAsn: 2.158 ± 0.66
2.158GlyPro: 2.158 ± 0.906
3.777GlyGln: 3.777 ± 0.544
4.316GlyArg: 4.316 ± 1.35
5.935GlySer: 5.935 ± 1.712
4.586GlyThr: 4.586 ± 1.324
3.507GlyVal: 3.507 ± 1.83
1.079GlyTrp: 1.079 ± 0.628
2.158GlyTyr: 2.158 ± 1.247
0.0GlyXaa: 0.0 ± 0.0
His
0.54HisAla: 0.54 ± 0.315
0.27HisCys: 0.27 ± 0.398
0.809HisAsp: 0.809 ± 0.333
0.54HisGlu: 0.54 ± 0.313
0.809HisPhe: 0.809 ± 0.47
1.619HisGly: 1.619 ± 0.558
0.54HisHis: 0.54 ± 0.315
1.619HisIle: 1.619 ± 0.649
0.54HisLys: 0.54 ± 0.556
3.237HisLeu: 3.237 ± 0.634
1.619HisMet: 1.619 ± 1.043
0.809HisAsn: 0.809 ± 0.47
1.349HisPro: 1.349 ± 0.622
0.809HisGln: 0.809 ± 0.47
1.619HisArg: 1.619 ± 0.591
1.349HisSer: 1.349 ± 0.342
0.0HisThr: 0.0 ± 0.0
1.079HisVal: 1.079 ± 0.378
0.54HisTrp: 0.54 ± 0.752
0.809HisTyr: 0.809 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
1.349IleAla: 1.349 ± 0.524
0.809IleCys: 0.809 ± 0.428
3.237IleAsp: 3.237 ± 0.465
3.237IleGlu: 3.237 ± 0.709
3.237IlePhe: 3.237 ± 0.73
4.586IleGly: 4.586 ± 0.66
0.809IleHis: 0.809 ± 0.428
2.967IleIle: 2.967 ± 0.967
5.125IleLys: 5.125 ± 1.201
6.744IleLeu: 6.744 ± 1.797
1.079IleMet: 1.079 ± 0.364
1.619IleAsn: 1.619 ± 0.206
3.777IlePro: 3.777 ± 1.92
2.967IleGln: 2.967 ± 0.546
4.316IleArg: 4.316 ± 0.579
6.744IleSer: 6.744 ± 1.261
2.158IleThr: 2.158 ± 0.48
3.237IleVal: 3.237 ± 0.869
0.54IleTrp: 0.54 ± 0.314
1.349IleTyr: 1.349 ± 0.567
0.0IleXaa: 0.0 ± 0.0
Lys
4.586LysAla: 4.586 ± 1.206
1.079LysCys: 1.079 ± 0.442
2.967LysAsp: 2.967 ± 0.334
3.507LysGlu: 3.507 ± 1.534
1.349LysPhe: 1.349 ± 0.986
4.046LysGly: 4.046 ± 2.88
0.0LysHis: 0.0 ± 0.0
4.856LysIle: 4.856 ± 0.809
3.237LysLys: 3.237 ± 1.503
4.586LysLeu: 4.586 ± 0.636
1.079LysMet: 1.079 ± 0.41
2.158LysAsn: 2.158 ± 0.737
2.698LysPro: 2.698 ± 0.821
1.888LysGln: 1.888 ± 0.839
3.507LysArg: 3.507 ± 1.065
5.395LysSer: 5.395 ± 1.522
3.507LysThr: 3.507 ± 0.907
3.237LysVal: 3.237 ± 0.479
1.349LysTrp: 1.349 ± 0.783
2.158LysTyr: 2.158 ± 0.847
0.0LysXaa: 0.0 ± 0.0
Leu
7.284LeuAla: 7.284 ± 0.621
1.619LeuCys: 1.619 ± 0.413
7.553LeuAsp: 7.553 ± 1.614
7.553LeuGlu: 7.553 ± 1.556
4.046LeuPhe: 4.046 ± 1.005
6.744LeuGly: 6.744 ± 1.507
1.349LeuHis: 1.349 ± 0.486
7.553LeuIle: 7.553 ± 1.293
6.744LeuLys: 6.744 ± 2.071
7.823LeuLeu: 7.823 ± 0.946
2.967LeuMet: 2.967 ± 0.783
2.428LeuAsn: 2.428 ± 0.813
4.046LeuPro: 4.046 ± 1.21
2.428LeuGln: 2.428 ± 0.892
8.093LeuArg: 8.093 ± 1.416
8.632LeuSer: 8.632 ± 1.875
6.204LeuThr: 6.204 ± 1.346
4.586LeuVal: 4.586 ± 1.226
1.349LeuTrp: 1.349 ± 0.486
2.428LeuTyr: 2.428 ± 0.529
0.0LeuXaa: 0.0 ± 0.0
Met
1.619MetAla: 1.619 ± 0.942
0.27MetCys: 0.27 ± 0.337
1.079MetAsp: 1.079 ± 0.442
1.079MetGlu: 1.079 ± 0.41
1.349MetPhe: 1.349 ± 0.649
2.428MetGly: 2.428 ± 0.808
0.54MetHis: 0.54 ± 0.313
0.54MetIle: 0.54 ± 0.32
1.888MetLys: 1.888 ± 0.651
1.619MetLeu: 1.619 ± 1.027
0.809MetMet: 0.809 ± 0.47
0.54MetAsn: 0.54 ± 0.313
0.54MetPro: 0.54 ± 0.313
0.27MetGln: 0.27 ± 0.337
0.0MetArg: 0.0 ± 0.0
2.967MetSer: 2.967 ± 1.37
1.349MetThr: 1.349 ± 0.567
1.349MetVal: 1.349 ± 0.342
1.079MetTrp: 1.079 ± 1.095
1.079MetTyr: 1.079 ± 0.41
0.0MetXaa: 0.0 ± 0.0
Asn
1.079AsnAla: 1.079 ± 0.628
0.809AsnCys: 0.809 ± 0.47
0.809AsnAsp: 0.809 ± 0.326
1.888AsnGlu: 1.888 ± 0.548
1.619AsnPhe: 1.619 ± 0.591
1.888AsnGly: 1.888 ± 0.515
1.619AsnHis: 1.619 ± 0.312
1.888AsnIle: 1.888 ± 0.129
1.888AsnLys: 1.888 ± 0.953
3.507AsnLeu: 3.507 ± 1.757
0.27AsnMet: 0.27 ± 0.376
1.079AsnAsn: 1.079 ± 0.626
2.428AsnPro: 2.428 ± 0.781
1.349AsnGln: 1.349 ± 0.342
2.158AsnArg: 2.158 ± 0.847
2.158AsnSer: 2.158 ± 0.638
1.619AsnThr: 1.619 ± 0.558
2.158AsnVal: 2.158 ± 0.622
1.079AsnTrp: 1.079 ± 0.626
0.27AsnTyr: 0.27 ± 0.157
0.0AsnXaa: 0.0 ± 0.0
Pro
2.967ProAla: 2.967 ± 1.095
1.079ProCys: 1.079 ± 0.403
3.507ProAsp: 3.507 ± 0.943
3.237ProGlu: 3.237 ± 1.47
2.158ProPhe: 2.158 ± 0.723
3.237ProGly: 3.237 ± 1.048
1.888ProHis: 1.888 ± 0.515
2.698ProIle: 2.698 ± 0.829
1.619ProLys: 1.619 ± 0.596
4.856ProLeu: 4.856 ± 1.647
0.809ProMet: 0.809 ± 0.47
1.888ProAsn: 1.888 ± 0.515
1.888ProPro: 1.888 ± 1.113
1.079ProGln: 1.079 ± 0.607
1.888ProArg: 1.888 ± 0.515
5.935ProSer: 5.935 ± 2.299
3.237ProThr: 3.237 ± 0.954
3.237ProVal: 3.237 ± 1.135
1.349ProTrp: 1.349 ± 0.342
2.428ProTyr: 2.428 ± 1.945
0.0ProXaa: 0.0 ± 0.0
Gln
1.888GlnAla: 1.888 ± 0.89
0.0GlnCys: 0.0 ± 0.0
1.079GlnAsp: 1.079 ± 0.451
2.967GlnGlu: 2.967 ± 0.761
1.079GlnPhe: 1.079 ± 0.626
2.158GlnGly: 2.158 ± 0.584
0.27GlnHis: 0.27 ± 0.376
2.428GlnIle: 2.428 ± 0.994
2.698GlnLys: 2.698 ± 0.829
2.428GlnLeu: 2.428 ± 0.566
0.54GlnMet: 0.54 ± 0.313
0.809GlnAsn: 0.809 ± 0.47
1.619GlnPro: 1.619 ± 0.558
1.079GlnGln: 1.079 ± 0.355
2.698GlnArg: 2.698 ± 0.78
4.586GlnSer: 4.586 ± 0.843
1.888GlnThr: 1.888 ± 1.266
2.158GlnVal: 2.158 ± 0.638
0.27GlnTrp: 0.27 ± 0.157
1.079GlnTyr: 1.079 ± 0.628
0.0GlnXaa: 0.0 ± 0.0
Arg
3.777ArgAla: 3.777 ± 0.826
0.27ArgCys: 0.27 ± 0.157
4.046ArgAsp: 4.046 ± 0.405
2.967ArgGlu: 2.967 ± 0.734
3.237ArgPhe: 3.237 ± 0.969
5.125ArgGly: 5.125 ± 1.612
0.809ArgHis: 0.809 ± 0.47
2.967ArgIle: 2.967 ± 0.797
2.158ArgLys: 2.158 ± 0.535
4.586ArgLeu: 4.586 ± 1.031
1.349ArgMet: 1.349 ± 0.538
1.349ArgAsn: 1.349 ± 0.935
2.698ArgPro: 2.698 ± 0.946
2.967ArgGln: 2.967 ± 1.048
2.967ArgArg: 2.967 ± 0.967
6.744ArgSer: 6.744 ± 0.798
4.856ArgThr: 4.856 ± 0.883
2.698ArgVal: 2.698 ± 0.78
1.619ArgTrp: 1.619 ± 0.312
1.619ArgTyr: 1.619 ± 0.942
0.0ArgXaa: 0.0 ± 0.0
Ser
6.744SerAla: 6.744 ± 0.479
0.809SerCys: 0.809 ± 0.346
4.586SerAsp: 4.586 ± 0.942
5.665SerGlu: 5.665 ± 1.433
4.856SerPhe: 4.856 ± 0.592
7.823SerGly: 7.823 ± 1.193
2.698SerHis: 2.698 ± 0.483
6.204SerIle: 6.204 ± 1.008
3.237SerLys: 3.237 ± 0.791
11.869SerLeu: 11.869 ± 2.653
1.888SerMet: 1.888 ± 1.315
2.698SerAsn: 2.698 ± 0.483
5.125SerPro: 5.125 ± 1.656
3.237SerGln: 3.237 ± 0.782
3.507SerArg: 3.507 ± 0.662
9.981SerSer: 9.981 ± 1.701
4.856SerThr: 4.856 ± 1.291
7.014SerVal: 7.014 ± 1.728
1.349SerTrp: 1.349 ± 0.517
2.967SerTyr: 2.967 ± 1.693
0.0SerXaa: 0.0 ± 0.0
Thr
4.856ThrAla: 4.856 ± 1.188
1.079ThrCys: 1.079 ± 1.048
2.698ThrAsp: 2.698 ± 0.688
1.888ThrGlu: 1.888 ± 0.495
2.698ThrPhe: 2.698 ± 0.676
3.237ThrGly: 3.237 ± 0.341
2.158ThrHis: 2.158 ± 0.298
2.967ThrIle: 2.967 ± 0.498
3.777ThrLys: 3.777 ± 0.531
4.316ThrLeu: 4.316 ± 0.318
1.619ThrMet: 1.619 ± 0.507
2.158ThrAsn: 2.158 ± 0.535
3.507ThrPro: 3.507 ± 1.299
1.079ThrGln: 1.079 ± 0.319
2.967ThrArg: 2.967 ± 1.018
5.125ThrSer: 5.125 ± 1.102
3.507ThrThr: 3.507 ± 0.963
3.507ThrVal: 3.507 ± 1.398
2.158ThrTrp: 2.158 ± 0.424
0.809ThrTyr: 0.809 ± 0.622
0.0ThrXaa: 0.0 ± 0.0
Val
4.316ValAla: 4.316 ± 0.961
1.349ValCys: 1.349 ± 0.278
4.856ValAsp: 4.856 ± 0.896
4.316ValGlu: 4.316 ± 0.405
2.698ValPhe: 2.698 ± 0.483
2.698ValGly: 2.698 ± 0.684
0.0ValHis: 0.0 ± 0.0
2.158ValIle: 2.158 ± 0.806
3.507ValLys: 3.507 ± 0.658
4.856ValLeu: 4.856 ± 0.436
1.619ValMet: 1.619 ± 0.982
2.428ValAsn: 2.428 ± 0.586
4.316ValPro: 4.316 ± 0.952
2.158ValGln: 2.158 ± 1.022
2.698ValArg: 2.698 ± 1.02
4.316ValSer: 4.316 ± 1.439
2.428ValThr: 2.428 ± 0.339
3.507ValVal: 3.507 ± 1.54
0.27ValTrp: 0.27 ± 0.398
1.888ValTyr: 1.888 ± 0.409
0.0ValXaa: 0.0 ± 0.0
Trp
0.809TrpAla: 0.809 ± 0.7
0.54TrpCys: 0.54 ± 0.314
0.27TrpAsp: 0.27 ± 0.398
2.428TrpGlu: 2.428 ± 0.946
1.349TrpPhe: 1.349 ± 0.622
1.349TrpGly: 1.349 ± 0.342
0.809TrpHis: 0.809 ± 0.296
1.349TrpIle: 1.349 ± 0.649
1.888TrpLys: 1.888 ± 0.495
1.888TrpLeu: 1.888 ± 0.409
0.0TrpMet: 0.0 ± 0.0
0.27TrpAsn: 0.27 ± 0.157
0.809TrpPro: 0.809 ± 0.375
0.54TrpGln: 0.54 ± 0.314
1.619TrpArg: 1.619 ± 0.618
1.888TrpSer: 1.888 ± 0.409
0.809TrpThr: 0.809 ± 0.326
0.27TrpVal: 0.27 ± 0.157
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.349TyrAla: 1.349 ± 0.517
1.349TyrCys: 1.349 ± 0.334
1.888TyrAsp: 1.888 ± 0.839
1.349TyrGlu: 1.349 ± 0.713
0.54TyrPhe: 0.54 ± 0.313
1.349TyrGly: 1.349 ± 0.67
0.0TyrHis: 0.0 ± 0.0
0.809TyrIle: 0.809 ± 0.755
2.428TyrLys: 2.428 ± 0.742
4.586TyrLeu: 4.586 ± 1.986
0.27TyrMet: 0.27 ± 0.398
2.158TyrAsn: 2.158 ± 0.398
1.079TyrPro: 1.079 ± 0.403
1.079TyrGln: 1.079 ± 0.626
2.158TyrArg: 2.158 ± 0.535
4.586TyrSer: 4.586 ± 1.228
2.158TyrThr: 2.158 ± 0.538
1.619TyrVal: 1.619 ± 0.704
0.809TyrTrp: 0.809 ± 0.428
0.809TyrTyr: 0.809 ± 0.47
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3708 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski