Amino acid dipeptide frequency for Munguba virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
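As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs in a list of sequences and reports them in per mille. It uses one-letter codes rather than the three-letter names in the table; the function name, the mapping of unknown letters to X (Xaa), and the choice of normalising by the total number of pairs are assumptions for illustration, not the database's documented procedure.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the Munguba virus proteome).
freqs = dipeptide_frequencies(["MAAS", "AAB"])
```

Here the letter B is not a standard residue, so the pair "AB" is counted as "AX".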

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
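The uniform expectation and the over/under comparison can be sketched as follows. This assumes the threshold is simply the uniform value of 1/400 (or 1/441 with Xaa); the exact rule the database uses for colouring is not stated here, and the function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected=None):
    """Label a dipeptide relative to the uniform expectation."""
    if expected is None:
        expected = expected_per_mille()
    if observed_per_mille > expected:
        return "over"      # shown in red on the original page
    if observed_per_mille < expected:
        return "under"     # shown in blue on the original page
    return "expected"

# Two values taken from the table: SerSer and GluTrp.
labels = {"SerSer": classify(10.94), "GluTrp": classify(0.249)}
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st residue lowers it to 1000/441, roughly 2.27‰.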

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
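For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# SerSer from the table: 10.94 per mille.
serser_fraction = per_mille_to_fraction(10.94)
serser_percent = per_mille_to_percent(10.94)
```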

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue; within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.719 ± 5.284
AlaCys: 1.74 ± 1.279
AlaAsp: 1.989 ± 1.077
AlaGlu: 3.481 ± 0.432
AlaPhe: 2.486 ± 0.755
AlaGly: 2.735 ± 1.101
AlaHis: 2.486 ± 0.755
AlaIle: 3.729 ± 1.463
AlaLys: 1.989 ± 0.488
AlaLeu: 5.221 ± 1.679
AlaMet: 1.989 ± 0.735
AlaAsn: 1.243 ± 0.454
AlaPro: 2.486 ± 0.647
AlaGln: 1.989 ± 0.561
AlaArg: 1.989 ± 0.488
AlaSer: 5.719 ± 1.448
AlaThr: 3.978 ± 1.022
AlaVal: 4.475 ± 1.534
AlaTrp: 0.497 ± 0.14
AlaTyr: 1.989 ± 1.242
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.492 ± 0.642
CysCys: 0.497 ± 0.323
CysAsp: 0.249 ± 0.161
CysGlu: 0.497 ± 0.14
CysPhe: 2.735 ± 0.886
CysGly: 0.497 ± 0.424
CysHis: 0.746 ± 0.321
CysIle: 0.995 ± 0.354
CysLys: 2.735 ± 0.545
CysLeu: 2.486 ± 0.89
CysMet: 0.746 ± 0.484
CysAsn: 0.995 ± 0.28
CysPro: 1.74 ± 0.453
CysGln: 1.492 ± 0.867
CysArg: 1.492 ± 0.642
CysSer: 3.232 ± 1.213
CysThr: 2.238 ± 0.716
CysVal: 1.492 ± 0.421
CysTrp: 0.0 ± 0.0
CysTyr: 1.492 ± 0.497
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.735 ± 1.98
AspCys: 1.492 ± 1.089
AspAsp: 5.221 ± 1.203
AspGlu: 4.227 ± 1.566
AspPhe: 1.989 ± 0.502
AspGly: 3.729 ± 0.655
AspHis: 1.243 ± 0.779
AspIle: 2.486 ± 1.301
AspLys: 2.735 ± 0.897
AspLeu: 8.205 ± 1.311
AspMet: 1.74 ± 0.645
AspAsn: 2.486 ± 0.609
AspPro: 2.486 ± 0.54
AspGln: 1.243 ± 0.932
AspArg: 1.989 ± 1.274
AspSer: 4.973 ± 1.0
AspThr: 2.238 ± 0.647
AspVal: 2.238 ± 0.893
AspTrp: 0.995 ± 0.28
AspTyr: 2.238 ± 0.576
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.475 ± 1.333
GluCys: 1.492 ± 0.421
GluAsp: 4.973 ± 1.564
GluGlu: 3.481 ± 1.291
GluPhe: 4.475 ± 1.482
GluGly: 2.486 ± 0.438
GluHis: 0.995 ± 0.354
GluIle: 5.47 ± 1.526
GluLys: 2.984 ± 1.045
GluLeu: 6.216 ± 2.044
GluMet: 1.243 ± 0.41
GluAsn: 2.238 ± 0.916
GluPro: 2.238 ± 0.45
GluGln: 2.486 ± 0.701
GluArg: 4.227 ± 1.377
GluSer: 5.967 ± 1.662
GluThr: 2.238 ± 0.586
GluVal: 4.973 ± 0.54
GluTrp: 0.249 ± 0.161
GluTyr: 2.238 ± 0.463
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.74 ± 1.094
PheCys: 1.74 ± 0.844
PheAsp: 2.735 ± 0.817
PheGlu: 3.232 ± 0.766
PhePhe: 2.238 ± 0.421
PheGly: 2.238 ± 1.031
PheHis: 0.497 ± 0.14
PheIle: 1.989 ± 0.426
PheLys: 2.735 ± 1.461
PheLeu: 4.227 ± 1.417
PheMet: 0.995 ± 0.637
PheAsn: 2.238 ± 0.805
PhePro: 2.238 ± 0.586
PheGln: 0.995 ± 0.28
PheArg: 2.735 ± 0.747
PheSer: 4.973 ± 0.594
PheThr: 2.735 ± 0.897
PheVal: 4.227 ± 0.687
PheTrp: 0.746 ± 0.216
PheTyr: 0.746 ± 0.484
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.221 ± 0.549
GlyCys: 1.989 ± 0.535
GlyAsp: 2.735 ± 0.471
GlyGlu: 1.989 ± 1.214
GlyPhe: 4.973 ± 1.579
GlyGly: 5.47 ± 1.244
GlyHis: 1.243 ± 0.506
GlyIle: 3.729 ± 0.726
GlyLys: 3.978 ± 0.852
GlyLeu: 4.724 ± 1.838
GlyMet: 1.989 ± 0.458
GlyAsn: 1.74 ± 1.263
GlyPro: 2.238 ± 0.859
GlyGln: 1.243 ± 0.734
GlyArg: 2.735 ± 1.238
GlySer: 6.216 ± 0.609
GlyThr: 2.238 ± 0.463
GlyVal: 2.984 ± 0.759
GlyTrp: 0.249 ± 0.161
GlyTyr: 0.746 ± 0.216
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.243 ± 0.512
HisCys: 0.746 ± 0.321
HisAsp: 1.243 ± 0.506
HisGlu: 0.746 ± 0.216
HisPhe: 1.243 ± 0.326
HisGly: 1.492 ± 0.663
HisHis: 0.249 ± 0.161
HisIle: 2.735 ± 0.747
HisLys: 1.74 ± 0.581
HisLeu: 2.238 ± 1.04
HisMet: 0.249 ± 0.161
HisAsn: 0.746 ± 0.484
HisPro: 0.746 ± 0.484
HisGln: 1.243 ± 0.506
HisArg: 0.995 ± 0.645
HisSer: 1.989 ± 0.998
HisThr: 1.492 ± 1.672
HisVal: 0.746 ± 0.216
HisTrp: 0.0 ± 0.0
HisTyr: 1.492 ± 0.431
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.735 ± 0.877
IleCys: 2.486 ± 0.895
IleAsp: 4.724 ± 1.495
IleGlu: 5.221 ± 1.21
IlePhe: 1.989 ± 0.535
IleGly: 4.475 ± 1.173
IleHis: 1.74 ± 0.453
IleIle: 7.21 ± 1.393
IleLys: 3.729 ± 0.655
IleLeu: 5.47 ± 0.559
IleMet: 1.989 ± 0.708
IleAsn: 3.978 ± 0.852
IlePro: 2.735 ± 0.471
IleGln: 1.74 ± 0.453
IleArg: 4.475 ± 1.184
IleSer: 7.708 ± 1.048
IleThr: 2.486 ± 0.647
IleVal: 2.735 ± 0.625
IleTrp: 0.249 ± 0.161
IleTyr: 1.989 ± 0.774
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.973 ± 0.828
LysCys: 1.492 ± 0.421
LysAsp: 2.984 ± 1.062
LysGlu: 5.967 ± 1.165
LysPhe: 1.74 ± 0.453
LysGly: 2.486 ± 0.895
LysHis: 1.492 ± 0.534
LysIle: 4.227 ± 1.269
LysLys: 4.475 ± 0.231
LysLeu: 5.719 ± 1.37
LysMet: 3.232 ± 1.198
LysAsn: 2.984 ± 0.777
LysPro: 1.492 ± 0.55
LysGln: 0.995 ± 0.354
LysArg: 3.729 ± 1.138
LysSer: 4.973 ± 1.036
LysThr: 6.216 ± 1.343
LysVal: 4.227 ± 0.902
LysTrp: 0.995 ± 0.645
LysTyr: 1.989 ± 0.607
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.221 ± 0.509
LeuCys: 2.984 ± 1.045
LeuAsp: 4.227 ± 0.805
LeuGlu: 5.967 ± 1.309
LeuPhe: 4.227 ± 1.306
LeuGly: 2.984 ± 0.417
LeuHis: 1.492 ± 0.819
LeuIle: 7.708 ± 0.843
LeuLys: 8.702 ± 1.122
LeuLeu: 4.973 ± 0.824
LeuMet: 3.232 ± 1.555
LeuAsn: 4.227 ± 0.769
LeuPro: 3.232 ± 0.972
LeuGln: 2.984 ± 0.944
LeuArg: 7.956 ± 1.658
LeuSer: 9.199 ± 1.329
LeuThr: 3.481 ± 1.031
LeuVal: 3.978 ± 1.806
LeuTrp: 0.746 ± 0.876
LeuTyr: 1.74 ± 0.629
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.995 ± 0.423
MetCys: 0.497 ± 0.14
MetAsp: 1.989 ± 0.735
MetGlu: 1.74 ± 0.564
MetPhe: 2.486 ± 0.754
MetGly: 2.238 ± 1.04
MetHis: 0.995 ± 0.637
MetIle: 2.735 ± 0.362
MetLys: 2.238 ± 0.647
MetLeu: 1.74 ± 0.645
MetMet: 1.74 ± 0.64
MetAsn: 0.746 ± 0.216
MetPro: 0.249 ± 0.666
MetGln: 1.492 ± 0.55
MetArg: 2.486 ± 1.013
MetSer: 3.481 ± 1.654
MetThr: 0.995 ± 0.354
MetVal: 1.243 ± 0.705
MetTrp: 0.497 ± 0.53
MetTyr: 0.746 ± 0.484
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.497 ± 0.14
AsnCys: 0.746 ± 0.321
AsnAsp: 2.735 ± 0.886
AsnGlu: 3.232 ± 0.854
AsnPhe: 1.74 ± 0.414
AsnGly: 2.486 ± 0.609
AsnHis: 0.995 ± 0.598
AsnIle: 1.243 ± 0.326
AsnLys: 2.486 ± 0.54
AsnLeu: 6.713 ± 1.277
AsnMet: 0.995 ± 0.523
AsnAsn: 0.746 ± 0.321
AsnPro: 1.989 ± 0.426
AsnGln: 2.238 ± 0.716
AsnArg: 1.492 ± 0.903
AsnSer: 3.729 ± 0.908
AsnThr: 1.492 ± 0.421
AsnVal: 1.492 ± 1.487
AsnTrp: 0.497 ± 0.53
AsnTyr: 0.995 ± 0.494
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.995 ± 0.523
ProCys: 0.249 ± 0.161
ProAsp: 2.238 ± 0.586
ProGlu: 4.227 ± 1.106
ProPhe: 1.989 ± 0.708
ProGly: 3.978 ± 0.725
ProHis: 0.497 ± 0.323
ProIle: 1.989 ± 0.895
ProLys: 1.492 ± 0.431
ProLeu: 3.232 ± 0.932
ProMet: 0.746 ± 0.827
ProAsn: 0.995 ± 0.354
ProPro: 0.746 ± 0.216
ProGln: 0.746 ± 0.743
ProArg: 1.243 ± 0.778
ProSer: 3.729 ± 1.361
ProThr: 1.492 ± 0.649
ProVal: 2.984 ± 0.92
ProTrp: 1.243 ± 0.512
ProTyr: 1.243 ± 0.454
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.989 ± 1.242
GlnCys: 1.243 ± 0.41
GlnAsp: 0.746 ± 0.635
GlnGlu: 1.492 ± 0.642
GlnPhe: 1.243 ± 0.52
GlnGly: 2.238 ± 0.576
GlnHis: 1.492 ± 0.431
GlnIle: 2.486 ± 0.438
GlnLys: 2.735 ± 0.722
GlnLeu: 2.735 ± 0.362
GlnMet: 0.497 ± 0.323
GlnAsn: 0.995 ± 0.621
GlnPro: 1.492 ± 0.801
GlnGln: 0.995 ± 0.621
GlnArg: 2.238 ± 1.743
GlnSer: 2.735 ± 1.133
GlnThr: 1.989 ± 1.051
GlnVal: 1.243 ± 0.326
GlnTrp: 0.249 ± 0.212
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.227 ± 1.106
ArgCys: 2.238 ± 0.716
ArgAsp: 2.238 ± 1.136
ArgGlu: 4.475 ± 1.0
ArgPhe: 1.74 ± 0.821
ArgGly: 4.973 ± 2.022
ArgHis: 0.746 ± 0.321
ArgIle: 4.227 ± 1.024
ArgLys: 2.984 ± 1.326
ArgLeu: 3.481 ± 0.307
ArgMet: 1.989 ± 0.562
ArgAsn: 1.989 ± 0.785
ArgPro: 1.492 ± 0.903
ArgGln: 1.492 ± 0.456
ArgArg: 1.989 ± 0.488
ArgSer: 5.719 ± 1.648
ArgThr: 1.492 ± 0.804
ArgVal: 3.481 ± 0.307
ArgTrp: 0.995 ± 0.645
ArgTyr: 1.243 ± 1.019
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.724 ± 1.785
SerCys: 3.729 ± 1.27
SerAsp: 6.216 ± 1.449
SerGlu: 7.956 ± 1.606
SerPhe: 3.232 ± 1.301
SerGly: 5.47 ± 1.578
SerHis: 3.232 ± 0.778
SerIle: 5.719 ± 1.058
SerLys: 6.713 ± 1.19
SerLeu: 9.945 ± 1.421
SerMet: 3.978 ± 0.361
SerAsn: 3.232 ± 0.226
SerPro: 4.227 ± 0.974
SerGln: 3.232 ± 0.926
SerArg: 3.729 ± 1.201
SerSer: 10.94 ± 2.128
SerThr: 3.978 ± 0.898
SerVal: 6.464 ± 1.65
SerTrp: 1.989 ± 0.846
SerTyr: 3.729 ± 0.857
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.238 ± 1.075
ThrCys: 1.243 ± 0.512
ThrAsp: 3.978 ± 1.257
ThrGlu: 2.735 ± 0.625
ThrPhe: 1.243 ± 1.498
ThrGly: 3.729 ± 0.968
ThrHis: 1.243 ± 0.326
ThrIle: 4.475 ± 0.788
ThrLys: 1.989 ± 0.488
ThrLeu: 5.221 ± 1.295
ThrMet: 0.746 ± 1.036
ThrAsn: 1.989 ± 0.535
ThrPro: 0.746 ± 0.216
ThrGln: 1.74 ± 0.581
ThrArg: 1.989 ± 0.426
ThrSer: 6.216 ± 2.398
ThrThr: 1.989 ± 0.535
ThrVal: 3.232 ± 0.568
ThrTrp: 0.497 ± 0.615
ThrTyr: 0.995 ± 0.774
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.724 ± 2.114
ValCys: 0.249 ± 0.161
ValAsp: 2.984 ± 0.994
ValGlu: 2.984 ± 1.107
ValPhe: 2.486 ± 0.777
ValGly: 2.486 ± 0.647
ValHis: 1.243 ± 0.807
ValIle: 3.978 ± 0.748
ValLys: 5.967 ± 0.993
ValLeu: 3.481 ± 1.756
ValMet: 1.74 ± 0.564
ValAsn: 2.735 ± 1.949
ValPro: 1.492 ± 0.662
ValGln: 1.492 ± 0.642
ValArg: 4.724 ± 1.641
ValSer: 6.216 ± 0.516
ValThr: 3.481 ± 0.531
ValVal: 4.227 ± 0.7
ValTrp: 0.746 ± 0.321
ValTyr: 1.243 ± 0.448
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.497 ± 0.503
TrpCys: 0.0 ± 0.0
TrpAsp: 0.249 ± 0.161
TrpGlu: 0.497 ± 0.14
TrpPhe: 0.249 ± 0.161
TrpGly: 1.492 ± 0.771
TrpHis: 0.0 ± 0.0
TrpIle: 0.995 ± 0.525
TrpLys: 0.995 ± 0.523
TrpLeu: 0.995 ± 0.423
TrpMet: 0.497 ± 0.323
TrpAsn: 0.746 ± 0.451
TrpPro: 0.497 ± 0.503
TrpGln: 0.249 ± 0.212
TrpArg: 0.497 ± 0.323
TrpSer: 0.995 ± 0.354
TrpThr: 0.995 ± 0.523
TrpVal: 0.995 ± 0.637
TrpTrp: 0.249 ± 0.161
TrpTyr: 0.249 ± 0.161
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.492 ± 0.379
TyrCys: 0.995 ± 0.525
TyrAsp: 2.238 ± 1.04
TyrGlu: 0.995 ± 0.354
TyrPhe: 1.492 ± 0.801
TyrGly: 0.746 ± 0.451
TyrHis: 0.746 ± 0.484
TyrIle: 1.74 ± 0.564
TyrLys: 2.984 ± 0.777
TyrLeu: 2.486 ± 0.908
TyrMet: 0.746 ± 0.484
TyrAsn: 1.492 ± 0.497
TyrPro: 1.492 ± 0.837
TyrGln: 0.746 ± 1.997
TyrArg: 0.497 ± 0.615
TyrSer: 3.729 ± 0.435
TyrThr: 0.995 ± 0.28
TyrVal: 1.243 ± 0.41
TyrTrp: 0.249 ± 0.212
TyrTyr: 0.746 ± 0.581
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4023 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
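A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. Using the sample standard deviation as the error measure, the fixed seed, and all function names are assumptions; the database's exact estimator is not described here.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences).
toy = ["AAAA", "GGGG"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs gets an error of exactly zero, which matches the "0.0 ± 0.0" entries in the table. With only a handful of proteins per resample, the error can be comparable to the value itself.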

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski