Amino acid dipepetide frequency for Mosqueiro virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.848AlaAla: 1.848 ± 0.772
0.693AlaCys: 0.693 ± 0.32
3.004AlaAsp: 3.004 ± 0.832
0.924AlaGlu: 0.924 ± 0.473
2.773AlaPhe: 2.773 ± 1.537
2.773AlaGly: 2.773 ± 1.571
1.386AlaHis: 1.386 ± 0.519
3.697AlaIle: 3.697 ± 1.013
3.466AlaLys: 3.466 ± 0.535
5.083AlaLeu: 5.083 ± 1.038
0.924AlaMet: 0.924 ± 0.655
2.079AlaAsn: 2.079 ± 0.582
0.693AlaPro: 0.693 ± 0.679
2.773AlaGln: 2.773 ± 0.698
2.542AlaArg: 2.542 ± 1.226
3.235AlaSer: 3.235 ± 0.868
2.542AlaThr: 2.542 ± 0.773
1.848AlaVal: 1.848 ± 0.504
0.693AlaTrp: 0.693 ± 0.37
1.848AlaTyr: 1.848 ± 0.877
0.0AlaXaa: 0.0 ± 0.0
Cys
0.462CysAla: 0.462 ± 0.283
0.231CysCys: 0.231 ± 0.147
0.231CysAsp: 0.231 ± 0.147
1.386CysGlu: 1.386 ± 0.578
1.155CysPhe: 1.155 ± 0.855
1.386CysGly: 1.386 ± 0.484
0.462CysHis: 0.462 ± 0.407
1.386CysIle: 1.386 ± 0.57
0.924CysLys: 0.924 ± 0.393
0.693CysLeu: 0.693 ± 1.074
0.462CysMet: 0.462 ± 0.596
0.462CysAsn: 0.462 ± 0.27
0.693CysPro: 0.693 ± 0.552
0.693CysGln: 0.693 ± 0.316
0.462CysArg: 0.462 ± 0.297
1.617CysSer: 1.617 ± 0.828
0.462CysThr: 0.462 ± 0.3
0.462CysVal: 0.462 ± 0.597
0.462CysTrp: 0.462 ± 0.293
2.079CysTyr: 2.079 ± 0.441
0.0CysXaa: 0.0 ± 0.0
Asp
2.542AspAla: 2.542 ± 1.543
0.693AspCys: 0.693 ± 0.736
3.928AspAsp: 3.928 ± 1.542
4.39AspGlu: 4.39 ± 1.126
2.542AspPhe: 2.542 ± 0.704
3.466AspGly: 3.466 ± 1.111
0.462AspHis: 0.462 ± 0.293
3.004AspIle: 3.004 ± 1.271
3.466AspLys: 3.466 ± 0.881
6.701AspLeu: 6.701 ± 1.41
1.848AspMet: 1.848 ± 0.883
2.542AspAsn: 2.542 ± 0.673
4.159AspPro: 4.159 ± 0.717
3.004AspGln: 3.004 ± 0.802
1.155AspArg: 1.155 ± 0.527
2.773AspSer: 2.773 ± 0.752
1.617AspThr: 1.617 ± 0.616
4.159AspVal: 4.159 ± 1.222
1.386AspTrp: 1.386 ± 0.53
3.697AspTyr: 3.697 ± 1.158
0.0AspXaa: 0.0 ± 0.0
Glu
3.235GluAla: 3.235 ± 0.76
0.693GluCys: 0.693 ± 0.549
3.697GluAsp: 3.697 ± 1.074
4.621GluGlu: 4.621 ± 1.717
2.311GluPhe: 2.311 ± 0.883
2.542GluGly: 2.542 ± 0.56
0.693GluHis: 0.693 ± 0.682
4.621GluIle: 4.621 ± 0.858
4.159GluLys: 4.159 ± 1.326
4.621GluLeu: 4.621 ± 1.307
0.924GluMet: 0.924 ± 0.601
2.079GluAsn: 2.079 ± 0.602
1.386GluPro: 1.386 ± 0.417
0.693GluGln: 0.693 ± 0.634
1.848GluArg: 1.848 ± 0.873
3.466GluSer: 3.466 ± 1.169
4.159GluThr: 4.159 ± 1.377
3.004GluVal: 3.004 ± 0.996
1.155GluTrp: 1.155 ± 0.323
2.773GluTyr: 2.773 ± 0.47
0.0GluXaa: 0.0 ± 0.0
Phe
0.924PheAla: 0.924 ± 0.362
1.155PheCys: 1.155 ± 0.869
2.311PheAsp: 2.311 ± 1.292
1.617PheGlu: 1.617 ± 0.428
2.542PhePhe: 2.542 ± 0.68
3.466PheGly: 3.466 ± 0.628
1.848PheHis: 1.848 ± 0.744
3.235PheIle: 3.235 ± 1.367
4.852PheLys: 4.852 ± 0.602
5.314PheLeu: 5.314 ± 0.931
1.155PheMet: 1.155 ± 0.432
1.848PheAsn: 1.848 ± 0.494
2.773PhePro: 2.773 ± 0.885
1.848PheGln: 1.848 ± 0.575
1.848PheArg: 1.848 ± 0.709
3.004PheSer: 3.004 ± 1.051
1.386PheThr: 1.386 ± 0.433
3.466PheVal: 3.466 ± 0.686
0.231PheTrp: 0.231 ± 0.147
2.079PheTyr: 2.079 ± 0.801
0.0PheXaa: 0.0 ± 0.0
Gly
2.311GlyAla: 2.311 ± 1.348
0.231GlyCys: 0.231 ± 0.147
3.235GlyAsp: 3.235 ± 0.601
2.079GlyGlu: 2.079 ± 1.128
2.542GlyPhe: 2.542 ± 0.755
4.852GlyGly: 4.852 ± 0.549
0.924GlyHis: 0.924 ± 0.4
6.007GlyIle: 6.007 ± 0.962
3.466GlyLys: 3.466 ± 0.525
7.856GlyLeu: 7.856 ± 1.66
1.155GlyMet: 1.155 ± 0.587
3.928GlyAsn: 3.928 ± 1.355
1.617GlyPro: 1.617 ± 0.689
1.386GlyGln: 1.386 ± 0.632
2.542GlyArg: 2.542 ± 1.623
4.159GlySer: 4.159 ± 0.817
3.004GlyThr: 3.004 ± 0.776
3.235GlyVal: 3.235 ± 0.661
0.924GlyTrp: 0.924 ± 0.412
2.773GlyTyr: 2.773 ± 0.795
0.0GlyXaa: 0.0 ± 0.0
His
1.155HisAla: 1.155 ± 0.532
0.231HisCys: 0.231 ± 0.344
2.542HisAsp: 2.542 ± 1.0
1.386HisGlu: 1.386 ± 0.794
0.924HisPhe: 0.924 ± 0.362
0.693HisGly: 0.693 ± 0.404
0.693HisHis: 0.693 ± 0.412
0.924HisIle: 0.924 ± 0.328
1.386HisLys: 1.386 ± 0.487
3.004HisLeu: 3.004 ± 0.904
0.0HisMet: 0.0 ± 0.0
1.386HisAsn: 1.386 ± 0.479
2.311HisPro: 2.311 ± 0.853
0.693HisGln: 0.693 ± 0.399
1.386HisArg: 1.386 ± 0.779
1.155HisSer: 1.155 ± 0.37
1.155HisThr: 1.155 ± 0.709
2.542HisVal: 2.542 ± 0.611
0.462HisTrp: 0.462 ± 0.293
0.462HisTyr: 0.462 ± 0.293
0.0HisXaa: 0.0 ± 0.0
Ile
3.235IleAla: 3.235 ± 0.758
1.848IleCys: 1.848 ± 0.692
4.621IleAsp: 4.621 ± 0.637
3.697IleGlu: 3.697 ± 0.748
2.311IlePhe: 2.311 ± 0.733
6.701IleGly: 6.701 ± 0.871
3.004IleHis: 3.004 ± 1.32
6.47IleIle: 6.47 ± 2.133
4.39IleLys: 4.39 ± 1.378
5.314IleLeu: 5.314 ± 1.266
0.462IleMet: 0.462 ± 0.617
5.314IleAsn: 5.314 ± 1.088
3.697IlePro: 3.697 ± 1.272
2.773IleGln: 2.773 ± 0.917
5.083IleArg: 5.083 ± 0.802
6.701IleSer: 6.701 ± 0.703
3.235IleThr: 3.235 ± 0.421
3.466IleVal: 3.466 ± 1.242
1.848IleTrp: 1.848 ± 0.76
2.311IleTyr: 2.311 ± 0.688
0.0IleXaa: 0.0 ± 0.0
Lys
1.617LysAla: 1.617 ± 0.902
2.079LysCys: 2.079 ± 1.029
2.542LysAsp: 2.542 ± 0.801
2.079LysGlu: 2.079 ± 0.795
2.773LysPhe: 2.773 ± 0.883
3.697LysGly: 3.697 ± 0.778
1.386LysHis: 1.386 ± 0.602
4.621LysIle: 4.621 ± 1.132
5.314LysLys: 5.314 ± 1.468
7.163LysLeu: 7.163 ± 1.585
2.773LysMet: 2.773 ± 1.177
3.466LysAsn: 3.466 ± 1.204
1.386LysPro: 1.386 ± 0.632
1.617LysGln: 1.617 ± 0.564
5.314LysArg: 5.314 ± 1.376
4.621LysSer: 4.621 ± 1.092
2.773LysThr: 2.773 ± 0.67
3.928LysVal: 3.928 ± 1.082
1.155LysTrp: 1.155 ± 0.561
1.617LysTyr: 1.617 ± 0.874
0.0LysXaa: 0.0 ± 0.0
Leu
5.314LeuAla: 5.314 ± 0.983
2.311LeuCys: 2.311 ± 0.687
7.394LeuAsp: 7.394 ± 1.372
6.007LeuGlu: 6.007 ± 1.448
4.159LeuPhe: 4.159 ± 1.154
4.621LeuGly: 4.621 ± 1.227
3.004LeuHis: 3.004 ± 0.605
7.625LeuIle: 7.625 ± 2.454
5.083LeuLys: 5.083 ± 1.456
8.087LeuLeu: 8.087 ± 2.136
3.928LeuMet: 3.928 ± 1.413
6.47LeuAsn: 6.47 ± 1.358
4.621LeuPro: 4.621 ± 0.84
3.466LeuGln: 3.466 ± 0.841
6.238LeuArg: 6.238 ± 0.813
7.163LeuSer: 7.163 ± 1.412
5.083LeuThr: 5.083 ± 1.396
4.621LeuVal: 4.621 ± 0.609
1.617LeuTrp: 1.617 ± 0.759
4.159LeuTyr: 4.159 ± 1.035
0.0LeuXaa: 0.0 ± 0.0
Met
1.386MetAla: 1.386 ± 0.715
0.231MetCys: 0.231 ± 0.147
2.542MetAsp: 2.542 ± 1.315
3.235MetGlu: 3.235 ± 0.762
1.848MetPhe: 1.848 ± 0.655
0.693MetGly: 0.693 ± 0.404
0.693MetHis: 0.693 ± 0.32
3.928MetIle: 3.928 ± 1.431
1.155MetLys: 1.155 ± 0.578
1.386MetLeu: 1.386 ± 0.325
2.079MetMet: 2.079 ± 1.058
1.155MetAsn: 1.155 ± 0.367
0.462MetPro: 0.462 ± 0.302
0.693MetGln: 0.693 ± 0.746
0.924MetArg: 0.924 ± 0.74
2.311MetSer: 2.311 ± 0.883
1.386MetThr: 1.386 ± 0.891
1.155MetVal: 1.155 ± 0.532
0.0MetTrp: 0.0 ± 0.0
0.693MetTyr: 0.693 ± 0.32
0.0MetXaa: 0.0 ± 0.0
Asn
3.235AsnAla: 3.235 ± 0.632
1.155AsnCys: 1.155 ± 0.573
1.386AsnAsp: 1.386 ± 0.367
1.617AsnGlu: 1.617 ± 0.72
2.079AsnPhe: 2.079 ± 0.744
1.386AsnGly: 1.386 ± 0.486
2.311AsnHis: 2.311 ± 0.688
4.39AsnIle: 4.39 ± 1.062
2.542AsnLys: 2.542 ± 0.708
7.625AsnLeu: 7.625 ± 1.997
2.079AsnMet: 2.079 ± 0.709
3.697AsnAsn: 3.697 ± 0.701
3.928AsnPro: 3.928 ± 0.865
3.466AsnGln: 3.466 ± 1.073
1.848AsnArg: 1.848 ± 0.562
3.235AsnSer: 3.235 ± 1.047
2.542AsnThr: 2.542 ± 0.702
4.39AsnVal: 4.39 ± 1.105
1.155AsnTrp: 1.155 ± 0.525
2.773AsnTyr: 2.773 ± 1.191
0.0AsnXaa: 0.0 ± 0.0
Pro
1.617ProAla: 1.617 ± 0.96
0.462ProCys: 0.462 ± 0.439
3.004ProAsp: 3.004 ± 0.93
2.079ProGlu: 2.079 ± 0.886
1.617ProPhe: 1.617 ± 1.049
1.617ProGly: 1.617 ± 1.009
1.155ProHis: 1.155 ± 0.444
3.235ProIle: 3.235 ± 0.762
3.235ProLys: 3.235 ± 1.268
4.159ProLeu: 4.159 ± 0.947
1.386ProMet: 1.386 ± 0.761
3.466ProAsn: 3.466 ± 0.83
2.079ProPro: 2.079 ± 0.986
1.617ProGln: 1.617 ± 0.393
1.848ProArg: 1.848 ± 0.972
4.39ProSer: 4.39 ± 0.859
2.542ProThr: 2.542 ± 0.671
2.542ProVal: 2.542 ± 0.913
0.924ProTrp: 0.924 ± 0.454
1.155ProTyr: 1.155 ± 0.686
0.0ProXaa: 0.0 ± 0.0
Gln
1.155GlnAla: 1.155 ± 0.532
0.231GlnCys: 0.231 ± 0.454
1.848GlnAsp: 1.848 ± 0.884
2.079GlnGlu: 2.079 ± 0.645
2.079GlnPhe: 2.079 ± 0.569
2.542GlnGly: 2.542 ± 1.108
0.462GlnHis: 0.462 ± 0.533
3.466GlnIle: 3.466 ± 1.046
1.386GlnLys: 1.386 ± 0.276
3.004GlnLeu: 3.004 ± 0.498
0.231GlnMet: 0.231 ± 0.298
2.311GlnAsn: 2.311 ± 0.938
1.155GlnPro: 1.155 ± 0.532
1.617GlnGln: 1.617 ± 0.507
1.386GlnArg: 1.386 ± 0.663
2.542GlnSer: 2.542 ± 0.673
2.773GlnThr: 2.773 ± 0.666
2.773GlnVal: 2.773 ± 1.105
1.617GlnTrp: 1.617 ± 0.526
1.155GlnTyr: 1.155 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
2.542ArgAla: 2.542 ± 0.674
0.231ArgCys: 0.231 ± 0.357
3.004ArgAsp: 3.004 ± 1.221
3.004ArgGlu: 3.004 ± 0.68
3.466ArgPhe: 3.466 ± 1.321
3.235ArgGly: 3.235 ± 0.7
1.155ArgHis: 1.155 ± 0.58
2.773ArgIle: 2.773 ± 0.716
2.079ArgLys: 2.079 ± 1.277
4.159ArgLeu: 4.159 ± 1.444
0.924ArgMet: 0.924 ± 0.397
3.235ArgAsn: 3.235 ± 0.61
1.848ArgPro: 1.848 ± 0.49
0.924ArgGln: 0.924 ± 0.414
2.311ArgArg: 2.311 ± 0.996
5.083ArgSer: 5.083 ± 0.768
3.235ArgThr: 3.235 ± 0.777
3.004ArgVal: 3.004 ± 0.866
0.924ArgTrp: 0.924 ± 0.546
1.848ArgTyr: 1.848 ± 0.514
0.0ArgXaa: 0.0 ± 0.0
Ser
3.697SerAla: 3.697 ± 0.628
0.924SerCys: 0.924 ± 0.4
2.079SerAsp: 2.079 ± 0.544
3.235SerGlu: 3.235 ± 1.115
3.928SerPhe: 3.928 ± 0.892
2.542SerGly: 2.542 ± 0.634
2.079SerHis: 2.079 ± 0.523
4.621SerIle: 4.621 ± 1.061
4.621SerLys: 4.621 ± 0.88
9.473SerLeu: 9.473 ± 1.951
2.311SerMet: 2.311 ± 0.81
4.159SerAsn: 4.159 ± 0.742
3.466SerPro: 3.466 ± 0.466
2.079SerGln: 2.079 ± 0.939
3.235SerArg: 3.235 ± 1.039
7.163SerSer: 7.163 ± 1.301
5.314SerThr: 5.314 ± 1.003
3.928SerVal: 3.928 ± 1.015
2.773SerTrp: 2.773 ± 0.736
3.004SerTyr: 3.004 ± 0.785
0.0SerXaa: 0.0 ± 0.0
Thr
2.079ThrAla: 2.079 ± 1.074
0.693ThrCys: 0.693 ± 0.316
2.542ThrAsp: 2.542 ± 1.072
2.542ThrGlu: 2.542 ± 1.11
1.617ThrPhe: 1.617 ± 0.577
4.39ThrGly: 4.39 ± 0.989
1.617ThrHis: 1.617 ± 0.575
3.466ThrIle: 3.466 ± 1.029
2.542ThrLys: 2.542 ± 0.486
6.007ThrLeu: 6.007 ± 1.618
1.617ThrMet: 1.617 ± 0.564
3.235ThrAsn: 3.235 ± 0.541
2.542ThrPro: 2.542 ± 1.574
1.386ThrGln: 1.386 ± 0.702
2.542ThrArg: 2.542 ± 0.691
3.235ThrSer: 3.235 ± 0.554
3.235ThrThr: 3.235 ± 1.171
3.004ThrVal: 3.004 ± 1.005
1.848ThrTrp: 1.848 ± 0.52
2.079ThrTyr: 2.079 ± 0.597
0.0ThrXaa: 0.0 ± 0.0
Val
3.235ValAla: 3.235 ± 1.167
0.924ValCys: 0.924 ± 0.455
3.697ValAsp: 3.697 ± 1.276
1.386ValGlu: 1.386 ± 0.595
2.773ValPhe: 2.773 ± 0.775
3.235ValGly: 3.235 ± 0.939
0.462ValHis: 0.462 ± 0.293
5.314ValIle: 5.314 ± 1.41
3.697ValLys: 3.697 ± 1.217
4.621ValLeu: 4.621 ± 1.438
1.848ValMet: 1.848 ± 0.689
2.311ValAsn: 2.311 ± 0.798
2.773ValPro: 2.773 ± 1.197
2.079ValGln: 2.079 ± 0.524
3.928ValArg: 3.928 ± 0.858
4.159ValSer: 4.159 ± 1.194
2.079ValThr: 2.079 ± 0.604
2.079ValVal: 2.079 ± 0.598
0.231ValTrp: 0.231 ± 0.147
5.083ValTyr: 5.083 ± 0.904
0.0ValXaa: 0.0 ± 0.0
Trp
0.924TrpAla: 0.924 ± 0.414
0.462TrpCys: 0.462 ± 0.407
0.924TrpAsp: 0.924 ± 0.587
3.235TrpGlu: 3.235 ± 0.629
1.617TrpPhe: 1.617 ± 0.729
1.848TrpGly: 1.848 ± 0.716
0.231TrpHis: 0.231 ± 0.147
1.848TrpIle: 1.848 ± 0.613
1.617TrpLys: 1.617 ± 1.061
1.155TrpLeu: 1.155 ± 0.526
0.693TrpMet: 0.693 ± 0.593
0.462TrpAsn: 0.462 ± 0.41
0.462TrpPro: 0.462 ± 0.293
0.0TrpGln: 0.0 ± 0.0
0.462TrpArg: 0.462 ± 0.324
1.386TrpSer: 1.386 ± 0.681
0.693TrpThr: 0.693 ± 0.549
1.155TrpVal: 1.155 ± 0.569
0.462TrpTrp: 0.462 ± 0.596
0.693TrpTyr: 0.693 ± 0.373
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.311TyrAla: 2.311 ± 0.555
0.693TyrCys: 0.693 ± 0.549
3.004TyrAsp: 3.004 ± 0.965
2.079TyrGlu: 2.079 ± 0.794
1.848TyrPhe: 1.848 ± 0.824
2.542TyrGly: 2.542 ± 1.108
0.462TyrHis: 0.462 ± 0.302
2.079TyrIle: 2.079 ± 0.801
2.542TyrLys: 2.542 ± 0.828
5.545TyrLeu: 5.545 ± 1.281
0.924TyrMet: 0.924 ± 0.877
3.235TyrAsn: 3.235 ± 0.751
2.079TyrPro: 2.079 ± 0.64
3.004TyrGln: 3.004 ± 0.928
2.079TyrArg: 2.079 ± 1.26
3.235TyrSer: 3.235 ± 0.999
3.004TyrThr: 3.004 ± 1.199
1.155TyrVal: 1.155 ± 0.506
0.462TyrTrp: 0.462 ± 0.297
0.693TyrTyr: 0.693 ± 0.44
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (4329 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski