Amino acid dipeptide frequency for Marco virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
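The counting described above can be sketched as follows. This is a minimal illustration, not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for the example. Dipeptides are counted as overlapping pairs within each protein and never across protein boundaries.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 ordered pairs.

    Hypothetical helper: pairs are overlapping windows of length 2,
    counted separately inside each protein sequence.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard alphabet to X (assumption).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not real Marco virus sequences.
toy_proteins = ["MKKLA", "AKKB"]
freqs = dipeptide_frequencies(toy_proteins)

# Under a uniform model each of the 400 standard dipeptides has frequency 1/400.
uniform_expectation = 1 / 400
```

With 7 proteins and 4301 amino acids, this way of counting gives 4301 - 7 = 4294 dipeptides, and a single occurrence then corresponds to about 0.233‰, which matches the smallest non-zero values in the table. That agreement is an observation about the numbers, not a statement from the database about its method.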

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
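As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 1/400. The names `per_mille_to_fraction` and `classify` are hypothetical, and using exactly 2.5‰ as the threshold between overrepresented and underrepresented dipeptides is an assumption; the page does not state the precise rule behind its colouring.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Three values taken from the table on this page.
examples = {"AlaAla": 1.86, "AlaAsp": 2.558, "LeuIle": 9.535}
labels = {name: classify(value) for name, value in examples.items()}

for name, value in examples.items():
    print(name, per_mille_to_fraction(value), labels[name])
```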

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.86AlaAla: 1.86 ± 0.962
0.698AlaCys: 0.698 ± 0.316
2.558AlaAsp: 2.558 ± 0.713
1.395AlaGlu: 1.395 ± 1.113
1.628AlaPhe: 1.628 ± 0.623
1.628AlaGly: 1.628 ± 1.063
0.93AlaHis: 0.93 ± 0.404
4.186AlaIle: 4.186 ± 1.56
1.86AlaLys: 1.86 ± 0.516
3.953AlaLeu: 3.953 ± 0.455
0.698AlaMet: 0.698 ± 0.451
2.326AlaAsn: 2.326 ± 0.491
2.093AlaPro: 2.093 ± 1.961
1.163AlaGln: 1.163 ± 0.413
1.628AlaArg: 1.628 ± 0.771
3.721AlaSer: 3.721 ± 0.816
3.721AlaThr: 3.721 ± 2.061
1.628AlaVal: 1.628 ± 0.55
0.93AlaTrp: 0.93 ± 0.398
1.628AlaTyr: 1.628 ± 0.528
0.0AlaXaa: 0.0 ± 0.0
Cys
0.233CysAla: 0.233 ± 0.25
0.698CysCys: 0.698 ± 0.496
0.698CysAsp: 0.698 ± 0.322
0.93CysGlu: 0.93 ± 0.829
0.698CysPhe: 0.698 ± 0.451
0.698CysGly: 0.698 ± 0.642
0.465CysHis: 0.465 ± 0.221
0.698CysIle: 0.698 ± 0.551
1.86CysLys: 1.86 ± 0.894
1.395CysLeu: 1.395 ± 0.411
0.0CysMet: 0.0 ± 0.0
0.698CysAsn: 0.698 ± 0.451
0.93CysPro: 0.93 ± 0.57
1.163CysGln: 1.163 ± 0.317
0.233CysArg: 0.233 ± 0.138
2.093CysSer: 2.093 ± 0.693
0.698CysThr: 0.698 ± 0.271
0.465CysVal: 0.465 ± 0.277
0.465CysTrp: 0.465 ± 0.275
1.163CysTyr: 1.163 ± 0.359
0.0CysXaa: 0.0 ± 0.0
Asp
2.326AspAla: 2.326 ± 1.514
0.465AspCys: 0.465 ± 0.26
2.558AspAsp: 2.558 ± 1.4
2.791AspGlu: 2.791 ± 0.638
2.791AspPhe: 2.791 ± 0.738
2.326AspGly: 2.326 ± 0.539
2.093AspHis: 2.093 ± 0.36
3.023AspIle: 3.023 ± 0.521
4.884AspLys: 4.884 ± 0.813
6.977AspLeu: 6.977 ± 1.149
1.86AspMet: 1.86 ± 0.53
2.791AspAsn: 2.791 ± 0.571
3.721AspPro: 3.721 ± 0.682
3.256AspGln: 3.256 ± 1.573
2.791AspArg: 2.791 ± 0.674
3.721AspSer: 3.721 ± 0.739
1.86AspThr: 1.86 ± 0.548
2.558AspVal: 2.558 ± 0.907
0.93AspTrp: 0.93 ± 0.389
1.163AspTyr: 1.163 ± 0.483
0.0AspXaa: 0.0 ± 0.0
Glu
2.558GluAla: 2.558 ± 0.81
0.465GluCys: 0.465 ± 0.221
3.023GluAsp: 3.023 ± 1.584
3.721GluGlu: 3.721 ± 1.364
4.186GluPhe: 4.186 ± 1.174
3.256GluGly: 3.256 ± 0.522
0.465GluHis: 0.465 ± 0.5
4.884GluIle: 4.884 ± 1.704
5.581GluLys: 5.581 ± 0.431
5.349GluLeu: 5.349 ± 1.329
2.558GluMet: 2.558 ± 0.655
2.326GluAsn: 2.326 ± 0.499
3.256GluPro: 3.256 ± 1.148
1.86GluGln: 1.86 ± 0.715
0.698GluArg: 0.698 ± 0.271
6.512GluSer: 6.512 ± 0.899
2.326GluThr: 2.326 ± 0.709
3.023GluVal: 3.023 ± 0.905
0.698GluTrp: 0.698 ± 0.324
2.558GluTyr: 2.558 ± 0.544
0.0GluXaa: 0.0 ± 0.0
Phe
0.93PheAla: 0.93 ± 0.656
0.93PheCys: 0.93 ± 0.554
2.558PheAsp: 2.558 ± 0.829
3.488PheGlu: 3.488 ± 0.852
2.326PhePhe: 2.326 ± 0.762
2.093PheGly: 2.093 ± 0.609
0.465PheHis: 0.465 ± 0.277
3.023PheIle: 3.023 ± 0.879
4.651PheLys: 4.651 ± 1.831
3.721PheLeu: 3.721 ± 0.978
0.233PheMet: 0.233 ± 0.138
2.326PheAsn: 2.326 ± 0.876
3.488PhePro: 3.488 ± 0.975
3.256PheGln: 3.256 ± 0.693
2.093PheArg: 2.093 ± 1.013
2.558PheSer: 2.558 ± 0.779
1.628PheThr: 1.628 ± 0.73
2.558PheVal: 2.558 ± 1.203
0.93PheTrp: 0.93 ± 0.554
1.163PheTyr: 1.163 ± 0.394
0.0PheXaa: 0.0 ± 0.0
Gly
1.395GlyAla: 1.395 ± 0.643
0.698GlyCys: 0.698 ± 0.271
2.093GlyAsp: 2.093 ± 0.994
2.558GlyGlu: 2.558 ± 0.555
2.326GlyPhe: 2.326 ± 0.333
2.326GlyGly: 2.326 ± 0.892
1.395GlyHis: 1.395 ± 0.779
5.814GlyIle: 5.814 ± 1.04
4.419GlyLys: 4.419 ± 1.43
6.744GlyLeu: 6.744 ± 1.558
1.395GlyMet: 1.395 ± 0.477
1.86GlyAsn: 1.86 ± 0.564
1.86GlyPro: 1.86 ± 0.555
2.326GlyGln: 2.326 ± 0.796
0.698GlyArg: 0.698 ± 0.271
5.581GlySer: 5.581 ± 0.59
2.558GlyThr: 2.558 ± 0.757
3.023GlyVal: 3.023 ± 1.042
0.93GlyTrp: 0.93 ± 0.369
2.093GlyTyr: 2.093 ± 0.49
0.0GlyXaa: 0.0 ± 0.0
His
1.163HisAla: 1.163 ± 0.394
0.0HisCys: 0.0 ± 0.0
0.93HisAsp: 0.93 ± 0.557
1.86HisGlu: 1.86 ± 0.65
0.698HisPhe: 0.698 ± 0.415
0.465HisGly: 0.465 ± 0.277
0.698HisHis: 0.698 ± 0.415
1.628HisIle: 1.628 ± 0.528
2.558HisLys: 2.558 ± 0.681
1.395HisLeu: 1.395 ± 0.589
0.465HisMet: 0.465 ± 0.249
1.395HisAsn: 1.395 ± 0.623
1.86HisPro: 1.86 ± 0.882
0.93HisGln: 0.93 ± 0.454
1.395HisArg: 1.395 ± 0.403
1.395HisSer: 1.395 ± 0.411
0.698HisThr: 0.698 ± 0.322
1.395HisVal: 1.395 ± 0.831
0.233HisTrp: 0.233 ± 0.138
1.163HisTyr: 1.163 ± 0.442
0.0HisXaa: 0.0 ± 0.0
Ile
2.093IleAla: 2.093 ± 0.815
1.628IleCys: 1.628 ± 0.56
3.721IleAsp: 3.721 ± 0.862
5.814IleGlu: 5.814 ± 1.147
2.791IlePhe: 2.791 ± 0.907
6.047IleGly: 6.047 ± 0.871
1.163IleHis: 1.163 ± 0.472
6.744IleIle: 6.744 ± 1.858
5.814IleLys: 5.814 ± 1.316
6.977IleLeu: 6.977 ± 1.022
1.86IleMet: 1.86 ± 0.521
5.814IleAsn: 5.814 ± 1.74
3.953IlePro: 3.953 ± 0.937
3.256IleGln: 3.256 ± 1.292
4.186IleArg: 4.186 ± 1.028
8.372IleSer: 8.372 ± 0.427
5.581IleThr: 5.581 ± 0.594
4.884IleVal: 4.884 ± 1.32
1.163IleTrp: 1.163 ± 0.467
2.326IleTyr: 2.326 ± 0.629
0.0IleXaa: 0.0 ± 0.0
Lys
3.023LysAla: 3.023 ± 0.988
0.93LysCys: 0.93 ± 0.441
4.884LysAsp: 4.884 ± 1.162
4.651LysGlu: 4.651 ± 0.705
2.093LysPhe: 2.093 ± 0.976
4.419LysGly: 4.419 ± 0.96
1.628LysHis: 1.628 ± 0.418
9.07LysIle: 9.07 ± 1.351
7.907LysLys: 7.907 ± 3.685
6.279LysLeu: 6.279 ± 0.863
2.558LysMet: 2.558 ± 1.064
1.628LysAsn: 1.628 ± 0.528
4.186LysPro: 4.186 ± 2.636
2.326LysGln: 2.326 ± 0.712
3.023LysArg: 3.023 ± 0.744
6.047LysSer: 6.047 ± 1.25
5.116LysThr: 5.116 ± 0.614
4.884LysVal: 4.884 ± 1.22
1.163LysTrp: 1.163 ± 0.499
1.395LysTyr: 1.395 ± 0.522
0.0LysXaa: 0.0 ± 0.0
Leu
4.651LeuAla: 4.651 ± 0.869
1.86LeuCys: 1.86 ± 0.661
5.814LeuAsp: 5.814 ± 0.856
5.814LeuGlu: 5.814 ± 0.782
4.419LeuPhe: 4.419 ± 0.738
7.442LeuGly: 7.442 ± 1.527
1.395LeuHis: 1.395 ± 0.636
9.535LeuIle: 9.535 ± 1.997
6.512LeuLys: 6.512 ± 0.881
8.605LeuLeu: 8.605 ± 2.015
2.791LeuMet: 2.791 ± 0.694
6.047LeuAsn: 6.047 ± 1.153
2.558LeuPro: 2.558 ± 0.437
3.488LeuGln: 3.488 ± 0.986
4.651LeuArg: 4.651 ± 0.887
7.442LeuSer: 7.442 ± 1.429
5.349LeuThr: 5.349 ± 0.76
4.651LeuVal: 4.651 ± 0.48
1.163LeuTrp: 1.163 ± 0.825
1.628LeuTyr: 1.628 ± 0.526
0.0LeuXaa: 0.0 ± 0.0
Met
1.163MetAla: 1.163 ± 0.496
0.93MetCys: 0.93 ± 0.326
2.326MetAsp: 2.326 ± 0.989
2.093MetGlu: 2.093 ± 0.794
1.163MetPhe: 1.163 ± 0.514
1.628MetGly: 1.628 ± 0.431
0.233MetHis: 0.233 ± 0.138
1.628MetIle: 1.628 ± 0.532
1.395MetLys: 1.395 ± 0.561
2.791MetLeu: 2.791 ± 0.588
0.465MetMet: 0.465 ± 0.277
0.93MetAsn: 0.93 ± 0.372
0.93MetPro: 0.93 ± 0.302
0.93MetGln: 0.93 ± 0.301
0.0MetArg: 0.0 ± 0.0
2.326MetSer: 2.326 ± 0.958
2.326MetThr: 2.326 ± 0.656
1.628MetVal: 1.628 ± 0.738
0.465MetTrp: 0.465 ± 0.395
1.163MetTyr: 1.163 ± 0.51
0.0MetXaa: 0.0 ± 0.0
Asn
2.326AsnAla: 2.326 ± 1.02
1.395AsnCys: 1.395 ± 0.662
2.791AsnAsp: 2.791 ± 0.925
3.488AsnGlu: 3.488 ± 0.645
2.558AsnPhe: 2.558 ± 0.586
2.093AsnGly: 2.093 ± 0.471
2.093AsnHis: 2.093 ± 0.561
3.488AsnIle: 3.488 ± 0.95
3.721AsnLys: 3.721 ± 0.642
7.907AsnLeu: 7.907 ± 1.871
1.163AsnMet: 1.163 ± 0.612
3.953AsnAsn: 3.953 ± 0.961
2.791AsnPro: 2.791 ± 1.006
3.256AsnGln: 3.256 ± 0.705
0.93AsnArg: 0.93 ± 0.843
3.256AsnSer: 3.256 ± 0.765
2.093AsnThr: 2.093 ± 0.373
1.395AsnVal: 1.395 ± 0.918
2.558AsnTrp: 2.558 ± 0.688
1.86AsnTyr: 1.86 ± 0.961
0.0AsnXaa: 0.0 ± 0.0
Pro
2.326ProAla: 2.326 ± 0.6
0.465ProCys: 0.465 ± 0.5
3.953ProAsp: 3.953 ± 1.675
3.256ProGlu: 3.256 ± 1.828
1.163ProPhe: 1.163 ± 0.824
1.395ProGly: 1.395 ± 0.277
0.233ProHis: 0.233 ± 0.138
2.326ProIle: 2.326 ± 0.796
4.419ProLys: 4.419 ± 2.448
3.023ProLeu: 3.023 ± 1.046
1.86ProMet: 1.86 ± 0.583
5.116ProAsn: 5.116 ± 1.899
2.093ProPro: 2.093 ± 0.572
1.86ProGln: 1.86 ± 0.573
2.558ProArg: 2.558 ± 0.551
2.791ProSer: 2.791 ± 1.017
2.791ProThr: 2.791 ± 0.607
2.791ProVal: 2.791 ± 1.129
0.465ProTrp: 0.465 ± 0.249
2.558ProTyr: 2.558 ± 0.486
0.0ProXaa: 0.0 ± 0.0
Gln
1.86GlnAla: 1.86 ± 1.051
0.233GlnCys: 0.233 ± 0.344
2.326GlnAsp: 2.326 ± 1.253
2.791GlnGlu: 2.791 ± 0.956
2.326GlnPhe: 2.326 ± 0.592
1.395GlnGly: 1.395 ± 0.614
1.163GlnHis: 1.163 ± 0.487
3.488GlnIle: 3.488 ± 0.461
4.419GlnLys: 4.419 ± 0.549
3.721GlnLeu: 3.721 ± 0.604
0.698GlnMet: 0.698 ± 0.312
1.163GlnAsn: 1.163 ± 0.471
0.465GlnPro: 0.465 ± 0.277
0.93GlnGln: 0.93 ± 0.42
1.628GlnArg: 1.628 ± 0.523
3.256GlnSer: 3.256 ± 0.519
2.558GlnThr: 2.558 ± 0.541
2.558GlnVal: 2.558 ± 0.996
0.93GlnTrp: 0.93 ± 0.369
0.698GlnTyr: 0.698 ± 0.556
0.0GlnXaa: 0.0 ± 0.0
Arg
1.628ArgAla: 1.628 ± 0.845
0.93ArgCys: 0.93 ± 0.359
1.628ArgAsp: 1.628 ± 0.445
3.023ArgGlu: 3.023 ± 1.036
3.256ArgPhe: 3.256 ± 0.753
1.628ArgGly: 1.628 ± 0.529
0.465ArgHis: 0.465 ± 0.277
3.023ArgIle: 3.023 ± 0.899
1.86ArgLys: 1.86 ± 0.384
2.326ArgLeu: 2.326 ± 0.814
0.465ArgMet: 0.465 ± 0.26
2.326ArgAsn: 2.326 ± 0.845
1.395ArgPro: 1.395 ± 0.332
1.395ArgGln: 1.395 ± 0.416
2.093ArgArg: 2.093 ± 0.525
4.419ArgSer: 4.419 ± 0.792
2.558ArgThr: 2.558 ± 0.609
2.326ArgVal: 2.326 ± 0.543
0.698ArgTrp: 0.698 ± 0.327
1.163ArgTyr: 1.163 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
3.721SerAla: 3.721 ± 0.946
1.628SerCys: 1.628 ± 0.695
4.186SerAsp: 4.186 ± 0.736
4.419SerGlu: 4.419 ± 0.897
3.256SerPhe: 3.256 ± 0.915
4.419SerGly: 4.419 ± 1.053
3.721SerHis: 3.721 ± 0.812
6.512SerIle: 6.512 ± 1.002
5.116SerLys: 5.116 ± 1.032
8.837SerLeu: 8.837 ± 1.26
1.86SerMet: 1.86 ± 0.808
4.419SerAsn: 4.419 ± 1.378
3.953SerPro: 3.953 ± 0.844
2.326SerGln: 2.326 ± 0.487
5.116SerArg: 5.116 ± 0.926
5.814SerSer: 5.814 ± 1.432
6.744SerThr: 6.744 ± 0.735
6.047SerVal: 6.047 ± 1.782
1.395SerTrp: 1.395 ± 0.603
2.558SerTyr: 2.558 ± 0.681
0.0SerXaa: 0.0 ± 0.0
Thr
2.093ThrAla: 2.093 ± 1.109
0.698ThrCys: 0.698 ± 0.415
3.488ThrAsp: 3.488 ± 0.88
2.093ThrGlu: 2.093 ± 0.564
1.628ThrPhe: 1.628 ± 0.763
3.721ThrGly: 3.721 ± 1.446
1.395ThrHis: 1.395 ± 0.467
5.814ThrIle: 5.814 ± 1.033
3.488ThrLys: 3.488 ± 1.991
4.884ThrLeu: 4.884 ± 0.893
2.326ThrMet: 2.326 ± 0.909
3.256ThrAsn: 3.256 ± 0.678
3.023ThrPro: 3.023 ± 1.175
1.628ThrGln: 1.628 ± 0.585
1.395ThrArg: 1.395 ± 0.589
3.488ThrSer: 3.488 ± 0.669
2.326ThrThr: 2.326 ± 0.672
2.326ThrVal: 2.326 ± 0.609
2.326ThrTrp: 2.326 ± 0.487
3.256ThrTyr: 3.256 ± 0.459
0.0ThrXaa: 0.0 ± 0.0
Val
2.326ValAla: 2.326 ± 0.889
0.93ValCys: 0.93 ± 0.399
2.326ValAsp: 2.326 ± 0.655
2.326ValGlu: 2.326 ± 0.802
2.558ValPhe: 2.558 ± 0.801
2.093ValGly: 2.093 ± 1.026
0.698ValHis: 0.698 ± 0.324
5.116ValIle: 5.116 ± 1.5
2.558ValLys: 2.558 ± 0.599
5.814ValLeu: 5.814 ± 1.374
1.628ValMet: 1.628 ± 0.575
2.791ValAsn: 2.791 ± 0.606
2.791ValPro: 2.791 ± 1.196
2.326ValGln: 2.326 ± 0.528
1.628ValArg: 1.628 ± 0.419
6.279ValSer: 6.279 ± 0.748
2.558ValThr: 2.558 ± 1.077
2.093ValVal: 2.093 ± 0.819
0.93ValTrp: 0.93 ± 0.554
2.326ValTyr: 2.326 ± 0.799
0.0ValXaa: 0.0 ± 0.0
Trp
1.163TrpAla: 1.163 ± 0.576
0.465TrpCys: 0.465 ± 0.402
1.163TrpAsp: 1.163 ± 0.514
0.93TrpGlu: 0.93 ± 0.554
0.93TrpPhe: 0.93 ± 0.369
1.86TrpGly: 1.86 ± 0.649
0.465TrpHis: 0.465 ± 0.275
1.395TrpIle: 1.395 ± 0.857
1.163TrpLys: 1.163 ± 0.314
0.465TrpLeu: 0.465 ± 0.471
0.93TrpMet: 0.93 ± 0.414
1.628TrpAsn: 1.628 ± 0.532
0.233TrpPro: 0.233 ± 0.138
0.233TrpGln: 0.233 ± 0.138
0.698TrpArg: 0.698 ± 0.415
3.023TrpSer: 3.023 ± 0.675
0.465TrpThr: 0.465 ± 0.354
0.698TrpVal: 0.698 ± 0.271
0.233TrpTrp: 0.233 ± 0.25
0.698TrpTyr: 0.698 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.395TyrAla: 1.395 ± 0.332
0.233TyrCys: 0.233 ± 0.286
1.86TyrAsp: 1.86 ± 0.892
1.86TyrGlu: 1.86 ± 0.324
1.628TyrPhe: 1.628 ± 0.293
1.163TyrGly: 1.163 ± 0.499
1.163TyrHis: 1.163 ± 0.317
2.791TyrIle: 2.791 ± 0.683
3.023TyrLys: 3.023 ± 0.771
4.651TyrLeu: 4.651 ± 0.806
0.465TyrMet: 0.465 ± 0.277
2.326TyrAsn: 2.326 ± 0.462
1.86TyrPro: 1.86 ± 0.677
0.698TyrGln: 0.698 ± 0.61
1.163TyrArg: 1.163 ± 0.903
3.953TyrSer: 3.953 ± 0.401
0.698TyrThr: 0.698 ± 0.316
1.395TyrVal: 1.395 ± 0.416
0.233TyrTrp: 0.233 ± 0.25
2.791TyrTyr: 2.791 ± 0.786
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4301 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
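Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions: the helper names are hypothetical, the sequences are toy data, and taking the sample standard deviation as the reported error is a guess, since the page does not say which spread measure it uses.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Each replicate draws len(proteins) whole proteins with replacement.
    Returning the standard deviation is an assumption of this sketch.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome.
toy_proteins = ["AKKA", "GGGG", "KKKK"]
point_estimate = dipeptide_per_mille(toy_proteins, "KK")
error = bootstrap_error(toy_proteins, "KK")
print(f"KK: {point_estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within one protein together, so with only 7 proteins the errors can be large compared with the values themselves.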

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski