Amino acid dipeptide frequency for Rose rosette emaravirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
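As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard letter is mapped to X (Xaa); the function name `dipeptide_permille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: dipeptides are overlapping pairs inside each protein and
    never span two proteins. Non-standard letters are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not the Rose rosette emaravirus proteome).
freqs = dipeptide_permille(["MKKLA", "AKK"])

# Uniform expectation over 400 dipeptides: 2.5 per mille, i.e. 0.25 %.
uniform_permille = 1000.0 / 400
```

With the two toy sequences there are six dipeptides in total, two of which are KK, so `freqs["KK"]` is one third of 1000‰.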

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
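The unit conversion and the comparison against the uniform expectation can be sketched in Python as follows. The helper names are hypothetical, and the simple threshold (above or below 2.5‰, ignoring the error estimate) is an assumption; the exact rule used to colour the original table is not stated here. The three example values are taken from the table below.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the uniform expectation.

    Assumption: a plain comparison against the expectation, without
    taking the bootstrap error into account.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table (mean frequencies in per mille).
examples = {"LysLys": 9.536, "AlaAla": 1.866, "CysCys": 0.0}
labels = {pair: classify(v) for pair, v in examples.items()}
```

Under this assumed rule, LysLys counts as overrepresented, while AlaAla and CysCys count as underrepresented.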

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.866 ± 0.88
AlaCys: 0.622 ± 0.346
AlaAsp: 3.524 ± 0.872
AlaGlu: 1.451 ± 0.552
AlaPhe: 2.488 ± 0.841
AlaGly: 1.244 ± 0.712
AlaHis: 0.829 ± 0.254
AlaIle: 4.768 ± 0.982
AlaLys: 3.317 ± 1.109
AlaLeu: 3.939 ± 1.15
AlaMet: 0.622 ± 0.289
AlaAsn: 2.695 ± 1.016
AlaPro: 1.658 ± 1.333
AlaGln: 1.451 ± 0.601
AlaArg: 1.244 ± 0.343
AlaSer: 2.902 ± 0.885
AlaThr: 1.658 ± 0.768
AlaVal: 2.28 ± 0.865
AlaTrp: 0.207 ± 0.225
AlaTyr: 2.28 ± 0.866
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.622 ± 0.346
CysCys: 0.0 ± 0.0
CysAsp: 1.244 ± 0.779
CysGlu: 0.829 ± 0.293
CysPhe: 0.622 ± 0.389
CysGly: 0.622 ± 0.389
CysHis: 0.0 ± 0.0
CysIle: 1.451 ± 0.507
CysLys: 0.622 ± 0.261
CysLeu: 2.488 ± 0.537
CysMet: 0.415 ± 0.235
CysAsn: 2.28 ± 1.013
CysPro: 0.622 ± 0.223
CysGln: 0.829 ± 0.366
CysArg: 0.415 ± 0.258
CysSer: 1.451 ± 0.468
CysThr: 1.451 ± 0.744
CysVal: 1.036 ± 0.448
CysTrp: 0.0 ± 0.0
CysTyr: 0.622 ± 0.598
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.28 ± 0.78
AspCys: 1.244 ± 0.415
AspAsp: 4.975 ± 0.815
AspGlu: 4.146 ± 0.867
AspPhe: 3.317 ± 0.907
AspGly: 1.658 ± 0.571
AspHis: 1.658 ± 0.543
AspIle: 5.182 ± 0.891
AspLys: 4.353 ± 1.406
AspLeu: 4.353 ± 1.404
AspMet: 1.866 ± 0.581
AspAsn: 2.902 ± 0.72
AspPro: 3.317 ± 0.545
AspGln: 1.036 ± 0.377
AspArg: 2.28 ± 0.499
AspSer: 3.524 ± 1.002
AspThr: 4.353 ± 1.588
AspVal: 6.012 ± 1.622
AspTrp: 0.207 ± 0.363
AspTyr: 4.561 ± 1.081
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.28 ± 1.34
GluCys: 0.829 ± 0.254
GluAsp: 3.317 ± 1.0
GluGlu: 3.317 ± 0.83
GluPhe: 4.353 ± 1.472
GluGly: 0.207 ± 0.298
GluHis: 1.658 ± 0.531
GluIle: 6.219 ± 0.515
GluLys: 5.597 ± 1.219
GluLeu: 5.182 ± 1.728
GluMet: 1.658 ± 0.862
GluAsn: 2.695 ± 1.456
GluPro: 0.829 ± 0.434
GluGln: 1.244 ± 0.357
GluArg: 2.28 ± 0.539
GluSer: 2.902 ± 0.644
GluThr: 3.939 ± 0.904
GluVal: 6.426 ± 1.867
GluTrp: 0.207 ± 0.129
GluTyr: 2.28 ± 0.632
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.036 ± 0.842
PheCys: 1.036 ± 0.566
PheAsp: 2.28 ± 0.686
PheGlu: 0.829 ± 0.265
PhePhe: 2.488 ± 0.894
PheGly: 1.866 ± 0.601
PheHis: 1.658 ± 0.568
PheIle: 4.146 ± 1.027
PheLys: 3.939 ± 0.924
PheLeu: 4.146 ± 0.596
PheMet: 2.28 ± 1.014
PheAsn: 4.353 ± 0.66
PhePro: 2.073 ± 0.88
PheGln: 0.415 ± 0.263
PheArg: 1.036 ± 0.377
PheSer: 4.146 ± 0.529
PheThr: 1.244 ± 0.419
PheVal: 1.658 ± 1.016
PheTrp: 0.415 ± 0.434
PheTyr: 4.768 ± 0.915
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.244 ± 0.761
GlyCys: 1.036 ± 0.78
GlyAsp: 2.073 ± 0.62
GlyGlu: 2.695 ± 0.827
GlyPhe: 1.244 ± 0.362
GlyGly: 0.207 ± 0.129
GlyHis: 0.415 ± 0.258
GlyIle: 2.073 ± 0.87
GlyLys: 3.109 ± 0.693
GlyLeu: 1.658 ± 0.457
GlyMet: 1.036 ± 0.375
GlyAsn: 2.695 ± 1.215
GlyPro: 0.829 ± 0.292
GlyGln: 0.829 ± 0.472
GlyArg: 1.036 ± 0.406
GlySer: 1.658 ± 0.633
GlyThr: 3.109 ± 0.641
GlyVal: 1.451 ± 0.552
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.902 ± 0.569
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.036 ± 0.321
HisCys: 0.622 ± 0.223
HisAsp: 2.28 ± 0.616
HisGlu: 1.451 ± 0.749
HisPhe: 1.036 ± 0.446
HisGly: 1.036 ± 0.386
HisHis: 0.829 ± 0.35
HisIle: 2.073 ± 0.465
HisLys: 1.866 ± 0.89
HisLeu: 3.317 ± 1.186
HisMet: 0.415 ± 0.228
HisAsn: 1.036 ± 0.424
HisPro: 0.622 ± 0.388
HisGln: 1.244 ± 0.402
HisArg: 0.622 ± 0.388
HisSer: 1.658 ± 0.827
HisThr: 0.622 ± 0.284
HisVal: 1.866 ± 0.411
HisTrp: 0.415 ± 0.258
HisTyr: 1.866 ± 0.573
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.353 ± 0.729
IleCys: 2.073 ± 0.635
IleAsp: 8.499 ± 1.953
IleGlu: 7.255 ± 0.725
IlePhe: 3.109 ± 0.861
IleGly: 4.353 ± 1.344
IleHis: 2.695 ± 0.434
IleIle: 8.706 ± 1.666
IleLys: 8.085 ± 0.49
IleLeu: 7.255 ± 1.003
IleMet: 2.488 ± 0.73
IleAsn: 7.048 ± 1.288
IlePro: 4.353 ± 1.368
IleGln: 2.28 ± 0.545
IleArg: 2.488 ± 1.23
IleSer: 8.499 ± 1.194
IleThr: 4.146 ± 1.618
IleVal: 3.731 ± 1.044
IleTrp: 0.415 ± 0.263
IleTyr: 2.695 ± 0.717
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.731 ± 0.807
LysCys: 0.829 ± 0.901
LysAsp: 4.561 ± 0.969
LysGlu: 4.768 ± 1.26
LysPhe: 4.353 ± 0.91
LysGly: 0.829 ± 0.517
LysHis: 2.902 ± 0.768
LysIle: 7.463 ± 1.073
LysLys: 9.536 ± 0.979
LysLeu: 8.292 ± 1.329
LysMet: 1.244 ± 0.596
LysAsn: 7.048 ± 0.925
LysPro: 3.317 ± 0.64
LysGln: 2.28 ± 0.626
LysArg: 3.524 ± 0.646
LysSer: 3.731 ± 0.732
LysThr: 6.012 ± 1.399
LysVal: 5.597 ± 1.03
LysTrp: 0.415 ± 0.258
LysTyr: 5.597 ± 1.368
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.182 ± 2.25
LeuCys: 1.866 ± 0.735
LeuAsp: 4.561 ± 0.617
LeuGlu: 5.804 ± 1.298
LeuPhe: 4.768 ± 0.782
LeuGly: 3.109 ± 0.864
LeuHis: 2.488 ± 0.967
LeuIle: 9.328 ± 2.427
LeuLys: 9.328 ± 0.676
LeuLeu: 7.877 ± 1.252
LeuMet: 1.658 ± 0.574
LeuAsn: 7.463 ± 0.667
LeuPro: 4.146 ± 1.067
LeuGln: 3.317 ± 0.845
LeuArg: 3.939 ± 0.702
LeuSer: 6.633 ± 0.948
LeuThr: 5.39 ± 0.95
LeuVal: 4.353 ± 1.448
LeuTrp: 0.207 ± 0.225
LeuTyr: 3.524 ± 0.695
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.866 ± 0.795
MetCys: 0.207 ± 0.129
MetAsp: 1.244 ± 0.654
MetGlu: 1.244 ± 0.49
MetPhe: 0.622 ± 0.464
MetGly: 1.866 ± 0.778
MetHis: 0.829 ± 0.626
MetIle: 2.488 ± 0.741
MetLys: 2.695 ± 0.489
MetLeu: 2.488 ± 0.727
MetMet: 1.036 ± 0.355
MetAsn: 1.036 ± 0.35
MetPro: 0.415 ± 0.235
MetGln: 1.036 ± 0.508
MetArg: 0.0 ± 0.0
MetSer: 1.658 ± 0.32
MetThr: 1.866 ± 0.345
MetVal: 1.866 ± 0.573
MetTrp: 0.0 ± 0.0
MetTyr: 1.244 ± 0.301
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.109 ± 0.66
AsnCys: 0.622 ± 0.299
AsnAsp: 3.524 ± 0.79
AsnGlu: 4.353 ± 0.955
AsnPhe: 2.902 ± 1.169
AsnGly: 1.244 ± 0.59
AsnHis: 2.28 ± 0.51
AsnIle: 7.255 ± 0.824
AsnLys: 7.048 ± 0.875
AsnLeu: 6.012 ± 1.372
AsnMet: 1.866 ± 0.69
AsnAsn: 3.524 ± 1.086
AsnPro: 2.073 ± 1.048
AsnGln: 1.866 ± 0.43
AsnArg: 2.488 ± 1.007
AsnSer: 7.255 ± 1.604
AsnThr: 3.939 ± 0.7
AsnVal: 4.353 ± 1.157
AsnTrp: 0.829 ± 0.431
AsnTyr: 4.353 ± 0.767
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.866 ± 0.611
ProCys: 0.622 ± 0.45
ProAsp: 3.939 ± 0.723
ProGlu: 2.902 ± 0.5
ProPhe: 2.073 ± 0.65
ProGly: 1.658 ± 0.843
ProHis: 0.0 ± 0.0
ProIle: 2.488 ± 0.537
ProLys: 1.658 ± 0.269
ProLeu: 1.866 ± 0.368
ProMet: 0.829 ± 0.461
ProAsn: 2.695 ± 0.568
ProPro: 0.415 ± 0.263
ProGln: 0.829 ± 0.346
ProArg: 0.829 ± 0.517
ProSer: 2.902 ± 0.537
ProThr: 2.488 ± 0.703
ProVal: 1.244 ± 0.587
ProTrp: 0.0 ± 0.0
ProTyr: 2.28 ± 0.581
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.207 ± 0.279
GlnCys: 0.207 ± 0.129
GlnAsp: 0.829 ± 0.265
GlnGlu: 1.036 ± 0.35
GlnPhe: 1.451 ± 0.5
GlnGly: 1.036 ± 0.805
GlnHis: 0.207 ± 0.129
GlnIle: 3.317 ± 1.063
GlnLys: 2.488 ± 0.793
GlnLeu: 2.28 ± 1.066
GlnMet: 0.829 ± 0.754
GlnAsn: 1.866 ± 0.331
GlnPro: 0.829 ± 0.265
GlnGln: 1.451 ± 0.808
GlnArg: 0.829 ± 0.359
GlnSer: 2.073 ± 0.518
GlnThr: 1.244 ± 0.393
GlnVal: 1.036 ± 0.58
GlnTrp: 0.622 ± 0.447
GlnTyr: 1.451 ± 0.389
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.829 ± 0.364
ArgCys: 0.622 ± 0.306
ArgAsp: 1.451 ± 0.566
ArgGlu: 2.902 ± 0.938
ArgPhe: 1.244 ± 0.578
ArgGly: 0.829 ± 0.62
ArgHis: 0.829 ± 0.517
ArgIle: 3.109 ± 0.914
ArgLys: 1.244 ± 0.92
ArgLeu: 5.39 ± 0.591
ArgMet: 1.036 ± 0.523
ArgAsn: 2.902 ± 0.923
ArgPro: 0.829 ± 0.292
ArgGln: 0.415 ± 0.258
ArgArg: 0.415 ± 0.244
ArgSer: 1.451 ± 0.536
ArgThr: 2.073 ± 0.857
ArgVal: 1.036 ± 0.428
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.524 ± 1.281
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.695 ± 0.309
SerCys: 1.451 ± 0.589
SerAsp: 6.012 ± 0.825
SerGlu: 4.768 ± 0.502
SerPhe: 2.902 ± 0.476
SerGly: 1.866 ± 0.522
SerHis: 3.317 ± 1.106
SerIle: 5.597 ± 0.787
SerLys: 4.353 ± 1.15
SerLeu: 8.292 ± 1.629
SerMet: 2.28 ± 0.452
SerAsn: 5.804 ± 0.405
SerPro: 2.28 ± 0.762
SerGln: 1.244 ± 0.546
SerArg: 1.866 ± 0.641
SerSer: 4.975 ± 0.379
SerThr: 4.146 ± 0.466
SerVal: 3.109 ± 0.694
SerTrp: 0.415 ± 0.42
SerTyr: 3.524 ± 0.711
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.109 ± 1.181
ThrCys: 1.036 ± 0.446
ThrAsp: 3.524 ± 0.699
ThrGlu: 2.695 ± 0.626
ThrPhe: 2.28 ± 0.74
ThrGly: 2.488 ± 0.606
ThrHis: 0.622 ± 0.319
ThrIle: 6.219 ± 1.133
ThrLys: 4.561 ± 0.919
ThrLeu: 6.219 ± 0.713
ThrMet: 0.415 ± 0.244
ThrAsn: 3.317 ± 0.834
ThrPro: 2.073 ± 0.705
ThrGln: 1.866 ± 0.817
ThrArg: 1.866 ± 0.545
ThrSer: 4.975 ± 1.517
ThrThr: 3.524 ± 1.077
ThrVal: 5.182 ± 1.111
ThrTrp: 0.415 ± 0.379
ThrTyr: 4.146 ± 1.102
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.866 ± 0.543
ValCys: 1.451 ± 0.612
ValAsp: 2.902 ± 0.975
ValGlu: 2.073 ± 0.533
ValPhe: 2.902 ± 0.463
ValGly: 1.036 ± 0.35
ValHis: 1.451 ± 0.602
ValIle: 5.804 ± 1.749
ValLys: 4.768 ± 1.962
ValLeu: 6.426 ± 1.789
ValMet: 1.451 ± 0.383
ValAsn: 4.146 ± 0.999
ValPro: 0.829 ± 0.755
ValGln: 1.036 ± 0.39
ValArg: 2.902 ± 0.642
ValSer: 4.975 ± 0.924
ValThr: 4.353 ± 0.759
ValVal: 3.109 ± 0.927
ValTrp: 0.415 ± 0.45
ValTyr: 4.146 ± 0.462
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.415 ± 0.315
TrpCys: 0.207 ± 0.129
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.207 ± 0.225
TrpPhe: 0.207 ± 0.129
TrpGly: 0.207 ± 0.298
TrpHis: 0.0 ± 0.0
TrpIle: 0.415 ± 0.235
TrpLys: 0.829 ± 0.639
TrpLeu: 0.829 ± 0.291
TrpMet: 0.415 ± 0.438
TrpAsn: 0.415 ± 0.595
TrpPro: 0.207 ± 0.129
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.207 ± 0.129
TrpThr: 0.622 ± 0.517
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.415 ± 0.369
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.451 ± 0.905
TyrCys: 1.036 ± 0.417
TyrAsp: 2.488 ± 0.711
TyrGlu: 2.695 ± 0.661
TyrPhe: 1.451 ± 0.327
TyrGly: 3.939 ± 0.884
TyrHis: 1.244 ± 0.574
TyrIle: 6.426 ± 0.743
TyrLys: 6.633 ± 2.088
TyrLeu: 6.633 ± 0.97
TyrMet: 1.658 ± 0.393
TyrAsn: 5.182 ± 0.747
TyrPro: 1.866 ± 0.814
TyrGln: 0.622 ± 0.576
TyrArg: 2.073 ± 0.931
TyrSer: 3.317 ± 0.672
TyrThr: 4.353 ± 1.091
TyrVal: 2.695 ± 0.916
TyrTrp: 0.415 ± 0.515
TyrTyr: 3.317 ± 0.478
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4825 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
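A protein-level bootstrap of this kind might look roughly like the Python sketch below. The exact procedure used by Proteome-pI is not described here, so everything in it is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement; the error is the sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MKKLA", "AKK", "LLKKM"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")
```

A dipeptide that is absent from every protein has a frequency of zero in every resample, so its error is zero, which matches the `0.0 ± 0.0` entries in the table.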

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski