Amino acid dipeptide frequency for Moju virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
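The counting described above can be sketched in Python. This is a minimal illustration, not the pipeline used by the database: it assumes overlapping windows of two residues, no pairs spanning two proteins, and any non-standard letter mapped to X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
EXPECTED_PERMILLE = 1000.0 / 400   # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"    # would be marked in red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"   # would be marked in blue
    return "expected"


# Hypothetical toy sequences, not the Moju virus proteome.
toy = ["MKLLA", "AKLB"]
freqs = dipeptide_permille(toy)
```

With values from the table, `classify(5.101)` (AlaLys) gives `"over"` and `classify(0.255)` (AlaMet) gives `"under"`.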

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
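As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate occurrence count. The count assumes that the frequencies were normalized by the number of dipeptide positions, taken here as residues minus proteins (3922 - 3 = 3919, using the totals given in the statistics line below the table); that normalization is an inference, not something the page states. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def approx_count(value_permille, n_residues, n_proteins):
    """Estimate how many times a dipeptide occurs.

    Assumption: a protein of length L contributes L - 1 dipeptides,
    so the total is n_residues - n_proteins.
    """
    n_dipeptides = n_residues - n_proteins
    return round(permille_to_fraction(value_permille) * n_dipeptides)


fraction = permille_to_fraction(2.04)   # AlaAla: 0.00204
count = approx_count(2.04, 3922, 3)     # about 8 occurrences
```

Under the same assumption, the smallest non-zero value in the table, 0.255‰, corresponds to a single occurrence.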

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.04 ± 3.33
AlaCys: 1.53 ± 0.79
AlaAsp: 1.53 ± 0.482
AlaGlu: 4.591 ± 2.504
AlaPhe: 1.785 ± 0.838
AlaGly: 2.04 ± 1.072
AlaHis: 1.02 ± 0.657
AlaIle: 3.315 ± 0.693
AlaLys: 5.101 ± 1.083
AlaLeu: 4.846 ± 2.453
AlaMet: 0.255 ± 0.244
AlaAsn: 4.846 ± 1.404
AlaPro: 1.785 ± 2.041
AlaGln: 2.04 ± 0.675
AlaArg: 3.571 ± 2.795
AlaSer: 3.826 ± 0.237
AlaThr: 2.04 ± 0.729
AlaVal: 1.785 ± 0.735
AlaTrp: 0.255 ± 0.164
AlaTyr: 3.315 ± 1.685
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.275 ± 0.562
CysCys: 0.255 ± 0.164
CysAsp: 0.765 ± 0.241
CysGlu: 1.275 ± 1.22
CysPhe: 1.785 ± 1.025
CysGly: 2.805 ± 1.349
CysHis: 1.53 ± 0.79
CysIle: 3.315 ± 1.814
CysLys: 2.04 ± 1.601
CysLeu: 2.55 ± 1.125
CysMet: 0.765 ± 0.395
CysAsn: 2.805 ± 1.086
CysPro: 1.785 ± 0.735
CysGln: 0.765 ± 0.241
CysArg: 0.765 ± 0.732
CysSer: 1.275 ± 0.392
CysThr: 1.275 ± 0.873
CysVal: 1.53 ± 1.115
CysTrp: 0.255 ± 0.164
CysTyr: 0.51 ± 0.179
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.785 ± 0.761
AspCys: 1.275 ± 0.562
AspAsp: 2.295 ± 0.731
AspGlu: 3.826 ± 1.566
AspPhe: 4.081 ± 1.998
AspGly: 1.275 ± 0.392
AspHis: 1.02 ± 0.985
AspIle: 4.846 ± 1.615
AspLys: 4.336 ± 1.339
AspLeu: 5.866 ± 1.774
AspMet: 2.55 ± 1.424
AspAsn: 3.06 ± 0.948
AspPro: 2.04 ± 0.744
AspGln: 2.295 ± 0.91
AspArg: 2.55 ± 1.991
AspSer: 1.53 ± 1.116
AspThr: 2.805 ± 1.18
AspVal: 2.295 ± 1.001
AspTrp: 0.51 ± 0.328
AspTyr: 1.53 ± 0.482
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.826 ± 1.452
GluCys: 1.275 ± 0.562
GluAsp: 2.55 ± 0.843
GluGlu: 5.101 ± 1.544
GluPhe: 6.886 ± 3.44
GluGly: 1.785 ± 0.559
GluHis: 3.315 ± 0.911
GluIle: 7.396 ± 2.305
GluLys: 3.06 ± 0.948
GluLeu: 6.631 ± 1.25
GluMet: 3.06 ± 1.614
GluAsn: 4.336 ± 1.041
GluPro: 2.04 ± 0.943
GluGln: 3.06 ± 0.948
GluArg: 3.826 ± 1.305
GluSer: 3.826 ± 0.962
GluThr: 3.315 ± 1.263
GluVal: 2.55 ± 1.419
GluTrp: 0.255 ± 0.164
GluTyr: 2.55 ± 1.044
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.805 ± 0.907
PheCys: 1.785 ± 1.149
PheAsp: 3.315 ± 1.015
PheGlu: 2.295 ± 1.911
PhePhe: 3.06 ± 1.998
PheGly: 2.805 ± 1.947
PheHis: 1.02 ± 0.372
PheIle: 3.571 ± 0.451
PheLys: 5.356 ± 1.64
PheLeu: 6.121 ± 7.191
PheMet: 1.785 ± 0.559
PheAsn: 2.805 ± 1.199
PhePro: 1.275 ± 0.932
PheGln: 1.53 ± 0.678
PheArg: 2.04 ± 1.236
PheSer: 5.101 ± 1.686
PheThr: 3.315 ± 1.015
PheVal: 1.785 ± 0.605
PheTrp: 0.765 ± 0.492
PheTyr: 2.295 ± 1.185
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.275 ± 0.392
GlyCys: 2.805 ± 1.987
GlyAsp: 3.315 ± 1.656
GlyGlu: 3.826 ± 1.26
GlyPhe: 0.51 ± 0.488
GlyGly: 0.255 ± 0.164
GlyHis: 0.51 ± 0.179
GlyIle: 5.611 ± 2.232
GlyLys: 2.805 ± 1.086
GlyLeu: 3.826 ± 0.591
GlyMet: 1.02 ± 0.357
GlyAsn: 3.571 ± 1.676
GlyPro: 1.275 ± 0.562
GlyGln: 1.275 ± 0.522
GlyArg: 2.295 ± 0.671
GlySer: 1.53 ± 1.115
GlyThr: 2.55 ± 1.925
GlyVal: 1.785 ± 1.025
GlyTrp: 1.02 ± 0.357
GlyTyr: 1.785 ± 0.782
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.275 ± 1.272
HisCys: 0.765 ± 0.732
HisAsp: 0.765 ± 1.061
HisGlu: 0.765 ± 0.395
HisPhe: 1.53 ± 0.985
HisGly: 2.04 ± 0.729
HisHis: 0.765 ± 1.061
HisIle: 0.765 ± 0.492
HisLys: 2.04 ± 0.999
HisLeu: 2.295 ± 1.886
HisMet: 1.02 ± 0.372
HisAsn: 1.53 ± 0.678
HisPro: 1.275 ± 0.873
HisGln: 0.51 ± 0.328
HisArg: 0.765 ± 1.061
HisSer: 1.53 ± 0.482
HisThr: 1.275 ± 0.562
HisVal: 1.53 ± 0.536
HisTrp: 0.51 ± 0.328
HisTyr: 1.02 ± 0.357
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.101 ± 0.355
IleCys: 2.55 ± 1.745
IleAsp: 3.826 ± 0.562
IleGlu: 6.376 ± 1.958
IlePhe: 4.081 ± 0.811
IleGly: 4.846 ± 1.141
IleHis: 2.04 ± 0.999
IleIle: 5.866 ± 1.925
IleLys: 7.396 ± 2.305
IleLeu: 8.926 ± 3.222
IleMet: 1.53 ± 0.536
IleAsn: 4.081 ± 1.322
IlePro: 4.336 ± 1.417
IleGln: 3.315 ± 0.565
IleArg: 4.591 ± 1.58
IleSer: 6.631 ± 1.445
IleThr: 4.081 ± 1.457
IleVal: 3.315 ± 0.302
IleTrp: 0.51 ± 0.328
IleTyr: 1.275 ± 2.2
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.081 ± 1.186
LysCys: 2.295 ± 1.185
LysAsp: 6.121 ± 1.93
LysGlu: 6.376 ± 1.966
LysPhe: 3.826 ± 1.481
LysGly: 4.336 ± 1.447
LysHis: 2.295 ± 1.886
LysIle: 6.376 ± 2.021
LysLys: 5.356 ± 1.323
LysLeu: 7.141 ± 0.405
LysMet: 3.571 ± 1.401
LysAsn: 3.826 ± 1.775
LysPro: 2.295 ± 0.891
LysGln: 3.06 ± 1.998
LysArg: 1.785 ± 0.605
LysSer: 3.06 ± 0.948
LysThr: 7.396 ± 1.064
LysVal: 2.295 ± 0.724
LysTrp: 0.51 ± 0.328
LysTyr: 2.805 ± 0.863
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.356 ± 4.917
LeuCys: 1.785 ± 0.735
LeuAsp: 5.101 ± 1.686
LeuGlu: 7.651 ± 2.236
LeuPhe: 4.081 ± 1.251
LeuGly: 2.805 ± 0.893
LeuHis: 2.805 ± 0.436
LeuIle: 6.886 ± 0.966
LeuLys: 8.161 ± 3.997
LeuLeu: 7.906 ± 2.421
LeuMet: 2.295 ± 0.945
LeuAsn: 6.376 ± 1.404
LeuPro: 3.826 ± 1.452
LeuGln: 2.55 ± 1.044
LeuArg: 3.571 ± 1.522
LeuSer: 6.886 ± 0.667
LeuThr: 3.571 ± 1.497
LeuVal: 4.081 ± 1.885
LeuTrp: 0.255 ± 0.164
LeuTyr: 4.081 ± 1.289
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.53 ± 0.825
MetCys: 1.02 ± 0.632
MetAsp: 2.55 ± 0.843
MetGlu: 2.295 ± 0.733
MetPhe: 1.53 ± 0.907
MetGly: 1.02 ± 0.632
MetHis: 0.765 ± 0.395
MetIle: 3.315 ± 1.263
MetLys: 2.04 ± 0.729
MetLeu: 2.805 ± 0.842
MetMet: 1.275 ± 0.522
MetAsn: 1.02 ± 0.372
MetPro: 1.53 ± 0.907
MetGln: 0.51 ± 0.179
MetArg: 1.02 ± 0.985
MetSer: 3.571 ± 0.805
MetThr: 1.275 ± 0.392
MetVal: 1.53 ± 0.482
MetTrp: 0.0 ± 0.0
MetTyr: 0.51 ± 0.179
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.571 ± 1.635
AsnCys: 1.53 ± 0.79
AsnAsp: 4.081 ± 1.488
AsnGlu: 3.826 ± 0.932
AsnPhe: 2.295 ± 0.594
AsnGly: 0.765 ± 0.732
AsnHis: 2.04 ± 0.943
AsnIle: 5.101 ± 0.934
AsnLys: 6.121 ± 1.876
AsnLeu: 4.846 ± 0.422
AsnMet: 2.55 ± 1.83
AsnAsn: 4.081 ± 1.643
AsnPro: 3.315 ± 0.302
AsnGln: 2.04 ± 0.625
AsnArg: 2.295 ± 0.724
AsnSer: 2.805 ± 1.086
AsnThr: 3.315 ± 1.048
AsnVal: 2.55 ± 0.551
AsnTrp: 0.255 ± 0.164
AsnTyr: 3.06 ± 1.072
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.55 ± 1.865
ProCys: 0.255 ± 0.244
ProAsp: 1.53 ± 0.536
ProGlu: 3.06 ± 0.965
ProPhe: 2.55 ± 1.125
ProGly: 2.295 ± 1.886
ProHis: 0.765 ± 0.395
ProIle: 3.315 ± 0.565
ProLys: 2.55 ± 1.125
ProLeu: 2.04 ± 0.813
ProMet: 0.255 ± 0.244
ProAsn: 1.02 ± 0.657
ProPro: 0.51 ± 0.328
ProGln: 1.275 ± 0.522
ProArg: 0.51 ± 0.328
ProSer: 3.571 ± 0.805
ProThr: 3.571 ± 1.713
ProVal: 2.04 ± 0.625
ProTrp: 0.765 ± 1.106
ProTyr: 1.02 ± 0.372
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.805 ± 0.863
GlnCys: 1.02 ± 0.357
GlnAsp: 1.02 ± 0.657
GlnGlu: 2.55 ± 0.893
GlnPhe: 1.53 ± 0.678
GlnGly: 1.02 ± 0.357
GlnHis: 0.51 ± 0.179
GlnIle: 3.06 ± 0.937
GlnLys: 4.081 ± 3.074
GlnLeu: 2.55 ± 0.783
GlnMet: 0.765 ± 1.106
GlnAsn: 1.53 ± 0.825
GlnPro: 0.51 ± 0.179
GlnGln: 1.275 ± 2.2
GlnArg: 1.53 ± 0.985
GlnSer: 2.55 ± 0.783
GlnThr: 2.55 ± 0.783
GlnVal: 1.785 ± 0.559
GlnTrp: 0.255 ± 1.216
GlnTyr: 1.02 ± 0.985
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.275 ± 2.2
ArgCys: 1.785 ± 0.735
ArgAsp: 3.571 ± 1.976
ArgGlu: 3.06 ± 1.357
ArgPhe: 2.295 ± 0.594
ArgGly: 0.51 ± 0.488
ArgHis: 0.51 ± 0.328
ArgIle: 3.315 ± 0.302
ArgLys: 2.805 ± 1.349
ArgLeu: 4.591 ± 0.987
ArgMet: 2.04 ± 0.729
ArgAsn: 3.571 ± 4.161
ArgPro: 0.255 ± 0.244
ArgGln: 1.275 ± 1.088
ArgArg: 1.275 ± 0.522
ArgSer: 2.04 ± 1.236
ArgThr: 2.04 ± 0.675
ArgVal: 3.315 ± 1.809
ArgTrp: 0.255 ± 0.244
ArgTyr: 1.785 ± 0.838
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.571 ± 1.497
SerCys: 1.785 ± 1.025
SerAsp: 3.826 ± 0.562
SerGlu: 3.315 ± 0.911
SerPhe: 3.315 ± 0.911
SerGly: 3.06 ± 0.442
SerHis: 0.51 ± 0.328
SerIle: 6.121 ± 2.144
SerLys: 5.611 ± 1.257
SerLeu: 6.121 ± 0.925
SerMet: 2.04 ± 1.313
SerAsn: 1.275 ± 0.873
SerPro: 2.295 ± 0.724
SerGln: 1.53 ± 1.116
SerArg: 3.826 ± 1.346
SerSer: 3.06 ± 1.116
SerThr: 5.101 ± 1.566
SerVal: 6.121 ± 0.973
SerTrp: 0.765 ± 0.492
SerTyr: 1.785 ± 1.025
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.591 ± 0.616
ThrCys: 2.04 ± 1.263
ThrAsp: 2.805 ± 3.062
ThrGlu: 3.826 ± 1.417
ThrPhe: 4.081 ± 1.349
ThrGly: 2.805 ± 1.349
ThrHis: 1.02 ± 0.632
ThrIle: 3.571 ± 0.401
ThrLys: 3.315 ± 1.082
ThrLeu: 3.826 ± 2.728
ThrMet: 0.765 ± 0.241
ThrAsn: 4.336 ± 1.618
ThrPro: 1.785 ± 0.838
ThrGln: 1.53 ± 0.678
ThrArg: 2.04 ± 1.236
ThrSer: 4.336 ± 1.339
ThrThr: 3.315 ± 1.517
ThrVal: 3.315 ± 0.381
ThrTrp: 1.275 ± 1.042
ThrTyr: 3.571 ± 1.118
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.04 ± 0.729
ValCys: 1.53 ± 0.79
ValAsp: 1.53 ± 0.79
ValGlu: 2.55 ± 0.542
ValPhe: 3.315 ± 0.693
ValGly: 2.805 ± 1.086
ValHis: 0.51 ± 0.328
ValIle: 3.315 ± 1.015
ValLys: 2.805 ± 0.975
ValLeu: 3.571 ± 1.251
ValMet: 1.275 ± 0.392
ValAsn: 3.06 ± 1.656
ValPro: 2.295 ± 0.671
ValGln: 2.55 ± 1.925
ValArg: 1.53 ± 0.825
ValSer: 5.101 ± 0.584
ValThr: 1.785 ± 0.559
ValVal: 2.04 ± 0.955
ValTrp: 0.765 ± 0.732
ValTyr: 2.805 ± 0.907
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.255 ± 0.164
TrpCys: 0.0 ± 0.0
TrpAsp: 0.51 ± 1.151
TrpGlu: 0.255 ± 0.164
TrpPhe: 0.51 ± 0.179
TrpGly: 1.275 ± 0.392
TrpHis: 0.0 ± 0.0
TrpIle: 0.51 ± 0.179
TrpLys: 0.0 ± 0.0
TrpLeu: 1.02 ± 0.357
TrpMet: 0.255 ± 1.216
TrpAsn: 0.765 ± 0.492
TrpPro: 0.0 ± 0.0
TrpGln: 0.765 ± 0.492
TrpArg: 0.255 ± 0.244
TrpSer: 0.765 ± 0.492
TrpThr: 0.51 ± 1.151
TrpVal: 0.51 ± 0.328
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.02 ± 0.357
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.51 ± 0.179
TyrCys: 2.04 ± 1.263
TyrAsp: 0.765 ± 1.106
TyrGlu: 3.826 ± 1.441
TyrPhe: 1.785 ± 0.838
TyrGly: 2.295 ± 0.911
TyrHis: 1.02 ± 0.357
TyrIle: 5.101 ± 1.583
TyrLys: 3.826 ± 1.687
TyrLeu: 2.805 ± 1.744
TyrMet: 1.785 ± 0.559
TyrAsn: 2.55 ± 0.783
TyrPro: 1.02 ± 0.357
TyrGln: 0.765 ± 0.241
TyrArg: 1.53 ± 0.907
TyrSer: 2.04 ± 0.999
TyrThr: 3.06 ± 0.606
TyrVal: 1.02 ± 0.632
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.785 ± 1.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3922 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
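A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates (the page does not say which statistic is used). The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping dipeptides; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency across replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = dipeptide_counts(sample)
        total = sum(counts.values())
        values.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    # Assumption: the error is the standard deviation over replicates.
    return statistics.pstdev(values)


# Hypothetical toy proteome, not the Moju virus sequences.
toy = ["AAAA", "KKKK", "AKAK"]
err = bootstrap_error(toy, "AA")
```

With only 3 proteins, each replicate is one of a small number of protein combinations, so the resulting error estimates are coarse and can be large relative to the frequency itself.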

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski