Amino acid dipeptide frequency for Lisianthus necrosis virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
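The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build the table: the function names are hypothetical, sequences are assumed to be given in one-letter code, any non-standard letter is mapped to X (Xaa), and dipeptides are assumed to be counted within each protein only, since the page does not say how protein boundaries are handled.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; everything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein and never
    across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (hypothetical helper)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACB"]  # toy input, not real viral sequences
    freqs = dipeptide_permille(toy)
    for pair in ("AA", "AC", "CX", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Multiplying a value from the table by 10^-3 gives the fraction of all dipeptides, so LeuLeu at 12.093‰ corresponds to about 1.2% of all dipeptides, several times the 0.25% uniform expectation.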

For more information, see the sequence space article on Wikipedia.

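Residue order (first residue as group heading, second residue within each group): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa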
Ala
AlaAla: 7.886 ± 2.148
AlaCys: 0.0 ± 0.0
AlaAsp: 0.526 ± 0.338
AlaGlu: 1.052 ± 0.404
AlaPhe: 3.155 ± 0.597
AlaGly: 6.309 ± 1.645
AlaHis: 1.577 ± 1.013
AlaIle: 6.835 ± 1.546
AlaLys: 5.258 ± 1.815
AlaLeu: 11.041 ± 2.267
AlaMet: 4.206 ± 1.612
AlaAsn: 2.103 ± 1.316
AlaPro: 4.206 ± 0.766
AlaGln: 2.103 ± 0.731
AlaArg: 4.732 ± 1.357
AlaSer: 5.258 ± 1.169
AlaThr: 7.361 ± 2.057
AlaVal: 5.258 ± 1.458
AlaTrp: 0.526 ± 0.338
AlaTyr: 1.052 ± 0.806
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.629 ± 1.147
CysCys: 1.052 ± 0.515
CysAsp: 0.526 ± 0.495
CysGlu: 1.577 ± 0.811
CysPhe: 1.052 ± 0.515
CysGly: 2.103 ± 1.013
CysHis: 0.0 ± 0.0
CysIle: 1.052 ± 0.656
CysLys: 0.0 ± 0.0
CysLeu: 2.103 ± 0.7
CysMet: 0.0 ± 0.0
CysAsn: 1.052 ± 0.515
CysPro: 0.526 ± 0.338
CysGln: 1.052 ± 0.688
CysArg: 1.577 ± 0.67
CysSer: 0.526 ± 0.338
CysThr: 2.629 ± 1.069
CysVal: 2.103 ± 0.928
CysTrp: 0.0 ± 0.0
CysTyr: 0.526 ± 0.71
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.103 ± 1.312
AspCys: 2.629 ± 1.147
AspAsp: 1.052 ± 0.404
AspGlu: 2.629 ± 0.888
AspPhe: 1.577 ± 0.737
AspGly: 4.732 ± 1.563
AspHis: 0.0 ± 0.0
AspIle: 1.577 ± 0.758
AspLys: 3.68 ± 0.645
AspLeu: 3.68 ± 1.053
AspMet: 1.052 ± 0.515
AspAsn: 0.526 ± 0.656
AspPro: 1.052 ± 0.404
AspGln: 1.052 ± 0.404
AspArg: 6.309 ± 1.709
AspSer: 3.68 ± 0.542
AspThr: 2.629 ± 1.399
AspVal: 3.68 ± 1.191
AspTrp: 0.0 ± 0.0
AspTyr: 1.052 ± 0.515
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.783 ± 1.278
GluCys: 2.103 ± 1.03
GluAsp: 4.732 ± 1.09
GluGlu: 3.68 ± 1.218
GluPhe: 0.526 ± 0.338
GluGly: 3.155 ± 1.415
GluHis: 1.052 ± 0.688
GluIle: 3.155 ± 0.879
GluLys: 2.629 ± 1.226
GluLeu: 5.783 ± 1.924
GluMet: 2.629 ± 1.234
GluAsn: 1.052 ± 0.515
GluPro: 2.629 ± 0.676
GluGln: 2.103 ± 1.091
GluArg: 4.732 ± 2.012
GluSer: 5.783 ± 2.412
GluThr: 1.052 ± 0.404
GluVal: 3.68 ± 0.949
GluTrp: 2.103 ± 0.7
GluTyr: 2.103 ± 0.597
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.103 ± 0.491
PheCys: 1.052 ± 0.676
PheAsp: 1.577 ± 0.556
PheGlu: 2.103 ± 0.884
PhePhe: 0.0 ± 0.0
PheGly: 5.258 ± 1.276
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 2.103 ± 2.624
PheLeu: 5.258 ± 0.83
PheMet: 1.577 ± 0.67
PheAsn: 0.526 ± 0.495
PhePro: 2.629 ± 0.89
PheGln: 1.577 ± 0.67
PheArg: 4.732 ± 0.75
PheSer: 2.629 ± 1.494
PheThr: 0.526 ± 0.495
PheVal: 4.206 ± 1.41
PheTrp: 0.526 ± 0.338
PheTyr: 1.052 ± 0.676
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.206 ± 2.632
GlyCys: 3.68 ± 1.332
GlyAsp: 4.732 ± 1.547
GlyGlu: 2.103 ± 0.928
GlyPhe: 2.103 ± 1.39
GlyGly: 7.361 ± 2.519
GlyHis: 0.0 ± 0.0
GlyIle: 5.258 ± 2.438
GlyLys: 5.258 ± 0.987
GlyLeu: 5.258 ± 1.55
GlyMet: 1.577 ± 1.188
GlyAsn: 4.206 ± 0.417
GlyPro: 1.052 ± 0.404
GlyGln: 1.577 ± 0.673
GlyArg: 3.68 ± 1.191
GlySer: 6.835 ± 2.902
GlyThr: 5.258 ± 2.723
GlyVal: 8.412 ± 1.336
GlyTrp: 1.052 ± 0.688
GlyTyr: 5.783 ± 1.543
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 2.103 ± 0.976
HisAsp: 0.0 ± 0.0
HisGlu: 0.526 ± 0.338
HisPhe: 0.526 ± 0.735
HisGly: 1.052 ± 0.688
HisHis: 0.526 ± 0.338
HisIle: 0.526 ± 0.338
HisLys: 0.526 ± 0.338
HisLeu: 2.103 ± 0.976
HisMet: 1.577 ± 0.67
HisAsn: 1.577 ± 0.811
HisPro: 1.052 ± 0.688
HisGln: 0.526 ± 0.338
HisArg: 1.577 ± 0.673
HisSer: 0.526 ± 0.338
HisThr: 0.0 ± 0.0
HisVal: 0.526 ± 0.495
HisTrp: 0.0 ± 0.0
HisTyr: 0.526 ± 0.495
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.68 ± 0.935
IleCys: 0.0 ± 0.0
IleAsp: 1.052 ± 0.404
IleGlu: 3.155 ± 1.296
IlePhe: 2.103 ± 0.928
IleGly: 4.206 ± 0.997
IleHis: 0.0 ± 0.0
IleIle: 1.577 ± 0.737
IleLys: 1.577 ± 0.758
IleLeu: 3.155 ± 0.737
IleMet: 1.052 ± 0.515
IleAsn: 2.629 ± 1.219
IlePro: 2.629 ± 0.869
IleGln: 1.577 ± 0.811
IleArg: 1.577 ± 1.497
IleSer: 2.629 ± 0.734
IleThr: 2.629 ± 1.069
IleVal: 2.629 ± 1.353
IleTrp: 0.0 ± 0.0
IleTyr: 1.577 ± 0.67
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.258 ± 1.705
LysCys: 0.0 ± 0.0
LysAsp: 4.732 ± 1.077
LysGlu: 4.206 ± 1.482
LysPhe: 1.577 ± 0.556
LysGly: 5.783 ± 2.136
LysHis: 0.526 ± 0.338
LysIle: 2.103 ± 0.731
LysLys: 3.155 ± 1.419
LysLeu: 5.783 ± 1.613
LysMet: 0.526 ± 0.489
LysAsn: 2.103 ± 0.491
LysPro: 1.577 ± 1.456
LysGln: 0.526 ± 0.495
LysArg: 4.206 ± 1.155
LysSer: 0.0 ± 0.0
LysThr: 2.103 ± 0.679
LysVal: 6.835 ± 1.953
LysTrp: 2.629 ± 0.869
LysTyr: 0.526 ± 0.338
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.041 ± 3.546
LeuCys: 1.052 ± 0.974
LeuAsp: 3.155 ± 1.184
LeuGlu: 3.68 ± 1.272
LeuPhe: 1.577 ± 0.635
LeuGly: 8.938 ± 1.185
LeuHis: 3.155 ± 1.258
LeuIle: 3.68 ± 1.293
LeuLys: 6.309 ± 1.514
LeuLeu: 12.093 ± 2.144
LeuMet: 2.103 ± 0.775
LeuAsn: 3.155 ± 1.138
LeuPro: 7.361 ± 0.865
LeuGln: 2.103 ± 1.266
LeuArg: 8.412 ± 2.003
LeuSer: 6.835 ± 0.948
LeuThr: 8.412 ± 1.638
LeuVal: 5.783 ± 0.977
LeuTrp: 1.052 ± 0.515
LeuTyr: 2.103 ± 1.285
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.577 ± 1.165
MetCys: 0.526 ± 0.338
MetAsp: 4.206 ± 1.531
MetGlu: 3.155 ± 0.526
MetPhe: 1.052 ± 0.515
MetGly: 1.052 ± 0.99
MetHis: 0.0 ± 0.0
MetIle: 1.052 ± 0.404
MetLys: 2.103 ± 0.928
MetLeu: 1.052 ± 0.806
MetMet: 0.526 ± 0.495
MetAsn: 0.0 ± 0.0
MetPro: 1.052 ± 0.688
MetGln: 0.526 ± 0.71
MetArg: 1.577 ± 0.67
MetSer: 1.577 ± 0.796
MetThr: 1.052 ± 0.515
MetVal: 1.577 ± 0.556
MetTrp: 0.0 ± 0.0
MetTyr: 1.577 ± 0.635
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.577 ± 0.556
AsnCys: 1.577 ± 0.838
AsnAsp: 0.526 ± 0.656
AsnGlu: 2.629 ± 0.505
AsnPhe: 2.629 ± 0.932
AsnGly: 2.103 ± 1.316
AsnHis: 0.526 ± 0.338
AsnIle: 0.526 ± 0.338
AsnLys: 3.155 ± 1.27
AsnLeu: 2.103 ± 0.807
AsnMet: 0.526 ± 0.495
AsnAsn: 3.155 ± 2.294
AsnPro: 2.103 ± 0.982
AsnGln: 0.0 ± 0.0
AsnArg: 2.103 ± 0.491
AsnSer: 3.68 ± 1.791
AsnThr: 1.577 ± 0.838
AsnVal: 4.206 ± 0.569
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.052 ± 0.404
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.732 ± 0.811
ProCys: 0.526 ± 0.656
ProAsp: 2.103 ± 0.7
ProGlu: 3.155 ± 0.597
ProPhe: 2.629 ± 0.764
ProGly: 1.577 ± 0.556
ProHis: 0.0 ± 0.0
ProIle: 1.577 ± 0.737
ProLys: 2.103 ± 0.491
ProLeu: 5.783 ± 1.329
ProMet: 0.0 ± 0.0
ProAsn: 0.526 ± 0.338
ProPro: 1.577 ± 0.758
ProGln: 1.577 ± 0.556
ProArg: 4.732 ± 2.011
ProSer: 6.309 ± 1.725
ProThr: 1.577 ± 1.486
ProVal: 5.258 ± 1.4
ProTrp: 1.052 ± 0.824
ProTyr: 1.052 ± 0.515
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.629 ± 0.862
GlnCys: 0.526 ± 0.71
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.052 ± 0.652
GlnPhe: 3.155 ± 0.936
GlnGly: 1.577 ± 0.673
GlnHis: 2.103 ± 0.884
GlnIle: 1.052 ± 0.656
GlnLys: 0.0 ± 0.0
GlnLeu: 5.783 ± 1.755
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.577 ± 0.556
GlnGln: 0.0 ± 0.0
GlnArg: 2.103 ± 0.7
GlnSer: 0.0 ± 0.0
GlnThr: 1.052 ± 0.404
GlnVal: 2.103 ± 0.679
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.526 ± 0.495
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.732 ± 1.081
ArgCys: 0.0 ± 0.0
ArgAsp: 4.206 ± 0.569
ArgGlu: 4.732 ± 1.096
ArgPhe: 2.629 ± 1.436
ArgGly: 2.103 ± 1.312
ArgHis: 1.577 ± 1.013
ArgIle: 1.577 ± 0.758
ArgLys: 4.732 ± 1.206
ArgLeu: 9.989 ± 2.391
ArgMet: 1.577 ± 0.67
ArgAsn: 3.155 ± 1.211
ArgPro: 5.783 ± 2.664
ArgGln: 1.052 ± 0.99
ArgArg: 5.783 ± 1.62
ArgSer: 2.629 ± 0.505
ArgThr: 9.989 ± 2.215
ArgVal: 4.732 ± 1.219
ArgTrp: 1.577 ± 0.865
ArgTyr: 4.732 ± 1.116
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.732 ± 1.156
SerCys: 0.526 ± 0.656
SerAsp: 1.577 ± 1.569
SerGlu: 3.68 ± 2.435
SerPhe: 3.155 ± 0.597
SerGly: 4.732 ± 1.29
SerHis: 1.052 ± 0.688
SerIle: 1.052 ± 0.676
SerLys: 3.68 ± 1.272
SerLeu: 6.309 ± 2.057
SerMet: 1.577 ± 0.503
SerAsn: 2.103 ± 0.934
SerPro: 4.732 ± 0.988
SerGln: 2.629 ± 0.723
SerArg: 6.835 ± 1.789
SerSer: 1.577 ± 0.737
SerThr: 5.258 ± 2.794
SerVal: 6.835 ± 1.739
SerTrp: 1.577 ± 1.383
SerTyr: 1.052 ± 0.404
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.68 ± 2.15
ThrCys: 0.526 ± 0.338
ThrAsp: 2.103 ± 0.679
ThrGlu: 4.732 ± 1.833
ThrPhe: 2.629 ± 0.723
ThrGly: 7.361 ± 1.514
ThrHis: 1.577 ± 0.503
ThrIle: 2.103 ± 1.266
ThrLys: 3.68 ± 0.796
ThrLeu: 8.938 ± 3.189
ThrMet: 1.577 ± 0.865
ThrAsn: 3.155 ± 1.676
ThrPro: 2.103 ± 0.696
ThrGln: 1.052 ± 0.404
ThrArg: 3.68 ± 1.611
ThrSer: 2.629 ± 1.219
ThrThr: 2.629 ± 0.793
ThrVal: 4.732 ± 1.423
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.629 ± 0.895
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.361 ± 1.881
ValCys: 3.155 ± 1.341
ValAsp: 5.258 ± 1.384
ValGlu: 9.464 ± 2.794
ValPhe: 5.258 ± 0.63
ValGly: 6.835 ± 2.103
ValHis: 2.103 ± 0.826
ValIle: 3.68 ± 1.647
ValLys: 3.155 ± 0.922
ValLeu: 1.577 ± 0.556
ValMet: 1.052 ± 0.824
ValAsn: 3.68 ± 1.786
ValPro: 2.629 ± 1.432
ValGln: 1.052 ± 0.404
ValArg: 6.835 ± 1.507
ValSer: 6.309 ± 4.164
ValThr: 4.206 ± 2.053
ValVal: 4.732 ± 0.75
ValTrp: 0.0 ± 0.0
ValTyr: 2.103 ± 1.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.526 ± 0.495
TrpCys: 0.0 ± 0.0
TrpAsp: 1.577 ± 0.673
TrpGlu: 2.103 ± 0.7
TrpPhe: 1.052 ± 0.515
TrpGly: 1.052 ± 0.515
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.052 ± 0.652
TrpLeu: 0.526 ± 0.495
TrpMet: 0.0 ± 0.0
TrpAsn: 0.526 ± 0.71
TrpPro: 0.0 ± 0.0
TrpGln: 1.577 ± 0.758
TrpArg: 1.052 ± 0.656
TrpSer: 1.577 ± 0.917
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.206 ± 2.06
TyrCys: 1.052 ± 0.806
TyrAsp: 1.052 ± 0.656
TyrGlu: 1.052 ± 0.652
TyrPhe: 1.052 ± 0.404
TyrGly: 2.103 ± 1.316
TyrHis: 0.0 ± 0.0
TyrIle: 1.052 ± 0.824
TyrLys: 0.526 ± 0.71
TyrLeu: 3.68 ± 0.431
TyrMet: 1.577 ± 0.67
TyrAsn: 0.526 ± 0.338
TyrPro: 1.577 ± 0.67
TyrGln: 1.577 ± 0.737
TyrArg: 1.052 ± 0.676
TyrSer: 3.68 ± 0.949
TyrThr: 1.577 ± 1.013
TyrVal: 3.155 ± 1.138
TyrTrp: 0.526 ± 0.495
TyrTyr: 0.526 ± 0.338
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1903 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
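A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions rather than the database's actual procedure: the function names are hypothetical, "x100" is read as 100 resamples, whole proteins are resampled with replacement, and the reported error is assumed to be the sample standard deviation across resamples (the page does not say which spread statistic is used).

```python
import random
import statistics


def dipeptide_freq_permille(proteins, dipeptide):
    """Frequency (per mille) of one two-letter dipeptide over the given proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling whole proteins.

    Assumptions: proteins are drawn with replacement, the same number as in
    the original set, and the error is the standard deviation of the
    resampled frequencies.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALLK", "MKVLAA", "MLLGAV"]  # toy input, not real viral sequences
    value = dipeptide_freq_permille(toy, "LL")
    error = bootstrap_error(toy, "LL")
    print(f"LeuLeu: {value:.3f} ± {error:.3f}")
```

Under this scheme, a dipeptide that is absent from every protein has frequency 0 in every resample and therefore an error of 0, which is consistent with the 0.0 ± 0.0 entries in the table.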

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski