Amino acid dipeptide frequency for Loreto virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
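The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names are hypothetical, the input uses one-letter codes rather than the three-letter codes of the table, and it assumes that pairs are counted as overlapping windows within each protein and never across protein boundaries (the page does not state how boundaries are handled).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (len(AMINO_ACIDS) ** 2)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted inside each sequence only;
    no pair is formed from the end of one protein and the start of another.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked red in the table
    if value_permille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


# Toy input (hypothetical sequences, not the Loreto virus proteome).
freqs = dipeptide_permille(["ALSLA", "SLK"])
```

With the values from the table, `representation(3.051)` labels AlaAla as overrepresented and `representation(1.017)` labels AlaCys as underrepresented relative to the 2.5‰ uniform expectation.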

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
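As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and into a percentage. The helper names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent: 10 per mille equal 1 percent."""
    return value_permille / 10.0


# PheSer is listed as 10.169 per mille in the table.
phe_ser_fraction = permille_to_fraction(10.169)  # about 0.010169
phe_ser_percent = permille_to_percent(10.169)    # about 1.0169 %
```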

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.051AlaAla: 3.051 ± 1.457
1.017AlaCys: 1.017 ± 0.486
3.729AlaAsp: 3.729 ± 1.781
1.017AlaGlu: 1.017 ± 0.486
2.034AlaPhe: 2.034 ± 0.416
0.0AlaGly: 0.0 ± 0.0
2.712AlaHis: 2.712 ± 1.295
3.39AlaIle: 3.39 ± 0.97
1.695AlaLys: 1.695 ± 2.653
7.797AlaLeu: 7.797 ± 1.425
2.034AlaMet: 2.034 ± 0.972
1.695AlaAsn: 1.695 ± 0.81
2.373AlaPro: 2.373 ± 1.39
0.678AlaGln: 0.678 ± 0.324
0.678AlaArg: 0.678 ± 0.324
2.373AlaSer: 2.373 ± 0.566
2.373AlaThr: 2.373 ± 3.498
5.085AlaVal: 5.085 ± 1.329
0.339AlaTrp: 0.339 ± 0.162
0.678AlaTyr: 0.678 ± 0.494
0.0AlaXaa: 0.0 ± 0.0
Cys
0.339CysAla: 0.339 ± 0.162
0.0CysCys: 0.0 ± 0.0
0.678CysAsp: 0.678 ± 0.324
1.695CysGlu: 1.695 ± 0.334
3.39CysPhe: 3.39 ± 0.669
0.678CysGly: 0.678 ± 0.494
0.678CysHis: 0.678 ± 0.324
1.356CysIle: 1.356 ± 0.988
1.356CysLys: 1.356 ± 0.648
0.678CysLeu: 0.678 ± 0.845
0.339CysMet: 0.339 ± 0.162
0.0CysAsn: 0.0 ± 0.0
0.678CysPro: 0.678 ± 0.324
0.0CysGln: 0.0 ± 0.0
1.017CysArg: 1.017 ± 0.486
2.373CysSer: 2.373 ± 1.134
1.695CysThr: 1.695 ± 0.334
0.678CysVal: 0.678 ± 0.324
0.0CysTrp: 0.0 ± 0.0
0.339CysTyr: 0.339 ± 0.627
0.0CysXaa: 0.0 ± 0.0
Asp
4.407AspAla: 4.407 ± 1.325
0.678AspCys: 0.678 ± 0.324
3.39AspAsp: 3.39 ± 0.97
3.729AspGlu: 3.729 ± 1.124
4.068AspPhe: 4.068 ± 1.28
2.373AspGly: 2.373 ± 1.134
1.356AspHis: 1.356 ± 0.322
6.78AspIle: 6.78 ± 2.392
2.373AspLys: 2.373 ± 1.134
5.085AspLeu: 5.085 ± 1.621
0.678AspMet: 0.678 ± 0.324
3.729AspAsn: 3.729 ± 0.548
3.729AspPro: 3.729 ± 1.124
1.356AspGln: 1.356 ± 0.648
2.712AspArg: 2.712 ± 1.295
5.424AspSer: 5.424 ± 1.006
2.373AspThr: 2.373 ± 0.535
4.068AspVal: 4.068 ± 0.831
0.0AspTrp: 0.0 ± 0.0
2.034AspTyr: 2.034 ± 0.416
0.0AspXaa: 0.0 ± 0.0
Glu
2.712GluAla: 2.712 ± 1.295
2.034GluCys: 2.034 ± 0.416
2.712GluAsp: 2.712 ± 1.295
2.373GluGlu: 2.373 ± 1.134
3.051GluPhe: 3.051 ± 0.636
2.373GluGly: 2.373 ± 1.134
1.017GluHis: 1.017 ± 0.384
6.441GluIle: 6.441 ± 1.357
2.712GluLys: 2.712 ± 0.672
6.102GluLeu: 6.102 ± 1.247
1.017GluMet: 1.017 ± 0.486
4.407GluAsn: 4.407 ± 0.944
1.017GluPro: 1.017 ± 0.384
1.017GluGln: 1.017 ± 0.486
2.712GluArg: 2.712 ± 1.295
3.051GluSer: 3.051 ± 0.802
4.068GluThr: 4.068 ± 1.28
3.051GluVal: 3.051 ± 0.636
0.0GluTrp: 0.0 ± 0.0
2.373GluTyr: 2.373 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
3.051PheAla: 3.051 ± 0.802
1.356PheCys: 1.356 ± 0.648
5.424PheAsp: 5.424 ± 1.006
3.39PheGlu: 3.39 ± 1.069
5.763PhePhe: 5.763 ± 1.269
3.39PheGly: 3.39 ± 0.97
1.695PheHis: 1.695 ± 1.609
4.407PheIle: 4.407 ± 0.556
4.407PheLys: 4.407 ± 3.302
4.746PheLeu: 4.746 ± 2.39
1.695PheMet: 1.695 ± 0.344
5.424PheAsn: 5.424 ± 0.266
2.034PhePro: 2.034 ± 1.481
2.373PheGln: 2.373 ± 0.535
2.034PheArg: 2.034 ± 0.416
10.169PheSer: 10.169 ± 2.163
4.407PheThr: 4.407 ± 0.226
4.068PheVal: 4.068 ± 0.831
1.356PheTrp: 1.356 ± 0.657
1.695PheTyr: 1.695 ± 1.609
0.0PheXaa: 0.0 ± 0.0
Gly
1.695GlyAla: 1.695 ± 0.607
1.356GlyCys: 1.356 ± 0.322
4.407GlyAsp: 4.407 ± 0.556
2.373GlyGlu: 2.373 ± 0.535
3.051GlyPhe: 3.051 ± 0.819
2.373GlyGly: 2.373 ± 0.535
0.339GlyHis: 0.339 ± 0.162
3.39GlyIle: 3.39 ± 0.28
3.729GlyLys: 3.729 ± 0.321
3.39GlyLeu: 3.39 ± 0.28
1.017GlyMet: 1.017 ± 0.74
2.712GlyAsn: 2.712 ± 0.672
0.339GlyPro: 0.339 ± 0.162
1.356GlyGln: 1.356 ± 1.017
1.356GlyArg: 1.356 ± 0.648
2.034GlySer: 2.034 ± 0.972
2.373GlyThr: 2.373 ± 0.633
1.356GlyVal: 1.356 ± 0.657
0.339GlyTrp: 0.339 ± 0.967
2.373GlyTyr: 2.373 ± 0.535
0.0GlyXaa: 0.0 ± 0.0
His
2.034HisAla: 2.034 ± 0.416
0.678HisCys: 0.678 ± 0.494
1.695HisAsp: 1.695 ± 0.334
2.034HisGlu: 2.034 ± 0.972
1.695HisPhe: 1.695 ± 0.87
0.339HisGly: 0.339 ± 0.162
0.0HisHis: 0.0 ± 0.0
2.712HisIle: 2.712 ± 1.295
1.017HisLys: 1.017 ± 0.384
3.051HisLeu: 3.051 ± 0.819
0.339HisMet: 0.339 ± 0.162
3.051HisAsn: 3.051 ± 1.152
1.017HisPro: 1.017 ± 0.384
1.017HisGln: 1.017 ± 0.74
2.373HisArg: 2.373 ± 0.535
3.39HisSer: 3.39 ± 1.739
2.034HisThr: 2.034 ± 0.711
2.712HisVal: 2.712 ± 0.643
0.339HisTrp: 0.339 ± 0.162
2.034HisTyr: 2.034 ± 1.481
0.0HisXaa: 0.0 ± 0.0
Ile
3.39IleAla: 3.39 ± 1.215
2.034IleCys: 2.034 ± 0.416
4.746IleAsp: 4.746 ± 1.07
4.407IleGlu: 4.407 ± 0.95
4.746IlePhe: 4.746 ± 1.703
2.373IleGly: 2.373 ± 0.633
3.39IleHis: 3.39 ± 1.739
3.051IleIle: 3.051 ± 0.636
3.051IleLys: 3.051 ± 0.819
5.763IleLeu: 5.763 ± 1.188
1.017IleMet: 1.017 ± 0.369
3.39IleAsn: 3.39 ± 0.669
5.763IlePro: 5.763 ± 0.901
1.017IleGln: 1.017 ± 0.74
4.068IleArg: 4.068 ± 1.943
6.102IleSer: 6.102 ± 0.588
3.39IleThr: 3.39 ± 0.97
5.424IleVal: 5.424 ± 2.628
0.0IleTrp: 0.0 ± 0.0
2.034IleTyr: 2.034 ± 0.599
0.0IleXaa: 0.0 ± 0.0
Lys
2.034LysAla: 2.034 ± 0.768
0.678LysCys: 0.678 ± 0.324
2.373LysAsp: 2.373 ± 0.566
3.39LysGlu: 3.39 ± 0.97
5.763LysPhe: 5.763 ± 3.383
1.017LysGly: 1.017 ± 0.486
2.373LysHis: 2.373 ± 1.134
4.068LysIle: 4.068 ± 0.831
7.458LysLys: 7.458 ± 2.707
5.763LysLeu: 5.763 ± 0.427
1.017LysMet: 1.017 ± 0.486
3.39LysAsn: 3.39 ± 0.709
3.051LysPro: 3.051 ± 1.399
1.017LysGln: 1.017 ± 0.486
2.712LysArg: 2.712 ± 0.704
4.068LysSer: 4.068 ± 2.512
4.407LysThr: 4.407 ± 1.275
3.051LysVal: 3.051 ± 0.326
0.339LysTrp: 0.339 ± 0.967
2.034LysTyr: 2.034 ± 0.416
0.0LysXaa: 0.0 ± 0.0
Leu
3.729LeuAla: 3.729 ± 0.737
0.339LeuCys: 0.339 ± 0.162
7.119LeuAsp: 7.119 ± 1.604
7.458LeuGlu: 7.458 ± 2.248
5.424LeuPhe: 5.424 ± 2.501
5.085LeuGly: 5.085 ± 1.822
2.373LeuHis: 2.373 ± 0.633
5.763LeuIle: 5.763 ± 1.224
9.831LeuLys: 9.831 ± 1.537
8.814LeuLeu: 8.814 ± 2.751
2.034LeuMet: 2.034 ± 0.768
5.763LeuAsn: 5.763 ± 1.188
2.034LeuPro: 2.034 ± 1.357
2.373LeuGln: 2.373 ± 0.535
5.763LeuArg: 5.763 ± 0.427
6.78LeuSer: 6.78 ± 1.419
4.407LeuThr: 4.407 ± 0.226
7.458LeuVal: 7.458 ± 1.559
0.339LeuTrp: 0.339 ± 0.162
4.068LeuTyr: 4.068 ± 1.911
0.0LeuXaa: 0.0 ± 0.0
Met
1.017MetAla: 1.017 ± 0.486
0.339MetCys: 0.339 ± 0.627
0.678MetAsp: 0.678 ± 0.324
0.0MetGlu: 0.0 ± 0.0
2.034MetPhe: 2.034 ± 0.972
1.017MetGly: 1.017 ± 0.486
1.017MetHis: 1.017 ± 0.384
1.017MetIle: 1.017 ± 0.486
0.678MetLys: 0.678 ± 0.494
2.373MetLeu: 2.373 ± 0.566
0.0MetMet: 0.0 ± 0.0
0.678MetAsn: 0.678 ± 0.324
0.339MetPro: 0.339 ± 0.162
0.0MetGln: 0.0 ± 0.0
1.017MetArg: 1.017 ± 0.486
1.695MetSer: 1.695 ± 0.607
1.017MetThr: 1.017 ± 0.486
0.339MetVal: 0.339 ± 0.162
0.0MetTrp: 0.0 ± 0.0
1.017MetTyr: 1.017 ± 0.486
0.0MetXaa: 0.0 ± 0.0
Asn
1.356AsnAla: 1.356 ± 0.648
1.695AsnCys: 1.695 ± 0.334
2.712AsnAsp: 2.712 ± 1.295
2.373AsnGlu: 2.373 ± 0.535
4.746AsnPhe: 4.746 ± 0.964
5.763AsnGly: 5.763 ± 1.832
1.695AsnHis: 1.695 ± 0.81
1.695AsnIle: 1.695 ± 0.87
2.373AsnLys: 2.373 ± 0.633
6.102AsnLeu: 6.102 ± 0.372
1.017AsnMet: 1.017 ± 0.486
1.695AsnAsn: 1.695 ± 0.334
1.356AsnPro: 1.356 ± 0.322
2.034AsnGln: 2.034 ± 0.416
2.034AsnArg: 2.034 ± 0.416
8.814AsnSer: 8.814 ± 2.775
3.729AsnThr: 3.729 ± 0.548
3.051AsnVal: 3.051 ± 1.457
0.339AsnTrp: 0.339 ± 0.162
3.729AsnTyr: 3.729 ± 1.005
0.0AsnXaa: 0.0 ± 0.0
Pro
2.712ProAla: 2.712 ± 4.462
0.339ProCys: 0.339 ± 0.162
2.373ProAsp: 2.373 ± 1.134
2.373ProGlu: 2.373 ± 0.566
2.373ProPhe: 2.373 ± 2.102
2.034ProGly: 2.034 ± 0.416
0.339ProHis: 0.339 ± 0.162
2.373ProIle: 2.373 ± 0.689
2.034ProLys: 2.034 ± 0.416
3.051ProLeu: 3.051 ± 1.152
0.339ProMet: 0.339 ± 0.162
3.051ProAsn: 3.051 ± 1.255
2.373ProPro: 2.373 ± 0.535
1.356ProGln: 1.356 ± 1.017
2.373ProArg: 2.373 ± 1.39
5.085ProSer: 5.085 ± 1.543
1.695ProThr: 1.695 ± 0.334
4.746ProVal: 4.746 ± 2.967
0.0ProTrp: 0.0 ± 0.0
3.051ProTyr: 3.051 ± 0.636
0.0ProXaa: 0.0 ± 0.0
Gln
1.017GlnAla: 1.017 ± 0.486
0.0GlnCys: 0.0 ± 0.0
0.339GlnAsp: 0.339 ± 0.162
0.678GlnGlu: 0.678 ± 0.324
1.695GlnPhe: 1.695 ± 0.81
1.017GlnGly: 1.017 ± 0.384
0.339GlnHis: 0.339 ± 0.162
1.695GlnIle: 1.695 ± 0.334
3.051GlnLys: 3.051 ± 1.399
1.695GlnLeu: 1.695 ± 0.334
0.339GlnMet: 0.339 ± 0.162
2.712GlnAsn: 2.712 ± 0.643
1.356GlnPro: 1.356 ± 0.988
0.0GlnGln: 0.0 ± 0.0
0.678GlnArg: 0.678 ± 0.324
4.068GlnSer: 4.068 ± 1.065
1.356GlnThr: 1.356 ± 1.691
0.678GlnVal: 0.678 ± 0.324
0.0GlnTrp: 0.0 ± 0.0
1.695GlnTyr: 1.695 ± 0.607
0.0GlnXaa: 0.0 ± 0.0
Arg
2.373ArgAla: 2.373 ± 1.134
1.017ArgCys: 1.017 ± 0.486
2.712ArgAsp: 2.712 ± 1.295
3.39ArgGlu: 3.39 ± 1.619
3.729ArgPhe: 3.729 ± 1.124
1.695ArgGly: 1.695 ± 0.81
1.356ArgHis: 1.356 ± 0.648
4.068ArgIle: 4.068 ± 0.386
2.373ArgLys: 2.373 ± 1.134
4.407ArgLeu: 4.407 ± 1.325
1.017ArgMet: 1.017 ± 0.486
4.407ArgAsn: 4.407 ± 1.438
2.034ArgPro: 2.034 ± 1.479
1.356ArgGln: 1.356 ± 1.017
1.695ArgArg: 1.695 ± 0.81
4.407ArgSer: 4.407 ± 2.105
2.373ArgThr: 2.373 ± 0.535
4.068ArgVal: 4.068 ± 1.198
0.339ArgTrp: 0.339 ± 0.162
1.356ArgTyr: 1.356 ± 0.648
0.0ArgXaa: 0.0 ± 0.0
Ser
4.407SerAla: 4.407 ± 0.226
1.017SerCys: 1.017 ± 0.384
4.407SerAsp: 4.407 ± 1.222
3.729SerGlu: 3.729 ± 1.124
7.797SerPhe: 7.797 ± 2.993
3.729SerGly: 3.729 ± 1.781
2.712SerHis: 2.712 ± 0.672
6.441SerIle: 6.441 ± 2.674
6.102SerLys: 6.102 ± 1.066
10.847SerLeu: 10.847 ± 1.884
1.356SerMet: 1.356 ± 0.648
4.746SerAsn: 4.746 ± 1.471
5.763SerPro: 5.763 ± 3.911
2.712SerGln: 2.712 ± 0.433
5.763SerArg: 5.763 ± 1.162
8.136SerSer: 8.136 ± 3.289
5.424SerThr: 5.424 ± 1.716
3.729SerVal: 3.729 ± 2.347
0.678SerTrp: 0.678 ± 0.494
3.729SerTyr: 3.729 ± 1.298
0.0SerXaa: 0.0 ± 0.0
Thr
1.356ThrAla: 1.356 ± 0.648
1.695ThrCys: 1.695 ± 0.81
3.051ThrAsp: 3.051 ± 1.457
3.051ThrGlu: 3.051 ± 0.819
2.373ThrPhe: 2.373 ± 0.535
1.695ThrGly: 1.695 ± 1.519
3.39ThrHis: 3.39 ± 2.218
2.373ThrIle: 2.373 ± 0.633
2.034ThrLys: 2.034 ± 0.711
6.78ThrLeu: 6.78 ± 1.338
0.339ThrMet: 0.339 ± 0.162
2.373ThrAsn: 2.373 ± 0.633
3.729ThrPro: 3.729 ± 1.298
1.017ThrGln: 1.017 ± 0.486
3.729ThrArg: 3.729 ± 1.196
5.763ThrSer: 5.763 ± 3.516
2.034ThrThr: 2.034 ± 0.972
3.39ThrVal: 3.39 ± 2.127
0.0ThrTrp: 0.0 ± 0.0
3.39ThrTyr: 3.39 ± 0.918
0.0ThrXaa: 0.0 ± 0.0
Val
2.712ValAla: 2.712 ± 3.381
1.017ValCys: 1.017 ± 0.384
5.085ValAsp: 5.085 ± 1.755
2.712ValGlu: 2.712 ± 1.295
5.085ValPhe: 5.085 ± 0.851
2.034ValGly: 2.034 ± 1.803
4.407ValHis: 4.407 ± 2.12
4.068ValIle: 4.068 ± 2.122
3.051ValLys: 3.051 ± 1.255
5.424ValLeu: 5.424 ± 4.296
0.0ValMet: 0.0 ± 0.762
2.712ValAsn: 2.712 ± 0.643
4.407ValPro: 4.407 ± 1.909
2.034ValGln: 2.034 ± 0.972
4.746ValArg: 4.746 ± 2.267
5.763ValSer: 5.763 ± 0.749
2.034ValThr: 2.034 ± 0.972
3.39ValVal: 3.39 ± 1.619
0.0ValTrp: 0.0 ± 0.0
2.034ValTyr: 2.034 ± 0.711
0.0ValXaa: 0.0 ± 0.0
Trp
0.339TrpAla: 0.339 ± 0.162
0.339TrpCys: 0.339 ± 0.967
0.0TrpAsp: 0.0 ± 0.0
0.678TrpGlu: 0.678 ± 0.324
1.356TrpPhe: 1.356 ± 1.017
0.339TrpGly: 0.339 ± 0.162
0.0TrpHis: 0.0 ± 0.0
0.339TrpIle: 0.339 ± 0.967
0.0TrpLys: 0.0 ± 0.0
0.339TrpLeu: 0.339 ± 0.162
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.339TrpArg: 0.339 ± 0.162
0.0TrpSer: 0.0 ± 0.0
0.678TrpThr: 0.678 ± 0.324
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.017TyrAla: 1.017 ± 0.486
0.339TyrCys: 0.339 ± 0.162
2.712TyrAsp: 2.712 ± 0.672
3.39TyrGlu: 3.39 ± 0.97
2.712TyrPhe: 2.712 ± 1.033
2.034TyrGly: 2.034 ± 0.416
2.373TyrHis: 2.373 ± 0.689
3.729TyrIle: 3.729 ± 0.737
0.678TyrLys: 0.678 ± 0.324
4.746TyrLeu: 4.746 ± 0.071
0.339TyrMet: 0.339 ± 0.162
2.373TyrAsn: 2.373 ± 1.93
0.678TyrPro: 0.678 ± 0.324
1.695TyrGln: 1.695 ± 1.519
2.373TyrArg: 2.373 ± 1.134
3.729TyrSer: 3.729 ± 1.633
1.695TyrThr: 1.695 ± 0.607
2.712TyrVal: 2.712 ± 2.528
0.339TyrTrp: 0.339 ± 0.162
1.695TyrTyr: 1.695 ± 0.862
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2951 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
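The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below under stated assumptions, is to resample whole proteins with replacement 100 times, recompute the dipeptide frequency for each resample, and report the standard deviation of those values as the error. The function names, the fixed seed, and the use of the population standard deviation are all assumptions made for this illustration.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(pair_permille(sample, pair))
    return statistics.pstdev(values)


# Toy input (hypothetical sequences, not the Loreto virus proteome).
toy = ["ALSLA", "SLK", "KKSL"]
sl_error = bootstrap_error(toy, "SL")
```

With only three proteins in the proteome, each resample can differ strongly from the original set, which may explain why some errors in the table are larger than the values they accompany.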

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski