Amino acid dipepetide frequency for Lily virus X

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.526AlaAla: 10.526 ± 7.514
1.053AlaCys: 1.053 ± 0.582
4.211AlaAsp: 4.211 ± 1.952
4.211AlaGlu: 4.211 ± 4.859
4.211AlaPhe: 4.211 ± 2.073
4.211AlaGly: 4.211 ± 1.244
5.789AlaHis: 5.789 ± 1.959
5.263AlaIle: 5.263 ± 2.351
5.789AlaLys: 5.789 ± 2.168
9.474AlaLeu: 9.474 ± 2.646
3.684AlaMet: 3.684 ± 1.285
4.737AlaAsn: 4.737 ± 1.612
5.263AlaPro: 5.263 ± 1.416
3.684AlaGln: 3.684 ± 1.478
5.263AlaArg: 5.263 ± 1.13
4.211AlaSer: 4.211 ± 1.136
5.789AlaThr: 5.789 ± 1.922
3.158AlaVal: 3.158 ± 2.669
0.0AlaTrp: 0.0 ± 0.0
4.211AlaTyr: 4.211 ± 0.862
0.0AlaXaa: 0.0 ± 0.0
Cys
2.105CysAla: 2.105 ± 1.164
0.0CysCys: 0.0 ± 0.0
0.526CysAsp: 0.526 ± 1.018
0.526CysGlu: 0.526 ± 0.291
0.526CysPhe: 0.526 ± 0.291
1.579CysGly: 1.579 ± 0.734
1.053CysHis: 1.053 ± 0.926
0.526CysIle: 0.526 ± 0.291
0.526CysLys: 0.526 ± 0.291
1.579CysLeu: 1.579 ± 0.921
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.526CysPro: 0.526 ± 0.291
2.105CysGln: 2.105 ± 1.165
1.579CysArg: 1.579 ± 0.747
3.158CysSer: 3.158 ± 2.258
1.579CysThr: 1.579 ± 1.219
1.579CysVal: 1.579 ± 1.538
0.0CysTrp: 0.0 ± 0.0
1.053CysTyr: 1.053 ± 2.15
0.0CysXaa: 0.0 ± 0.0
Asp
4.211AspAla: 4.211 ± 2.334
0.0AspCys: 0.0 ± 0.0
4.211AspAsp: 4.211 ± 1.677
4.211AspGlu: 4.211 ± 1.562
3.158AspPhe: 3.158 ± 1.06
4.211AspGly: 4.211 ± 2.517
0.526AspHis: 0.526 ± 0.983
2.105AspIle: 2.105 ± 1.164
3.158AspLys: 3.158 ± 1.082
3.684AspLeu: 3.684 ± 1.06
1.053AspMet: 1.053 ± 0.582
3.158AspAsn: 3.158 ± 0.938
2.105AspPro: 2.105 ± 0.762
2.105AspGln: 2.105 ± 0.762
2.105AspArg: 2.105 ± 0.762
2.632AspSer: 2.632 ± 1.811
3.684AspThr: 3.684 ± 1.285
4.211AspVal: 4.211 ± 0.718
1.053AspTrp: 1.053 ± 0.582
1.579AspTyr: 1.579 ± 0.921
0.0AspXaa: 0.0 ± 0.0
Glu
3.684GluAla: 3.684 ± 1.285
0.526GluCys: 0.526 ± 0.291
1.579GluAsp: 1.579 ± 2.948
2.105GluGlu: 2.105 ± 0.759
1.053GluPhe: 1.053 ± 0.817
1.053GluGly: 1.053 ± 0.841
0.526GluHis: 0.526 ± 0.291
1.579GluIle: 1.579 ± 0.747
3.158GluLys: 3.158 ± 1.747
8.947GluLeu: 8.947 ± 1.559
0.526GluMet: 0.526 ± 0.291
1.579GluAsn: 1.579 ± 0.873
4.737GluPro: 4.737 ± 1.824
1.579GluGln: 1.579 ± 0.873
0.526GluArg: 0.526 ± 0.291
3.684GluSer: 3.684 ± 1.285
2.105GluThr: 2.105 ± 0.921
5.789GluVal: 5.789 ± 1.569
1.579GluTrp: 1.579 ± 0.873
2.632GluTyr: 2.632 ± 1.526
0.0GluXaa: 0.0 ± 0.0
Phe
4.737PheAla: 4.737 ± 1.732
2.105PheCys: 2.105 ± 0.976
3.158PheAsp: 3.158 ± 2.524
0.526PheGlu: 0.526 ± 0.291
1.579PhePhe: 1.579 ± 1.84
1.053PheGly: 1.053 ± 0.817
2.105PheHis: 2.105 ± 1.164
2.105PheIle: 2.105 ± 1.075
0.0PheLys: 0.0 ± 0.0
3.684PheLeu: 3.684 ± 2.038
0.526PheMet: 0.526 ± 0.291
0.526PheAsn: 0.526 ± 0.291
2.105PhePro: 2.105 ± 0.976
3.158PheGln: 3.158 ± 0.628
4.211PheArg: 4.211 ± 1.677
2.632PheSer: 2.632 ± 1.156
3.684PheThr: 3.684 ± 1.625
3.158PheVal: 3.158 ± 2.876
0.0PheTrp: 0.0 ± 0.0
1.053PheTyr: 1.053 ± 0.582
0.0PheXaa: 0.0 ± 0.0
Gly
3.684GlyAla: 3.684 ± 1.311
1.053GlyCys: 1.053 ± 0.94
5.263GlyAsp: 5.263 ± 1.537
3.684GlyGlu: 3.684 ± 1.467
2.105GlyPhe: 2.105 ± 0.759
5.263GlyGly: 5.263 ± 2.573
3.158GlyHis: 3.158 ± 1.057
3.684GlyIle: 3.684 ± 1.433
3.158GlyLys: 3.158 ± 1.468
3.684GlyLeu: 3.684 ± 2.088
1.053GlyMet: 1.053 ± 0.766
1.053GlyAsn: 1.053 ± 0.841
1.579GlyPro: 1.579 ± 0.921
1.053GlyGln: 1.053 ± 0.582
2.105GlyArg: 2.105 ± 0.759
2.105GlySer: 2.105 ± 0.976
5.789GlyThr: 5.789 ± 4.258
3.158GlyVal: 3.158 ± 0.628
1.053GlyTrp: 1.053 ± 0.582
2.105GlyTyr: 2.105 ± 0.762
0.0GlyXaa: 0.0 ± 0.0
His
2.632HisAla: 2.632 ± 0.875
1.053HisCys: 1.053 ± 0.582
1.053HisAsp: 1.053 ± 0.841
1.579HisGlu: 1.579 ± 0.873
3.684HisPhe: 3.684 ± 0.998
4.211HisGly: 4.211 ± 1.534
3.684HisHis: 3.684 ± 0.959
3.158HisIle: 3.158 ± 1.082
0.0HisLys: 0.0 ± 0.0
2.632HisLeu: 2.632 ± 1.042
0.0HisMet: 0.0 ± 0.0
1.053HisAsn: 1.053 ± 0.94
3.158HisPro: 3.158 ± 1.468
1.579HisGln: 1.579 ± 0.873
4.737HisArg: 4.737 ± 2.993
2.105HisSer: 2.105 ± 1.004
4.737HisThr: 4.737 ± 2.852
1.053HisVal: 1.053 ± 0.926
0.0HisTrp: 0.0 ± 0.0
0.526HisTyr: 0.526 ± 1.075
0.0HisXaa: 0.0 ± 0.0
Ile
8.947IleAla: 8.947 ± 1.367
0.526IleCys: 0.526 ± 0.291
0.526IleAsp: 0.526 ± 0.983
6.316IleGlu: 6.316 ± 1.892
1.053IlePhe: 1.053 ± 0.582
2.632IleGly: 2.632 ± 0.875
1.579IleHis: 1.579 ± 0.873
2.632IleIle: 2.632 ± 1.456
2.105IleLys: 2.105 ± 1.164
5.263IleLeu: 5.263 ± 1.217
2.105IleMet: 2.105 ± 1.164
1.579IleAsn: 1.579 ± 0.873
3.684IlePro: 3.684 ± 1.285
2.105IleGln: 2.105 ± 1.164
3.684IleArg: 3.684 ± 2.594
1.579IleSer: 1.579 ± 0.873
2.632IleThr: 2.632 ± 1.012
2.632IleVal: 2.632 ± 1.526
0.0IleTrp: 0.0 ± 0.0
1.053IleTyr: 1.053 ± 0.582
0.0IleXaa: 0.0 ± 0.0
Lys
4.211LysAla: 4.211 ± 1.519
0.0LysCys: 0.0 ± 0.0
3.158LysAsp: 3.158 ± 1.747
3.158LysGlu: 3.158 ± 1.082
1.579LysPhe: 1.579 ± 2.552
1.053LysGly: 1.053 ± 0.582
1.579LysHis: 1.579 ± 0.873
2.105LysIle: 2.105 ± 1.164
1.579LysLys: 1.579 ± 0.873
5.263LysLeu: 5.263 ± 2.911
0.526LysMet: 0.526 ± 0.291
1.579LysAsn: 1.579 ± 0.873
2.632LysPro: 2.632 ± 1.156
1.053LysGln: 1.053 ± 0.582
1.579LysArg: 1.579 ± 0.921
3.158LysSer: 3.158 ± 0.938
5.263LysThr: 5.263 ± 1.146
2.105LysVal: 2.105 ± 1.164
0.526LysTrp: 0.526 ± 0.291
1.579LysTyr: 1.579 ± 0.873
0.0LysXaa: 0.0 ± 0.0
Leu
7.895LeuAla: 7.895 ± 1.91
1.579LeuCys: 1.579 ± 1.538
3.158LeuAsp: 3.158 ± 1.082
3.158LeuGlu: 3.158 ± 1.06
4.211LeuPhe: 4.211 ± 2.329
6.842LeuGly: 6.842 ± 3.775
2.632LeuHis: 2.632 ± 1.823
9.474LeuIle: 9.474 ± 1.809
5.789LeuLys: 5.789 ± 2.336
9.474LeuLeu: 9.474 ± 3.433
1.579LeuMet: 1.579 ± 0.873
3.158LeuAsn: 3.158 ± 1.082
10.0LeuPro: 10.0 ± 2.839
4.737LeuGln: 4.737 ± 0.911
5.789LeuArg: 5.789 ± 0.921
3.684LeuSer: 3.684 ± 4.059
6.842LeuThr: 6.842 ± 2.255
4.211LeuVal: 4.211 ± 1.244
1.053LeuTrp: 1.053 ± 0.841
2.105LeuTyr: 2.105 ± 1.164
0.0LeuXaa: 0.0 ± 0.0
Met
2.632MetAla: 2.632 ± 1.456
0.526MetCys: 0.526 ± 0.291
1.053MetAsp: 1.053 ± 0.582
1.053MetGlu: 1.053 ± 0.817
0.0MetPhe: 0.0 ± 0.0
1.053MetGly: 1.053 ± 0.582
0.0MetHis: 0.0 ± 0.0
0.526MetIle: 0.526 ± 0.291
1.053MetLys: 1.053 ± 0.582
3.158MetLeu: 3.158 ± 1.06
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.053MetPro: 1.053 ± 0.926
1.053MetGln: 1.053 ± 0.582
1.579MetArg: 1.579 ± 0.747
1.579MetSer: 1.579 ± 0.873
1.053MetThr: 1.053 ± 0.841
1.053MetVal: 1.053 ± 0.582
0.0MetTrp: 0.0 ± 0.0
1.053MetTyr: 1.053 ± 0.94
0.0MetXaa: 0.0 ± 0.0
Asn
3.684AsnAla: 3.684 ± 1.438
0.526AsnCys: 0.526 ± 1.014
1.053AsnAsp: 1.053 ± 0.582
2.105AsnGlu: 2.105 ± 1.164
1.579AsnPhe: 1.579 ± 0.747
1.579AsnGly: 1.579 ± 0.873
1.579AsnHis: 1.579 ± 1.358
0.526AsnIle: 0.526 ± 0.291
0.526AsnLys: 0.526 ± 0.291
2.632AsnLeu: 2.632 ± 1.842
1.053AsnMet: 1.053 ± 0.708
1.579AsnAsn: 1.579 ± 0.873
2.105AsnPro: 2.105 ± 1.164
0.526AsnGln: 0.526 ± 0.291
1.579AsnArg: 1.579 ± 0.873
1.579AsnSer: 1.579 ± 1.219
3.158AsnThr: 3.158 ± 1.355
0.526AsnVal: 0.526 ± 1.075
1.053AsnTrp: 1.053 ± 2.027
1.579AsnTyr: 1.579 ± 0.873
0.0AsnXaa: 0.0 ± 0.0
Pro
9.474ProAla: 9.474 ± 3.686
1.579ProCys: 1.579 ± 0.734
6.842ProAsp: 6.842 ± 0.882
3.158ProGlu: 3.158 ± 1.332
1.053ProPhe: 1.053 ± 0.926
2.105ProGly: 2.105 ± 1.164
2.105ProHis: 2.105 ± 3.55
3.684ProIle: 3.684 ± 1.311
3.684ProLys: 3.684 ± 2.038
3.684ProLeu: 3.684 ± 0.828
1.053ProMet: 1.053 ± 0.738
1.579ProAsn: 1.579 ± 1.784
6.842ProPro: 6.842 ± 3.951
1.579ProGln: 1.579 ± 1.219
2.105ProArg: 2.105 ± 1.164
2.105ProSer: 2.105 ± 0.762
4.737ProThr: 4.737 ± 1.612
6.316ProVal: 6.316 ± 2.937
1.053ProTrp: 1.053 ± 0.582
2.105ProTyr: 2.105 ± 0.921
0.0ProXaa: 0.0 ± 0.0
Gln
4.211GlnAla: 4.211 ± 1.562
0.526GlnCys: 0.526 ± 0.291
3.684GlnAsp: 3.684 ± 2.038
0.526GlnGlu: 0.526 ± 0.291
1.579GlnPhe: 1.579 ± 1.84
1.579GlnGly: 1.579 ± 0.873
1.579GlnHis: 1.579 ± 0.734
0.0GlnIle: 0.0 ± 0.0
1.579GlnLys: 1.579 ± 0.884
7.368GlnLeu: 7.368 ± 2.224
0.526GlnMet: 0.526 ± 0.291
0.526GlnAsn: 0.526 ± 0.291
2.632GlnPro: 2.632 ± 1.547
0.526GlnGln: 0.526 ± 0.291
2.105GlnArg: 2.105 ± 0.762
3.684GlnSer: 3.684 ± 1.478
3.158GlnThr: 3.158 ± 1.06
2.105GlnVal: 2.105 ± 1.164
1.579GlnTrp: 1.579 ± 0.873
1.053GlnTyr: 1.053 ± 0.582
0.0GlnXaa: 0.0 ± 0.0
Arg
2.632ArgAla: 2.632 ± 0.875
1.053ArgCys: 1.053 ± 1.505
4.737ArgAsp: 4.737 ± 1.824
2.105ArgGlu: 2.105 ± 1.164
2.105ArgPhe: 2.105 ± 2.048
3.684ArgGly: 3.684 ± 1.058
3.158ArgHis: 3.158 ± 1.468
2.632ArgIle: 2.632 ± 1.156
1.053ArgLys: 1.053 ± 0.582
2.632ArgLeu: 2.632 ± 0.875
0.526ArgMet: 0.526 ± 0.291
3.158ArgAsn: 3.158 ± 1.45
1.053ArgPro: 1.053 ± 1.479
3.158ArgGln: 3.158 ± 1.06
2.105ArgArg: 2.105 ± 1.164
2.105ArgSer: 2.105 ± 1.075
5.263ArgThr: 5.263 ± 2.213
3.684ArgVal: 3.684 ± 0.828
0.0ArgTrp: 0.0 ± 0.0
3.158ArgTyr: 3.158 ± 1.069
0.0ArgXaa: 0.0 ± 0.0
Ser
1.579SerAla: 1.579 ± 1.84
1.579SerCys: 1.579 ± 1.538
2.105SerAsp: 2.105 ± 0.976
2.632SerGlu: 2.632 ± 0.875
1.579SerPhe: 1.579 ± 0.873
4.737SerGly: 4.737 ± 2.146
4.737SerHis: 4.737 ± 1.297
3.158SerIle: 3.158 ± 0.628
2.632SerLys: 2.632 ± 1.456
6.842SerLeu: 6.842 ± 0.833
0.526SerMet: 0.526 ± 0.954
0.526SerAsn: 0.526 ± 0.291
4.211SerPro: 4.211 ± 0.862
3.158SerGln: 3.158 ± 1.06
2.632SerArg: 2.632 ± 0.767
2.632SerSer: 2.632 ± 1.134
2.105SerThr: 2.105 ± 0.762
3.158SerVal: 3.158 ± 2.745
0.526SerTrp: 0.526 ± 1.018
1.579SerTyr: 1.579 ± 0.873
0.0SerXaa: 0.0 ± 0.0
Thr
5.263ThrAla: 5.263 ± 4.669
3.684ThrCys: 3.684 ± 1.625
2.105ThrAsp: 2.105 ± 1.634
2.632ThrGlu: 2.632 ± 0.875
6.842ThrPhe: 6.842 ± 1.786
2.632ThrGly: 2.632 ± 1.342
3.684ThrHis: 3.684 ± 0.828
3.158ThrIle: 3.158 ± 1.747
2.105ThrLys: 2.105 ± 1.851
8.421ThrLeu: 8.421 ± 3.068
1.579ThrMet: 1.579 ± 0.747
3.684ThrAsn: 3.684 ± 1.285
7.895ThrPro: 7.895 ± 2.228
2.632ThrGln: 2.632 ± 1.042
3.158ThrArg: 3.158 ± 1.67
3.684ThrSer: 3.684 ± 1.534
6.316ThrThr: 6.316 ± 0.891
7.895ThrVal: 7.895 ± 1.678
0.526ThrTrp: 0.526 ± 1.014
2.105ThrTyr: 2.105 ± 1.075
0.0ThrXaa: 0.0 ± 0.0
Val
5.263ValAla: 5.263 ± 2.919
1.053ValCys: 1.053 ± 0.94
2.632ValAsp: 2.632 ± 0.889
3.158ValGlu: 3.158 ± 1.082
2.632ValPhe: 2.632 ± 0.889
3.684ValGly: 3.684 ± 0.998
1.579ValHis: 1.579 ± 1.244
2.632ValIle: 2.632 ± 1.042
3.684ValLys: 3.684 ± 1.285
6.316ValLeu: 6.316 ± 1.658
1.579ValMet: 1.579 ± 0.873
0.526ValAsn: 0.526 ± 1.075
4.211ValPro: 4.211 ± 3.401
2.105ValGln: 2.105 ± 1.164
2.105ValArg: 2.105 ± 0.976
2.632ValSer: 2.632 ± 1.811
9.474ValThr: 9.474 ± 2.836
5.263ValVal: 5.263 ± 2.955
1.053ValTrp: 1.053 ± 0.841
2.105ValTyr: 2.105 ± 0.762
0.0ValXaa: 0.0 ± 0.0
Trp
2.632TrpAla: 2.632 ± 1.935
1.053TrpCys: 1.053 ± 0.582
0.0TrpAsp: 0.0 ± 0.0
1.053TrpGlu: 1.053 ± 0.841
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.526TrpHis: 0.526 ± 0.291
0.0TrpIle: 0.0 ± 0.0
1.053TrpLys: 1.053 ± 0.582
0.526TrpLeu: 0.526 ± 0.291
0.526TrpMet: 0.526 ± 0.291
0.526TrpAsn: 0.526 ± 1.014
0.526TrpPro: 0.526 ± 0.291
1.053TrpGln: 1.053 ± 0.582
0.526TrpArg: 0.526 ± 1.014
0.526TrpSer: 0.526 ± 0.291
0.0TrpThr: 0.0 ± 0.0
0.526TrpVal: 0.526 ± 0.291
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.737TyrAla: 4.737 ± 1.278
1.053TyrCys: 1.053 ± 1.452
2.105TyrAsp: 2.105 ± 0.921
1.053TyrGlu: 1.053 ± 0.582
1.579TyrPhe: 1.579 ± 0.734
2.632TyrGly: 2.632 ± 1.842
1.053TyrHis: 1.053 ± 0.582
3.684TyrIle: 3.684 ± 1.82
1.053TyrLys: 1.053 ± 0.582
2.105TyrLeu: 2.105 ± 0.762
0.526TyrMet: 0.526 ± 0.291
0.0TyrAsn: 0.0 ± 0.0
1.053TyrPro: 1.053 ± 0.817
1.579TyrGln: 1.579 ± 0.873
0.526TyrArg: 0.526 ± 0.291
3.158TyrSer: 3.158 ± 1.221
2.632TyrThr: 2.632 ± 1.012
2.105TyrVal: 2.105 ± 0.921
0.0TyrTrp: 0.0 ± 0.0
1.053TyrTyr: 1.053 ± 0.841
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1901 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski