Amino acid dipeptide frequency for Laurel Lake virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
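As an illustration, the counting could be sketched as follows. This is a minimal sketch, not the database's own code: it assumes that overlapping adjacent pairs are counted within each protein separately and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The function name dipeptide_per_mille and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs; pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Laurel Lake virus sequences).
freqs = dipeptide_per_mille(["MKKLLA", "MKB"])
```

Here the toy input contains seven pairs in total, two of which are MK, so freqs["MK"] is 2/7 expressed in per mille.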

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
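The expected value and the red/blue split can be reproduced with a few lines. This is only a sketch under the uniform assumption stated above; the exact threshold used to colour the table is not given on this page, and the names expected_per_mille and classify are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


uniform_20 = expected_per_mille(20)  # 400 dipeptides
uniform_21 = expected_per_mille(21)  # 441 dipeptides when Xaa is included
ser_leu = classify(10.226)           # SerLeu value from the table
ala_pro = classify(0.301)            # AlaPro value from the table
```

With 20 amino acids the expectation is 2.5‰, so SerLeu (10.226‰) falls on the overrepresented side and AlaPro (0.301‰) on the underrepresented side.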

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.406AlaAla: 2.406 ± 1.052
0.602AlaCys: 0.602 ± 0.308
1.805AlaAsp: 1.805 ± 1.229
1.203AlaGlu: 1.203 ± 0.568
2.707AlaPhe: 2.707 ± 1.771
2.406AlaGly: 2.406 ± 1.624
0.602AlaHis: 0.602 ± 0.308
2.707AlaIle: 2.707 ± 0.989
4.211AlaLys: 4.211 ± 1.464
2.707AlaLeu: 2.707 ± 0.989
1.504AlaMet: 1.504 ± 0.891
1.805AlaAsn: 1.805 ± 0.924
0.301AlaPro: 0.301 ± 0.154
0.602AlaGln: 0.602 ± 0.641
2.707AlaArg: 2.707 ± 0.989
3.91AlaSer: 3.91 ± 0.843
2.105AlaThr: 2.105 ± 1.078
1.805AlaVal: 1.805 ± 1.247
0.301AlaTrp: 0.301 ± 0.154
1.504AlaTyr: 1.504 ± 0.592
0.0AlaXaa: 0.0 ± 0.0
Cys
0.602CysAla: 0.602 ± 0.308
0.0CysCys: 0.0 ± 0.0
1.203CysAsp: 1.203 ± 0.23
0.301CysGlu: 0.301 ± 0.154
0.602CysPhe: 0.602 ± 0.308
1.203CysGly: 1.203 ± 0.616
0.301CysHis: 0.301 ± 0.423
0.301CysIle: 0.301 ± 0.423
1.805CysLys: 1.805 ± 0.454
1.805CysLeu: 1.805 ± 0.924
0.602CysMet: 0.602 ± 0.308
1.805CysAsn: 1.805 ± 0.454
0.902CysPro: 0.902 ± 0.218
0.602CysGln: 0.602 ± 0.308
0.0CysArg: 0.0 ± 0.0
1.805CysSer: 1.805 ± 0.924
0.602CysThr: 0.602 ± 0.308
0.902CysVal: 0.902 ± 0.462
0.301CysTrp: 0.301 ± 0.154
0.602CysTyr: 0.602 ± 0.299
0.0CysXaa: 0.0 ± 0.0
Asp
1.203AspAla: 1.203 ± 1.282
0.301AspCys: 0.301 ± 0.154
3.609AspAsp: 3.609 ± 0.69
4.211AspGlu: 4.211 ± 0.062
3.91AspPhe: 3.91 ± 2.311
2.707AspGly: 2.707 ± 1.125
1.805AspHis: 1.805 ± 0.436
3.609AspIle: 3.609 ± 0.69
4.211AspLys: 4.211 ± 1.091
5.414AspLeu: 5.414 ± 0.606
2.707AspMet: 2.707 ± 0.443
2.105AspAsn: 2.105 ± 0.595
3.008AspPro: 3.008 ± 0.65
1.504AspGln: 1.504 ± 0.77
2.707AspArg: 2.707 ± 0.654
4.812AspSer: 4.812 ± 1.097
4.511AspThr: 4.511 ± 1.023
4.211AspVal: 4.211 ± 1.723
0.602AspTrp: 0.602 ± 0.308
4.211AspTyr: 4.211 ± 0.841
0.0AspXaa: 0.0 ± 0.0
Glu
4.211GluAla: 4.211 ± 1.161
1.805GluCys: 1.805 ± 0.454
4.812GluAsp: 4.812 ± 0.353
8.421GluGlu: 8.421 ± 0.901
4.211GluPhe: 4.211 ± 0.8
3.609GluGly: 3.609 ± 0.886
1.203GluHis: 1.203 ± 0.23
4.211GluIle: 4.211 ± 1.189
2.406GluLys: 2.406 ± 1.052
8.421GluLeu: 8.421 ± 2.148
2.707GluMet: 2.707 ± 1.125
3.008GluAsn: 3.008 ± 1.001
1.805GluPro: 1.805 ± 0.454
1.805GluGln: 1.805 ± 0.412
3.609GluArg: 3.609 ± 1.343
4.511GluSer: 4.511 ± 0.975
3.308GluThr: 3.308 ± 0.479
3.609GluVal: 3.609 ± 0.806
0.602GluTrp: 0.602 ± 0.308
2.707GluTyr: 2.707 ± 0.541
0.0GluXaa: 0.0 ± 0.0
Phe
1.805PheAla: 1.805 ± 1.172
1.203PheCys: 1.203 ± 0.616
2.105PheAsp: 2.105 ± 0.595
2.105PheGlu: 2.105 ± 0.595
1.504PhePhe: 1.504 ± 0.77
2.105PheGly: 2.105 ± 0.74
0.602PheHis: 0.602 ± 0.308
2.406PheIle: 2.406 ± 0.46
3.008PheLys: 3.008 ± 1.011
2.105PheLeu: 2.105 ± 0.589
1.203PheMet: 1.203 ± 0.616
1.805PheAsn: 1.805 ± 2.056
3.008PhePro: 3.008 ± 0.946
2.105PheGln: 2.105 ± 0.421
3.609PheArg: 3.609 ± 0.872
7.82PheSer: 7.82 ± 1.341
3.008PheThr: 3.008 ± 1.011
3.609PheVal: 3.609 ± 0.934
0.0PheTrp: 0.0 ± 0.0
1.203PheTyr: 1.203 ± 0.616
0.0PheXaa: 0.0 ± 0.0
Gly
1.805GlyAla: 1.805 ± 0.412
1.504GlyCys: 1.504 ± 0.77
3.308GlyAsp: 3.308 ± 1.193
5.414GlyGlu: 5.414 ± 0.602
1.504GlyPhe: 1.504 ± 0.891
3.008GlyGly: 3.008 ± 0.548
0.902GlyHis: 0.902 ± 0.462
2.707GlyIle: 2.707 ± 0.443
3.91GlyLys: 3.91 ± 0.178
2.707GlyLeu: 2.707 ± 0.443
2.105GlyMet: 2.105 ± 0.589
3.008GlyAsn: 3.008 ± 0.548
1.504GlyPro: 1.504 ± 0.325
0.902GlyGln: 0.902 ± 0.218
3.308GlyArg: 3.308 ± 0.479
3.008GlySer: 3.008 ± 0.548
3.008GlyThr: 3.008 ± 0.184
4.812GlyVal: 4.812 ± 0.353
0.602GlyTrp: 0.602 ± 0.299
1.203GlyTyr: 1.203 ± 0.23
0.0GlyXaa: 0.0 ± 0.0
His
1.203HisAla: 1.203 ± 0.599
0.0HisCys: 0.0 ± 0.0
2.105HisAsp: 2.105 ± 0.595
1.203HisGlu: 1.203 ± 0.23
1.504HisPhe: 1.504 ± 0.592
0.902HisGly: 0.902 ± 0.462
0.301HisHis: 0.301 ± 0.154
1.203HisIle: 1.203 ± 0.616
1.504HisLys: 1.504 ± 0.325
3.008HisLeu: 3.008 ± 1.001
1.203HisMet: 1.203 ± 0.23
0.902HisAsn: 0.902 ± 0.462
0.301HisPro: 0.301 ± 0.154
0.0HisGln: 0.0 ± 0.0
0.301HisArg: 0.301 ± 0.154
1.203HisSer: 1.203 ± 0.568
2.105HisThr: 2.105 ± 0.595
1.504HisVal: 1.504 ± 0.77
0.301HisTrp: 0.301 ± 0.154
1.203HisTyr: 1.203 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
3.008IleAla: 3.008 ± 0.548
1.504IleCys: 1.504 ± 0.77
3.91IleAsp: 3.91 ± 1.046
3.91IleGlu: 3.91 ± 0.766
0.602IlePhe: 0.602 ± 0.308
3.008IleGly: 3.008 ± 1.011
1.203IleHis: 1.203 ± 0.568
1.504IleIle: 1.504 ± 0.325
4.812IleLys: 4.812 ± 0.89
5.714IleLeu: 5.714 ± 1.287
2.105IleMet: 2.105 ± 0.396
3.609IleAsn: 3.609 ± 1.295
1.805IlePro: 1.805 ± 0.436
1.805IleGln: 1.805 ± 0.454
2.707IleArg: 2.707 ± 0.654
8.722IleSer: 8.722 ± 1.714
3.609IleThr: 3.609 ± 1.056
2.707IleVal: 2.707 ± 0.541
0.602IleTrp: 0.602 ± 0.308
1.504IleTyr: 1.504 ± 0.77
0.0IleXaa: 0.0 ± 0.0
Lys
3.008LysAla: 3.008 ± 1.693
0.301LysCys: 0.301 ± 0.154
6.617LysAsp: 6.617 ± 0.797
6.316LysGlu: 6.316 ± 1.221
3.91LysPhe: 3.91 ± 1.046
6.316LysGly: 6.316 ± 0.304
0.902LysHis: 0.902 ± 0.218
3.91LysIle: 3.91 ± 1.386
4.812LysLys: 4.812 ± 0.595
7.519LysLeu: 7.519 ± 1.19
1.504LysMet: 1.504 ± 0.77
3.008LysAsn: 3.008 ± 0.946
2.707LysPro: 2.707 ± 2.151
3.609LysGln: 3.609 ± 1.056
4.511LysArg: 4.511 ± 0.975
4.211LysSer: 4.211 ± 1.142
5.414LysThr: 5.414 ± 0.886
6.015LysVal: 6.015 ± 0.913
0.301LysTrp: 0.301 ± 0.154
3.308LysTyr: 3.308 ± 0.928
0.0LysXaa: 0.0 ± 0.0
Leu
2.406LeuAla: 2.406 ± 0.741
0.301LeuCys: 0.301 ± 0.154
4.511LeuAsp: 4.511 ± 0.975
8.722LeuGlu: 8.722 ± 0.923
4.812LeuPhe: 4.812 ± 0.83
6.316LeuGly: 6.316 ± 1.216
2.105LeuHis: 2.105 ± 0.595
4.812LeuIle: 4.812 ± 0.83
9.023LeuLys: 9.023 ± 2.269
9.323LeuLeu: 9.323 ± 3.271
2.406LeuMet: 2.406 ± 0.343
6.015LeuAsn: 6.015 ± 1.259
4.511LeuPro: 4.511 ± 0.72
2.406LeuGln: 2.406 ± 1.484
4.211LeuArg: 4.211 ± 1.723
9.925LeuSer: 9.925 ± 0.618
6.316LeuThr: 6.316 ± 0.48
3.609LeuVal: 3.609 ± 1.705
1.203LeuTrp: 1.203 ± 0.568
4.812LeuTyr: 4.812 ± 0.883
0.0LeuXaa: 0.0 ± 0.0
Met
0.301MetAla: 0.301 ± 0.423
0.301MetCys: 0.301 ± 0.154
1.504MetAsp: 1.504 ± 0.325
2.105MetGlu: 2.105 ± 0.74
2.105MetPhe: 2.105 ± 0.74
0.902MetGly: 0.902 ± 0.218
0.902MetHis: 0.902 ± 0.462
3.008MetIle: 3.008 ± 1.184
4.511MetLys: 4.511 ± 1.397
2.406MetLeu: 2.406 ± 1.232
0.602MetMet: 0.602 ± 0.308
1.504MetAsn: 1.504 ± 0.77
1.203MetPro: 1.203 ± 0.616
1.504MetGln: 1.504 ± 0.325
1.504MetArg: 1.504 ± 0.505
3.308MetSer: 3.308 ± 1.391
2.406MetThr: 2.406 ± 1.052
1.203MetVal: 1.203 ± 0.599
0.301MetTrp: 0.301 ± 0.154
1.203MetTyr: 1.203 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
1.805AsnAla: 1.805 ± 1.172
1.805AsnCys: 1.805 ± 0.454
0.902AsnAsp: 0.902 ± 0.586
2.406AsnGlu: 2.406 ± 0.741
1.805AsnPhe: 1.805 ± 0.924
1.504AsnGly: 1.504 ± 0.501
1.504AsnHis: 1.504 ± 0.501
2.707AsnIle: 2.707 ± 0.541
3.609AsnLys: 3.609 ± 1.295
7.519AsnLeu: 7.519 ± 1.953
1.203AsnMet: 1.203 ± 0.793
2.105AsnAsn: 2.105 ± 0.421
1.504AsnPro: 1.504 ± 1.218
3.008AsnGln: 3.008 ± 2.028
2.707AsnArg: 2.707 ± 0.443
5.414AsnSer: 5.414 ± 0.606
2.707AsnThr: 2.707 ± 1.386
2.406AsnVal: 2.406 ± 1.232
0.902AsnTrp: 0.902 ± 0.462
1.504AsnTyr: 1.504 ± 0.505
0.0AsnXaa: 0.0 ± 0.0
Pro
0.301ProAla: 0.301 ± 0.154
0.0ProCys: 0.0 ± 0.0
3.609ProAsp: 3.609 ± 1.447
2.707ProGlu: 2.707 ± 0.989
0.602ProPhe: 0.602 ± 0.299
1.805ProGly: 1.805 ± 0.924
0.902ProHis: 0.902 ± 0.462
3.008ProIle: 3.008 ± 1.001
4.211ProLys: 4.211 ± 0.8
2.707ProLeu: 2.707 ± 1.093
0.301ProMet: 0.301 ± 0.154
1.504ProAsn: 1.504 ± 0.77
0.902ProPro: 0.902 ± 0.755
0.301ProGln: 0.301 ± 0.154
3.008ProArg: 3.008 ± 2.279
3.008ProSer: 3.008 ± 0.633
3.008ProThr: 3.008 ± 1.183
1.203ProVal: 1.203 ± 0.568
0.301ProTrp: 0.301 ± 0.154
2.406ProTyr: 2.406 ± 0.712
0.0ProXaa: 0.0 ± 0.0
Gln
1.504GlnAla: 1.504 ± 0.505
0.902GlnCys: 0.902 ± 0.218
1.504GlnAsp: 1.504 ± 0.77
3.008GlnGlu: 3.008 ± 0.633
2.105GlnPhe: 2.105 ± 0.589
2.406GlnGly: 2.406 ± 1.137
0.602GlnHis: 0.602 ± 0.308
1.203GlnIle: 1.203 ± 0.23
2.406GlnLys: 2.406 ± 0.847
1.805GlnLeu: 1.805 ± 0.454
1.203GlnMet: 1.203 ± 0.616
1.504GlnAsn: 1.504 ± 0.501
0.301GlnPro: 0.301 ± 0.423
1.504GlnGln: 1.504 ± 1.562
1.805GlnArg: 1.805 ± 0.454
3.008GlnSer: 3.008 ± 2.301
1.504GlnThr: 1.504 ± 0.501
2.105GlnVal: 2.105 ± 0.796
0.602GlnTrp: 0.602 ± 0.299
0.902GlnTyr: 0.902 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
1.504ArgAla: 1.504 ± 0.891
1.203ArgCys: 1.203 ± 0.616
3.008ArgAsp: 3.008 ± 1.011
2.406ArgGlu: 2.406 ± 0.712
1.805ArgPhe: 1.805 ± 0.436
2.406ArgGly: 2.406 ± 0.46
0.902ArgHis: 0.902 ± 0.462
4.511ArgIle: 4.511 ± 0.467
1.504ArgLys: 1.504 ± 0.325
6.917ArgLeu: 6.917 ± 1.149
2.707ArgMet: 2.707 ± 1.386
2.105ArgAsn: 2.105 ± 0.595
1.504ArgPro: 1.504 ± 0.501
2.707ArgGln: 2.707 ± 0.541
0.902ArgArg: 0.902 ± 0.717
6.617ArgSer: 6.617 ± 2.323
2.406ArgThr: 2.406 ± 1.801
2.105ArgVal: 2.105 ± 1.376
0.902ArgTrp: 0.902 ± 0.218
0.902ArgTyr: 0.902 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
5.113SerAla: 5.113 ± 2.037
1.203SerCys: 1.203 ± 0.23
5.714SerAsp: 5.714 ± 2.516
5.414SerGlu: 5.414 ± 0.652
3.609SerPhe: 3.609 ± 0.907
4.511SerGly: 4.511 ± 1.09
2.406SerHis: 2.406 ± 0.374
7.218SerIle: 7.218 ± 1.511
8.722SerLys: 8.722 ± 2.133
10.226SerLeu: 10.226 ± 1.593
2.707SerMet: 2.707 ± 0.541
5.714SerAsn: 5.714 ± 1.499
2.707SerPro: 2.707 ± 1.093
3.008SerGln: 3.008 ± 0.548
5.714SerArg: 5.714 ± 1.274
9.624SerSer: 9.624 ± 1.37
5.414SerThr: 5.414 ± 2.293
6.316SerVal: 6.316 ± 0.59
0.301SerTrp: 0.301 ± 0.154
2.707SerTyr: 2.707 ± 1.093
0.0SerXaa: 0.0 ± 0.0
Thr
2.105ThrAla: 2.105 ± 0.362
0.301ThrCys: 0.301 ± 0.154
4.211ThrAsp: 4.211 ± 0.53
3.609ThrGlu: 3.609 ± 0.886
2.707ThrPhe: 2.707 ± 0.301
1.504ThrGly: 1.504 ± 0.505
2.707ThrHis: 2.707 ± 0.654
3.91ThrIle: 3.91 ± 0.74
4.211ThrLys: 4.211 ± 1.02
6.316ThrLeu: 6.316 ± 1.916
2.707ThrMet: 2.707 ± 0.989
3.91ThrAsn: 3.91 ± 0.384
2.105ThrPro: 2.105 ± 1.636
1.203ThrGln: 1.203 ± 0.599
3.008ThrArg: 3.008 ± 1.693
6.917ThrSer: 6.917 ± 0.582
4.812ThrThr: 4.812 ± 3.066
3.308ThrVal: 3.308 ± 0.479
0.602ThrTrp: 0.602 ± 0.299
1.203ThrTyr: 1.203 ± 0.616
0.0ThrXaa: 0.0 ± 0.0
Val
2.105ValAla: 2.105 ± 1.134
0.301ValCys: 0.301 ± 0.154
3.91ValAsp: 3.91 ± 1.046
4.812ValGlu: 4.812 ± 0.89
3.308ValPhe: 3.308 ± 0.156
2.406ValGly: 2.406 ± 0.46
0.602ValHis: 0.602 ± 0.308
3.609ValIle: 3.609 ± 1.056
5.113ValLys: 5.113 ± 1.048
6.617ValLeu: 6.617 ± 1.344
1.805ValMet: 1.805 ± 0.454
2.105ValAsn: 2.105 ± 0.589
4.211ValPro: 4.211 ± 1.947
1.504ValGln: 1.504 ± 0.592
2.105ValArg: 2.105 ± 0.595
5.414ValSer: 5.414 ± 1.308
1.805ValThr: 1.805 ± 0.412
2.406ValVal: 2.406 ± 0.94
0.602ValTrp: 0.602 ± 0.299
1.805ValTyr: 1.805 ± 1.229
0.301ValXaa: 0.301 ± 0.154
Trp
0.602TrpAla: 0.602 ± 0.308
0.902TrpCys: 0.902 ± 0.462
0.602TrpAsp: 0.602 ± 0.308
0.902TrpGlu: 0.902 ± 0.717
0.602TrpPhe: 0.602 ± 0.308
0.602TrpGly: 0.602 ± 0.299
0.602TrpHis: 0.602 ± 0.308
0.301TrpIle: 0.301 ± 0.154
0.902TrpLys: 0.902 ± 0.462
0.902TrpLeu: 0.902 ± 0.755
0.301TrpMet: 0.301 ± 0.154
0.0TrpAsn: 0.0 ± 0.0
0.902TrpPro: 0.902 ± 0.218
0.602TrpGln: 0.602 ± 0.308
0.0TrpArg: 0.0 ± 0.0
0.602TrpSer: 0.602 ± 0.308
0.301TrpThr: 0.301 ± 0.154
0.301TrpVal: 0.301 ± 0.154
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.902TyrAla: 0.902 ± 0.462
1.805TyrCys: 1.805 ± 0.898
2.406TyrAsp: 2.406 ± 1.232
1.805TyrGlu: 1.805 ± 0.412
2.105TyrPhe: 2.105 ± 0.421
0.602TyrGly: 0.602 ± 0.299
0.902TyrHis: 0.902 ± 0.462
1.504TyrIle: 1.504 ± 0.592
3.308TyrLys: 3.308 ± 0.672
3.609TyrLeu: 3.609 ± 1.295
0.902TyrMet: 0.902 ± 0.462
1.805TyrAsn: 1.805 ± 0.652
0.602TyrPro: 0.602 ± 0.299
1.203TyrGln: 1.203 ± 0.23
0.602TyrArg: 0.602 ± 0.299
4.511TyrSer: 4.511 ± 0.153
3.008TyrThr: 3.008 ± 1.84
2.707TyrVal: 2.707 ± 1.093
0.602TyrTrp: 0.602 ± 0.308
0.602TyrTyr: 0.602 ± 0.308
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.301XaaPhe: 0.301 ± 0.154
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3326 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
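One way such an estimate could be obtained is sketched below. It assumes that "bootstrapping at the protein level" means resampling whole proteins with replacement, recomputing the frequency for each of the 100 resamples, and reporting the standard deviation of those values; the actual procedure used for this table may differ. The names pair_per_mille and bootstrap_error and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (not real Laurel Lake virus sequences).
toy = ["MKKLLA", "AAAA", "MKMK"]
error_mk = bootstrap_error(toy, "MK")
```

With only a handful of proteins, as in this proteome, each resample can differ a lot from the original set, which is consistent with the relatively large errors in the table.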

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski