Amino acid dipeptide frequency for Laibin virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
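As an illustration, dipeptide frequencies can be computed by sliding a window of two residues along each protein sequence. The following is a minimal sketch, not the code used by the database: the function name `dipeptide_permille`, the one-letter input format, and the choice to count pairs only within a protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown residue to X (Xaa)
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (not real Laibin virus sequences); B is treated as Xaa
freqs = dipeptide_permille(["MKVLAAL", "MKB"])
```

Here `freqs["MK"]` is the share of all counted pairs that are Met followed by Lys, expressed in per mille.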

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
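The unit conversion and the comparison against the uniform expectation can be sketched as follows. This is only an illustration: the helper names `to_fraction` and `classify` are hypothetical, and using 1/400 with a strict comparison as the colouring threshold is an assumption based on the description above. The three values are taken from the table below.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille
EXPECTED_PERMILLE = 1000.0 / 400

def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3

def classify(permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if permille > expected:
        return "over"      # would be marked red
    if permille < expected:
        return "under"     # would be marked blue
    return "expected"

# Values copied from the table (per mille)
table = {"AlaAla": 3.515, "AlaCys": 0.811, "LeuLeu": 10.276}
labels = {dp: classify(v) for dp, v in table.items()}
```

With these inputs, AlaAla and LeuLeu fall above 2.5‰ and AlaCys falls below it.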

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.515AlaAla: 3.515 ± 0.42
0.811AlaCys: 0.811 ± 0.875
1.893AlaAsp: 1.893 ± 0.1
2.975AlaGlu: 2.975 ± 1.916
1.622AlaPhe: 1.622 ± 0.672
3.786AlaGly: 3.786 ± 1.762
2.434AlaHis: 2.434 ± 0.643
2.434AlaIle: 2.434 ± 1.212
5.138AlaLys: 5.138 ± 0.784
5.138AlaLeu: 5.138 ± 1.157
2.163AlaMet: 2.163 ± 0.581
2.163AlaAsn: 2.163 ± 0.702
2.163AlaPro: 2.163 ± 1.056
2.975AlaGln: 2.975 ± 0.368
2.434AlaArg: 2.434 ± 1.148
5.949AlaSer: 5.949 ± 1.852
2.704AlaThr: 2.704 ± 0.09
4.056AlaVal: 4.056 ± 1.308
0.541AlaTrp: 0.541 ± 0.329
2.704AlaTyr: 2.704 ± 1.368
0.0AlaXaa: 0.0 ± 0.0
Cys
1.352CysAla: 1.352 ± 0.438
0.541CysCys: 0.541 ± 0.176
0.811CysAsp: 0.811 ± 0.453
0.541CysGlu: 0.541 ± 0.176
1.893CysPhe: 1.893 ± 0.794
1.352CysGly: 1.352 ± 0.622
0.27CysHis: 0.27 ± 0.292
2.163CysIle: 2.163 ± 0.704
1.082CysLys: 1.082 ± 0.513
2.163CysLeu: 2.163 ± 1.483
0.811CysMet: 0.811 ± 0.453
0.811CysAsn: 0.811 ± 0.875
2.975CysPro: 2.975 ± 2.634
1.352CysGln: 1.352 ± 1.181
0.27CysArg: 0.27 ± 0.292
2.434CysSer: 2.434 ± 0.643
1.622CysThr: 1.622 ± 0.906
1.352CysVal: 1.352 ± 0.31
0.27CysTrp: 0.27 ± 0.292
1.622CysTyr: 1.622 ± 1.323
0.0CysXaa: 0.0 ± 0.0
Asp
3.245AspAla: 3.245 ± 0.817
1.082AspCys: 1.082 ± 0.352
2.975AspAsp: 2.975 ± 2.053
2.434AspGlu: 2.434 ± 1.076
2.704AspPhe: 2.704 ± 0.619
2.975AspGly: 2.975 ± 0.738
1.893AspHis: 1.893 ± 0.859
3.515AspIle: 3.515 ± 0.448
3.245AspLys: 3.245 ± 0.403
8.383AspLeu: 8.383 ± 1.854
1.622AspMet: 1.622 ± 0.767
2.434AspAsn: 2.434 ± 1.078
1.893AspPro: 1.893 ± 0.794
2.163AspGln: 2.163 ± 0.702
2.434AspArg: 2.434 ± 0.333
2.704AspSer: 2.704 ± 0.443
2.434AspThr: 2.434 ± 1.327
3.245AspVal: 3.245 ± 1.107
0.811AspTrp: 0.811 ± 0.493
1.893AspTyr: 1.893 ± 0.451
0.0AspXaa: 0.0 ± 0.0
Glu
2.975GluAla: 2.975 ± 1.141
2.434GluCys: 2.434 ± 0.968
3.515GluAsp: 3.515 ± 0.26
4.056GluGlu: 4.056 ± 1.642
3.245GluPhe: 3.245 ± 0.872
3.515GluGly: 3.515 ± 0.98
1.622GluHis: 1.622 ± 0.348
4.327GluIle: 4.327 ± 0.964
5.408GluLys: 5.408 ± 2.193
6.76GluLeu: 6.76 ± 0.765
1.082GluMet: 1.082 ± 0.52
1.893GluAsn: 1.893 ± 0.425
2.975GluPro: 2.975 ± 0.919
1.622GluGln: 1.622 ± 0.35
2.163GluArg: 2.163 ± 0.263
3.245GluSer: 3.245 ± 0.872
2.975GluThr: 2.975 ± 0.654
4.327GluVal: 4.327 ± 0.854
1.622GluTrp: 1.622 ± 0.35
1.622GluTyr: 1.622 ± 0.986
0.0GluXaa: 0.0 ± 0.0
Phe
2.163PheAla: 2.163 ± 0.541
1.622PheCys: 1.622 ± 0.906
2.975PheAsp: 2.975 ± 1.404
2.704PheGlu: 2.704 ± 0.619
3.245PhePhe: 3.245 ± 0.817
1.352PheGly: 1.352 ± 0.31
1.622PheHis: 1.622 ± 0.589
2.163PheIle: 2.163 ± 0.701
3.786PheLys: 3.786 ± 1.895
4.327PheLeu: 4.327 ± 0.434
1.352PheMet: 1.352 ± 0.833
1.622PheAsn: 1.622 ± 0.595
0.541PhePro: 0.541 ± 0.41
2.163PheGln: 2.163 ± 0.183
3.245PheArg: 3.245 ± 0.211
4.327PheSer: 4.327 ± 0.551
2.434PheThr: 2.434 ± 1.078
1.352PheVal: 1.352 ± 0.634
0.27PheTrp: 0.27 ± 0.164
1.352PheTyr: 1.352 ± 0.353
0.0PheXaa: 0.0 ± 0.0
Gly
3.245GlyAla: 3.245 ± 0.812
1.352GlyCys: 1.352 ± 0.353
2.975GlyAsp: 2.975 ± 1.046
4.597GlyGlu: 4.597 ± 2.038
2.704GlyPhe: 2.704 ± 0.09
2.163GlyGly: 2.163 ± 0.701
1.893GlyHis: 1.893 ± 0.794
4.867GlyIle: 4.867 ± 1.533
2.704GlyLys: 2.704 ± 0.529
5.138GlyLeu: 5.138 ± 0.825
2.975GlyMet: 2.975 ± 1.526
3.515GlyAsn: 3.515 ± 0.272
0.811GlyPro: 0.811 ± 0.453
2.163GlyGln: 2.163 ± 0.701
1.622GlyArg: 1.622 ± 0.722
2.434GlySer: 2.434 ± 0.968
4.327GlyThr: 4.327 ± 1.301
3.245GlyVal: 3.245 ± 0.402
1.082GlyTrp: 1.082 ± 0.742
2.434GlyTyr: 2.434 ± 0.643
0.0GlyXaa: 0.0 ± 0.0
His
1.352HisAla: 1.352 ± 0.438
0.27HisCys: 0.27 ± 0.292
2.163HisAsp: 2.163 ± 0.581
0.811HisGlu: 0.811 ± 0.438
1.082HisPhe: 1.082 ± 0.271
1.082HisGly: 1.082 ± 0.742
0.27HisHis: 0.27 ± 0.164
2.163HisIle: 2.163 ± 1.074
1.622HisLys: 1.622 ± 0.528
3.245HisLeu: 3.245 ± 0.929
0.811HisMet: 0.811 ± 0.175
1.622HisAsn: 1.622 ± 0.528
1.082HisPro: 1.082 ± 0.291
0.541HisGln: 0.541 ± 0.176
0.811HisArg: 0.811 ± 0.175
1.893HisSer: 1.893 ± 1.194
1.082HisThr: 1.082 ± 0.291
1.352HisVal: 1.352 ± 0.622
0.811HisTrp: 0.811 ± 0.175
1.352HisTyr: 1.352 ± 0.438
0.0HisXaa: 0.0 ± 0.0
Ile
4.597IleAla: 4.597 ± 1.059
1.082IleCys: 1.082 ± 0.291
4.597IleAsp: 4.597 ± 0.75
5.408IleGlu: 5.408 ± 0.708
2.163IlePhe: 2.163 ± 0.704
3.245IleGly: 3.245 ± 1.056
1.082IleHis: 1.082 ± 0.291
2.975IleIle: 2.975 ± 0.926
3.245IleLys: 3.245 ± 1.606
8.112IleLeu: 8.112 ± 2.071
3.245IleMet: 3.245 ± 0.717
1.622IleAsn: 1.622 ± 0.348
3.515IlePro: 3.515 ± 1.014
3.515IleGln: 3.515 ± 1.41
2.975IleArg: 2.975 ± 1.088
5.408IleSer: 5.408 ± 0.356
2.704IleThr: 2.704 ± 0.88
5.138IleVal: 5.138 ± 1.382
0.27IleTrp: 0.27 ± 0.292
1.893IleTyr: 1.893 ± 0.473
0.0IleXaa: 0.0 ± 0.0
Lys
5.949LysAla: 5.949 ± 0.55
1.082LysCys: 1.082 ± 0.352
1.622LysAsp: 1.622 ± 0.589
6.22LysGlu: 6.22 ± 2.259
2.704LysPhe: 2.704 ± 0.867
3.515LysGly: 3.515 ± 0.42
2.704LysHis: 2.704 ± 0.877
5.408LysIle: 5.408 ± 1.753
4.056LysLys: 4.056 ± 0.931
6.49LysLeu: 6.49 ± 2.648
0.811LysMet: 0.811 ± 0.297
2.704LysAsn: 2.704 ± 0.877
1.352LysPro: 1.352 ± 0.265
1.893LysGln: 1.893 ± 0.667
2.163LysArg: 2.163 ± 0.183
2.975LysSer: 2.975 ± 0.242
5.408LysThr: 5.408 ± 0.502
7.031LysVal: 7.031 ± 2.233
1.082LysTrp: 1.082 ± 0.52
2.975LysTyr: 2.975 ± 0.738
0.0LysXaa: 0.0 ± 0.0
Leu
5.949LeuAla: 5.949 ± 1.309
2.434LeuCys: 2.434 ± 1.866
6.22LeuAsp: 6.22 ± 0.709
7.031LeuGlu: 7.031 ± 1.284
4.327LeuPhe: 4.327 ± 1.832
4.867LeuGly: 4.867 ± 0.467
2.704LeuHis: 2.704 ± 0.619
6.76LeuIle: 6.76 ± 1.001
7.031LeuLys: 7.031 ± 2.263
10.276LeuLeu: 10.276 ± 1.952
2.163LeuMet: 2.163 ± 0.581
4.327LeuAsn: 4.327 ± 1.471
4.056LeuPro: 4.056 ± 1.431
3.515LeuGln: 3.515 ± 0.98
5.138LeuArg: 5.138 ± 2.967
7.301LeuSer: 7.301 ± 2.545
5.408LeuThr: 5.408 ± 2.302
6.22LeuVal: 6.22 ± 1.248
0.27LeuTrp: 0.27 ± 0.164
5.949LeuTyr: 5.949 ± 0.888
0.0LeuXaa: 0.0 ± 0.0
Met
0.811MetAla: 0.811 ± 0.84
0.811MetCys: 0.811 ± 0.453
1.893MetAsp: 1.893 ± 0.475
2.704MetGlu: 2.704 ± 0.867
1.082MetPhe: 1.082 ± 0.657
1.352MetGly: 1.352 ± 0.715
0.541MetHis: 0.541 ± 0.584
2.434MetIle: 2.434 ± 0.865
2.434MetLys: 2.434 ± 0.333
1.622MetLeu: 1.622 ± 0.201
0.27MetMet: 0.27 ± 0.445
1.352MetAsn: 1.352 ± 0.31
0.27MetPro: 0.27 ± 0.164
1.082MetGln: 1.082 ± 0.291
1.082MetArg: 1.082 ± 0.291
1.622MetSer: 1.622 ± 0.595
1.893MetThr: 1.893 ± 0.755
0.811MetVal: 0.811 ± 0.438
0.811MetTrp: 0.811 ± 0.175
0.811MetTyr: 0.811 ± 0.438
0.0MetXaa: 0.0 ± 0.0
Asn
1.352AsnAla: 1.352 ± 0.752
1.622AsnCys: 1.622 ± 0.35
1.622AsnAsp: 1.622 ± 0.35
1.352AsnGlu: 1.352 ± 0.265
1.082AsnPhe: 1.082 ± 0.657
1.352AsnGly: 1.352 ± 0.31
0.811AsnHis: 0.811 ± 0.453
3.245AsnIle: 3.245 ± 0.701
4.597AsnLys: 4.597 ± 0.541
4.056AsnLeu: 4.056 ± 0.525
0.541AsnMet: 0.541 ± 0.176
0.811AsnAsn: 0.811 ± 0.175
2.434AsnPro: 2.434 ± 0.109
1.352AsnGln: 1.352 ± 0.353
2.704AsnArg: 2.704 ± 0.929
0.811AsnSer: 0.811 ± 0.493
1.893AsnThr: 1.893 ± 0.859
2.434AsnVal: 2.434 ± 0.773
1.622AsnTrp: 1.622 ± 0.672
1.082AsnTyr: 1.082 ± 0.657
0.0AsnXaa: 0.0 ± 0.0
Pro
1.622ProAla: 1.622 ± 0.201
0.811ProCys: 0.811 ± 0.675
2.434ProAsp: 2.434 ± 0.51
2.975ProGlu: 2.975 ± 1.032
1.893ProPhe: 1.893 ± 0.859
3.245ProGly: 3.245 ± 0.984
1.082ProHis: 1.082 ± 0.742
2.163ProIle: 2.163 ± 0.62
2.434ProLys: 2.434 ± 0.547
2.434ProLeu: 2.434 ± 0.547
0.27ProMet: 0.27 ± 0.164
1.622ProAsn: 1.622 ± 0.528
1.082ProPro: 1.082 ± 0.791
0.541ProGln: 0.541 ± 0.176
1.082ProArg: 1.082 ± 0.271
4.056ProSer: 4.056 ± 0.931
2.704ProThr: 2.704 ± 1.096
2.434ProVal: 2.434 ± 0.51
0.541ProTrp: 0.541 ± 0.491
1.082ProTyr: 1.082 ± 0.352
0.0ProXaa: 0.0 ± 0.0
Gln
3.786GlnAla: 3.786 ± 1.78
1.352GlnCys: 1.352 ± 0.622
1.352GlnAsp: 1.352 ± 0.353
1.893GlnGlu: 1.893 ± 0.91
1.352GlnPhe: 1.352 ± 0.31
1.622GlnGly: 1.622 ± 0.589
1.352GlnHis: 1.352 ± 0.438
3.245GlnIle: 3.245 ± 1.177
1.622GlnLys: 1.622 ± 0.348
4.597GlnLeu: 4.597 ± 0.623
0.541GlnMet: 0.541 ± 0.329
0.811GlnAsn: 0.811 ± 0.361
0.811GlnPro: 0.811 ± 0.175
1.622GlnGln: 1.622 ± 0.906
1.893GlnArg: 1.893 ± 0.617
4.597GlnSer: 4.597 ± 1.466
2.975GlnThr: 2.975 ± 1.564
2.163GlnVal: 2.163 ± 0.581
0.811GlnTrp: 0.811 ± 0.175
1.622GlnTyr: 1.622 ± 0.595
0.0GlnXaa: 0.0 ± 0.0
Arg
1.082ArgAla: 1.082 ± 0.52
1.082ArgCys: 1.082 ± 0.513
3.786ArgAsp: 3.786 ± 1.82
1.352ArgGlu: 1.352 ± 0.634
1.893ArgPhe: 1.893 ± 0.91
3.245ArgGly: 3.245 ± 0.211
1.352ArgHis: 1.352 ± 0.833
2.434ArgIle: 2.434 ± 0.51
2.975ArgLys: 2.975 ± 1.046
3.786ArgLeu: 3.786 ± 1.163
0.811ArgMet: 0.811 ± 0.175
1.622ArgAsn: 1.622 ± 0.767
1.082ArgPro: 1.082 ± 0.291
2.704ArgGln: 2.704 ± 0.939
1.893ArgArg: 1.893 ± 0.475
3.245ArgSer: 3.245 ± 1.107
2.704ArgThr: 2.704 ± 0.529
2.704ArgVal: 2.704 ± 0.09
0.541ArgTrp: 0.541 ± 0.329
2.434ArgTyr: 2.434 ± 0.51
0.0ArgXaa: 0.0 ± 0.0
Ser
3.245SerAla: 3.245 ± 0.762
2.163SerCys: 2.163 ± 1.906
5.138SerAsp: 5.138 ± 0.609
4.056SerGlu: 4.056 ± 1.163
3.515SerPhe: 3.515 ± 0.26
5.138SerGly: 5.138 ± 1.348
0.811SerHis: 0.811 ± 0.175
4.597SerIle: 4.597 ± 0.541
4.867SerLys: 4.867 ± 1.799
10.276SerLeu: 10.276 ± 1.082
1.622SerMet: 1.622 ± 0.722
2.163SerAsn: 2.163 ± 0.263
3.245SerPro: 3.245 ± 0.817
2.434SerGln: 2.434 ± 0.725
3.515SerArg: 3.515 ± 0.816
5.949SerSer: 5.949 ± 1.416
4.597SerThr: 4.597 ± 0.512
4.597SerVal: 4.597 ± 1.179
0.811SerTrp: 0.811 ± 0.453
3.245SerTyr: 3.245 ± 0.211
0.0SerXaa: 0.0 ± 0.0
Thr
4.597ThrAla: 4.597 ± 1.059
1.893ThrCys: 1.893 ± 1.194
1.893ThrAsp: 1.893 ± 0.755
4.597ThrGlu: 4.597 ± 0.837
3.786ThrPhe: 3.786 ± 0.434
4.327ThrGly: 4.327 ± 0.527
0.27ThrHis: 0.27 ± 0.164
4.867ThrIle: 4.867 ± 1.347
4.597ThrLys: 4.597 ± 0.368
4.327ThrLeu: 4.327 ± 0.367
1.352ThrMet: 1.352 ± 0.31
2.163ThrAsn: 2.163 ± 0.263
1.352ThrPro: 1.352 ± 0.265
1.622ThrGln: 1.622 ± 0.877
1.352ThrArg: 1.352 ± 0.265
4.867ThrSer: 4.867 ± 1.089
4.597ThrThr: 4.597 ± 1.929
5.949ThrVal: 5.949 ± 0.327
0.811ThrTrp: 0.811 ± 0.453
1.082ThrTyr: 1.082 ± 0.352
0.0ThrXaa: 0.0 ± 0.0
Val
4.056ValAla: 4.056 ± 0.271
1.893ValCys: 1.893 ± 1.614
3.515ValAsp: 3.515 ± 0.816
3.515ValGlu: 3.515 ± 0.98
2.434ValPhe: 2.434 ± 0.109
3.245ValGly: 3.245 ± 1.606
1.352ValHis: 1.352 ± 1.032
4.056ValIle: 4.056 ± 0.894
3.786ValLys: 3.786 ± 0.527
5.679ValLeu: 5.679 ± 1.15
1.082ValMet: 1.082 ± 0.82
2.163ValAsn: 2.163 ± 1.074
3.245ValPro: 3.245 ± 1.177
3.515ValGln: 3.515 ± 0.42
2.975ValArg: 2.975 ± 0.654
7.031ValSer: 7.031 ± 0.035
5.408ValThr: 5.408 ± 1.29
4.056ValVal: 4.056 ± 1.092
0.811ValTrp: 0.811 ± 0.175
2.704ValTyr: 2.704 ± 0.867
0.0ValXaa: 0.0 ± 0.0
Trp
1.352TrpAla: 1.352 ± 0.31
0.27TrpCys: 0.27 ± 0.292
0.541TrpAsp: 0.541 ± 0.329
0.27TrpGlu: 0.27 ± 0.445
0.811TrpPhe: 0.811 ± 0.493
1.622TrpGly: 1.622 ± 0.589
0.541TrpHis: 0.541 ± 0.329
0.27TrpIle: 0.27 ± 0.292
0.811TrpLys: 0.811 ± 0.175
1.893TrpLeu: 1.893 ± 0.425
0.811TrpMet: 0.811 ± 0.175
0.0TrpAsn: 0.0 ± 0.0
0.27TrpPro: 0.27 ± 0.164
0.811TrpGln: 0.811 ± 0.875
0.541TrpArg: 0.541 ± 0.329
1.082TrpSer: 1.082 ± 0.291
0.811TrpThr: 0.811 ± 0.493
1.622TrpVal: 1.622 ± 0.348
0.27TrpTrp: 0.27 ± 0.292
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.352TyrAla: 1.352 ± 0.634
1.082TyrCys: 1.082 ± 0.352
2.163TyrAsp: 2.163 ± 1.059
1.893TyrGlu: 1.893 ± 0.473
1.082TyrPhe: 1.082 ± 0.271
3.245TyrGly: 3.245 ± 0.56
0.811TyrHis: 0.811 ± 0.493
2.704TyrIle: 2.704 ± 0.443
2.975TyrLys: 2.975 ± 1.142
3.245TyrLeu: 3.245 ± 1.249
1.082TyrMet: 1.082 ± 0.831
1.352TyrAsn: 1.352 ± 0.821
1.352TyrPro: 1.352 ± 0.438
2.434TyrGln: 2.434 ± 0.547
2.434TyrArg: 2.434 ± 0.773
4.056TyrSer: 4.056 ± 1.027
1.622TyrThr: 1.622 ± 0.35
2.434TyrVal: 2.434 ± 0.109
0.541TyrTrp: 0.541 ± 0.329
1.082TyrTyr: 1.082 ± 0.291
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3699 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
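A protein-level bootstrap of this kind can be sketched as below. The page does not describe the exact procedure, so everything here is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation over the replicates is reported as the error. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy proteins (not real Laibin virus sequences)
proteins = ["MKVLAAL", "MAAKAAV", "MKKLVL"]
err = bootstrap_error(proteins, "AA")
```

A dipeptide that never occurs in any protein has the same frequency (zero) in every replicate, so its estimated error is also zero, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.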

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski