Amino acid dipeptide frequency for Beihai picorna-like virus 105

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
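The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the one-letter alphabet, the pooling of every non-standard letter into X (Xaa), and the choice to count overlapping pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted within each sequence, never across two.
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters is pooled as X.
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the virus proteome).
freqs = dipeptide_per_mille(["MAAGLK", "MKAAB"])

uniform_20 = 1000.0 / 20**2  # 2.5 per mille, i.e. 0.25%
uniform_21 = 1000.0 / 21**2  # roughly 2.27 per mille when Xaa is included
```

The result always has 441 entries, so dipeptides that never occur show up explicitly with a frequency of 0.0, as in the table.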

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
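As a small worked example of the unit conversion, the sketch below turns a few values from the table into plain fractions and compares them with the uniform expectation of 2.5‰ (0.25%). The helper names are hypothetical, and using the uniform 1/400 baseline as the over/under threshold is an assumption based on the text above; the page does not state the exact rule behind its colour marking.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25% (assumed baseline)


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def compare_to_uniform(value):
    """Classify a per-mille value against the assumed uniform baseline."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "equal"


# Values copied from the table (in per mille).
table_values = {"AlaAla": 5.727, "CysCys": 0.0, "LeuAla": 8.4}

for name, value in table_values.items():
    print(name, per_mille_to_fraction(value), compare_to_uniform(value))
```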

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.727 ± 0.044
AlaCys: 2.673 ± 0.185
AlaAsp: 1.527 ± 0.809
AlaGlu: 3.436 ± 0.642
AlaPhe: 3.436 ± 1.821
AlaGly: 5.346 ± 1.478
AlaHis: 0.764 ± 0.211
AlaIle: 5.727 ± 0.572
AlaLys: 5.346 ± 0.985
AlaLeu: 4.964 ± 1.065
AlaMet: 2.291 ± 0.598
AlaAsn: 3.818 ± 1.056
AlaPro: 3.055 ± 0.387
AlaGln: 1.527 ± 0.194
AlaArg: 6.873 ± 1.795
AlaSer: 3.818 ± 0.792
AlaThr: 4.2 ± 1.469
AlaVal: 4.582 ± 0.035
AlaTrp: 0.764 ± 0.405
AlaTyr: 5.346 ± 0.246
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.145 ± 0.607
CysCys: 0.0 ± 0.0
CysAsp: 1.145 ± 0.009
CysGlu: 0.764 ± 0.405
CysPhe: 1.527 ± 0.422
CysGly: 1.145 ± 0.607
CysHis: 0.0 ± 0.0
CysIle: 0.382 ± 0.202
CysLys: 1.145 ± 0.009
CysLeu: 0.764 ± 0.405
CysMet: 1.145 ± 0.607
CysAsn: 1.527 ± 0.422
CysPro: 0.382 ± 0.202
CysGln: 0.382 ± 0.202
CysArg: 0.382 ± 0.414
CysSer: 0.382 ± 0.202
CysThr: 0.764 ± 0.405
CysVal: 2.673 ± 0.801
CysTrp: 0.0 ± 0.0
CysTyr: 0.382 ± 0.202
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.582 ± 1.196
AspCys: 0.382 ± 0.202
AspAsp: 3.055 ± 1.003
AspGlu: 2.673 ± 1.416
AspPhe: 3.436 ± 0.026
AspGly: 3.436 ± 1.205
AspHis: 1.527 ± 0.809
AspIle: 1.527 ± 0.809
AspLys: 1.909 ± 0.396
AspLeu: 7.637 ± 0.352
AspMet: 1.909 ± 0.323
AspAsn: 0.764 ± 0.211
AspPro: 4.582 ± 1.267
AspGln: 1.909 ± 0.396
AspArg: 2.291 ± 0.633
AspSer: 3.818 ± 1.056
AspThr: 1.145 ± 0.009
AspVal: 2.673 ± 1.416
AspTrp: 0.382 ± 0.414
AspTyr: 2.291 ± 0.598
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.436 ± 1.205
GluCys: 1.527 ± 0.809
GluAsp: 1.527 ± 0.194
GluGlu: 3.055 ± 0.229
GluPhe: 3.055 ± 0.845
GluGly: 3.436 ± 1.205
GluHis: 2.291 ± 0.598
GluIle: 1.145 ± 0.607
GluLys: 1.909 ± 0.22
GluLeu: 1.909 ± 0.22
GluMet: 2.291 ± 0.598
GluAsn: 3.436 ± 0.642
GluPro: 3.055 ± 0.387
GluGln: 1.909 ± 0.22
GluArg: 3.436 ± 1.205
GluSer: 3.818 ± 0.792
GluThr: 3.055 ± 0.229
GluVal: 3.818 ± 0.176
GluTrp: 1.909 ± 0.396
GluTyr: 2.673 ± 0.185
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.673 ± 0.185
PheCys: 0.0 ± 0.0
PheAsp: 3.818 ± 0.176
PheGlu: 4.2 ± 0.378
PhePhe: 2.291 ± 0.018
PheGly: 3.055 ± 0.229
PheHis: 0.0 ± 0.0
PheIle: 3.818 ± 0.792
PheLys: 1.527 ± 0.422
PheLeu: 2.291 ± 0.598
PheMet: 1.527 ± 0.422
PheAsn: 3.436 ± 0.589
PhePro: 1.909 ± 0.22
PheGln: 1.527 ± 1.654
PheArg: 3.818 ± 0.176
PheSer: 2.291 ± 0.018
PheThr: 2.673 ± 1.663
PheVal: 4.2 ± 0.238
PheTrp: 0.382 ± 0.414
PheTyr: 2.291 ± 1.214
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.491 ± 0.871
GlyCys: 1.145 ± 0.607
GlyAsp: 4.2 ± 0.994
GlyGlu: 3.818 ± 0.792
GlyPhe: 3.055 ± 0.845
GlyGly: 4.2 ± 1.469
GlyHis: 1.145 ± 0.009
GlyIle: 2.291 ± 0.598
GlyLys: 5.346 ± 1.601
GlyLeu: 4.2 ± 1.61
GlyMet: 2.291 ± 0.598
GlyAsn: 4.964 ± 0.449
GlyPro: 2.291 ± 0.633
GlyGln: 1.909 ± 0.22
GlyArg: 3.055 ± 0.845
GlySer: 3.818 ± 1.672
GlyThr: 4.2 ± 1.469
GlyVal: 6.873 ± 1.179
GlyTrp: 0.764 ± 0.827
GlyTyr: 3.436 ± 0.026
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.527 ± 0.422
HisCys: 0.382 ± 0.202
HisAsp: 0.764 ± 0.211
HisGlu: 0.764 ± 0.405
HisPhe: 2.291 ± 0.598
HisGly: 1.145 ± 0.607
HisHis: 0.0 ± 0.0
HisIle: 1.527 ± 0.194
HisLys: 0.382 ± 0.202
HisLeu: 2.291 ± 0.598
HisMet: 0.382 ± 0.202
HisAsn: 0.382 ± 0.414
HisPro: 1.145 ± 0.009
HisGln: 1.145 ± 0.607
HisArg: 1.145 ± 0.607
HisSer: 1.145 ± 0.009
HisThr: 0.764 ± 0.211
HisVal: 1.145 ± 0.009
HisTrp: 0.382 ± 0.202
HisTyr: 0.382 ± 0.414
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.582 ± 0.035
IleCys: 0.764 ± 0.405
IleAsp: 5.727 ± 0.572
IleGlu: 2.673 ± 0.801
IlePhe: 1.145 ± 0.607
IleGly: 3.818 ± 0.176
IleHis: 1.527 ± 0.809
IleIle: 1.145 ± 0.607
IleLys: 3.436 ± 1.821
IleLeu: 1.145 ± 0.607
IleMet: 2.291 ± 0.018
IleAsn: 3.818 ± 0.44
IlePro: 3.055 ± 0.229
IleGln: 0.764 ± 0.405
IleArg: 2.291 ± 1.214
IleSer: 4.2 ± 2.085
IleThr: 3.818 ± 1.408
IleVal: 3.436 ± 0.026
IleTrp: 0.382 ± 0.202
IleTyr: 1.527 ± 1.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.909 ± 0.396
LysCys: 0.764 ± 0.405
LysAsp: 1.909 ± 0.396
LysGlu: 2.673 ± 0.185
LysPhe: 2.291 ± 0.018
LysGly: 4.964 ± 0.783
LysHis: 1.145 ± 0.607
LysIle: 2.291 ± 0.018
LysLys: 1.145 ± 0.607
LysLeu: 4.2 ± 0.238
LysMet: 0.382 ± 0.202
LysAsn: 1.527 ± 0.194
LysPro: 1.909 ± 0.396
LysGln: 0.764 ± 0.405
LysArg: 4.582 ± 1.196
LysSer: 2.291 ± 0.018
LysThr: 4.964 ± 0.167
LysVal: 2.673 ± 1.047
LysTrp: 0.382 ± 0.202
LysTyr: 3.436 ± 1.205
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.4 ± 2.323
LeuCys: 0.764 ± 0.211
LeuAsp: 3.818 ± 1.672
LeuGlu: 4.964 ± 0.783
LeuPhe: 4.582 ± 0.035
LeuGly: 3.818 ± 0.792
LeuHis: 1.527 ± 0.422
LeuIle: 2.673 ± 0.801
LeuLys: 3.818 ± 0.792
LeuLeu: 6.873 ± 0.669
LeuMet: 3.055 ± 0.387
LeuAsn: 4.582 ± 0.035
LeuPro: 1.909 ± 0.396
LeuGln: 2.673 ± 0.185
LeuArg: 4.2 ± 0.994
LeuSer: 6.491 ± 0.361
LeuThr: 4.964 ± 2.296
LeuVal: 4.964 ± 1.399
LeuTrp: 1.145 ± 0.009
LeuTyr: 4.582 ± 0.581
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.673 ± 0.185
MetCys: 0.382 ± 0.202
MetAsp: 3.436 ± 1.258
MetGlu: 0.764 ± 0.405
MetPhe: 1.527 ± 0.194
MetGly: 4.2 ± 0.853
MetHis: 1.145 ± 0.009
MetIle: 2.291 ± 1.249
MetLys: 0.382 ± 0.202
MetLeu: 3.055 ± 1.619
MetMet: 0.382 ± 0.202
MetAsn: 1.527 ± 0.809
MetPro: 1.527 ± 0.194
MetGln: 1.527 ± 0.194
MetArg: 1.909 ± 1.012
MetSer: 0.0 ± 0.0
MetThr: 1.909 ± 0.836
MetVal: 2.673 ± 1.047
MetTrp: 2.291 ± 1.214
MetTyr: 0.382 ± 0.414
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.055 ± 0.845
AsnCys: 0.764 ± 0.211
AsnAsp: 3.818 ± 0.176
AsnGlu: 2.673 ± 0.185
AsnPhe: 2.673 ± 0.801
AsnGly: 4.2 ± 1.469
AsnHis: 1.145 ± 0.607
AsnIle: 2.673 ± 0.431
AsnLys: 1.909 ± 0.22
AsnLeu: 4.964 ± 2.296
AsnMet: 2.291 ± 0.633
AsnAsn: 0.764 ± 0.405
AsnPro: 3.055 ± 1.46
AsnGln: 1.909 ± 0.396
AsnArg: 1.527 ± 0.809
AsnSer: 2.673 ± 1.416
AsnThr: 2.291 ± 0.018
AsnVal: 4.964 ± 0.449
AsnTrp: 1.909 ± 0.22
AsnTyr: 1.145 ± 0.625
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.818 ± 1.408
ProCys: 0.764 ± 0.211
ProAsp: 3.055 ± 0.387
ProGlu: 0.764 ± 0.405
ProPhe: 2.291 ± 0.018
ProGly: 3.818 ± 0.176
ProHis: 1.145 ± 0.625
ProIle: 3.055 ± 0.229
ProLys: 3.436 ± 0.589
ProLeu: 2.673 ± 0.185
ProMet: 2.291 ± 1.249
ProAsn: 2.291 ± 2.481
ProPro: 2.673 ± 2.279
ProGln: 1.527 ± 0.194
ProArg: 2.673 ± 0.431
ProSer: 3.436 ± 1.258
ProThr: 3.818 ± 0.44
ProVal: 2.673 ± 1.663
ProTrp: 0.382 ± 0.202
ProTyr: 3.055 ± 1.46
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.2 ± 0.238
GlnCys: 0.382 ± 0.202
GlnAsp: 0.764 ± 0.405
GlnGlu: 1.527 ± 0.809
GlnPhe: 1.909 ± 0.22
GlnGly: 1.909 ± 0.396
GlnHis: 0.764 ± 0.211
GlnIle: 2.673 ± 0.431
GlnLys: 0.764 ± 0.827
GlnLeu: 1.527 ± 0.194
GlnMet: 2.291 ± 0.633
GlnAsn: 1.145 ± 0.009
GlnPro: 0.382 ± 0.202
GlnGln: 1.909 ± 0.396
GlnArg: 1.145 ± 0.009
GlnSer: 3.055 ± 2.076
GlnThr: 1.909 ± 0.22
GlnVal: 1.145 ± 0.009
GlnTrp: 0.764 ± 0.405
GlnTyr: 2.291 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.527 ± 0.809
ArgCys: 0.0 ± 0.0
ArgAsp: 2.291 ± 1.214
ArgGlu: 3.055 ± 1.619
ArgPhe: 1.145 ± 0.009
ArgGly: 3.055 ± 0.387
ArgHis: 0.382 ± 0.414
ArgIle: 2.673 ± 0.801
ArgLys: 2.673 ± 0.185
ArgLeu: 6.873 ± 0.563
ArgMet: 1.527 ± 0.809
ArgAsn: 3.055 ± 1.003
ArgPro: 3.436 ± 0.642
ArgGln: 0.764 ± 0.405
ArgArg: 2.291 ± 0.633
ArgSer: 3.055 ± 1.619
ArgThr: 4.2 ± 0.853
ArgVal: 7.637 ± 0.264
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.909 ± 0.396
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.2 ± 0.238
SerCys: 0.764 ± 0.405
SerAsp: 2.291 ± 0.018
SerGlu: 1.527 ± 0.809
SerPhe: 3.436 ± 0.026
SerGly: 4.964 ± 0.449
SerHis: 0.764 ± 0.405
SerIle: 3.436 ± 0.026
SerLys: 3.055 ± 0.229
SerLeu: 8.018 ± 2.525
SerMet: 1.145 ± 0.625
SerAsn: 4.582 ± 0.651
SerPro: 3.436 ± 0.026
SerGln: 2.291 ± 1.249
SerArg: 1.909 ± 0.396
SerSer: 6.873 ± 2.516
SerThr: 8.018 ± 4.373
SerVal: 3.436 ± 1.205
SerTrp: 1.145 ± 0.607
SerTyr: 2.291 ± 0.018
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.2 ± 2.701
ThrCys: 0.764 ± 0.211
ThrAsp: 1.145 ± 0.607
ThrGlu: 3.818 ± 1.056
ThrPhe: 3.055 ± 1.46
ThrGly: 3.055 ± 2.076
ThrHis: 1.527 ± 0.194
ThrIle: 5.346 ± 0.985
ThrLys: 2.673 ± 0.801
ThrLeu: 6.873 ± 0.669
ThrMet: 1.527 ± 0.422
ThrAsn: 2.673 ± 0.431
ThrPro: 4.964 ± 1.065
ThrGln: 3.436 ± 1.874
ThrArg: 2.291 ± 0.598
ThrSer: 4.582 ± 1.883
ThrThr: 5.346 ± 0.862
ThrVal: 5.346 ± 2.094
ThrTrp: 1.909 ± 1.452
ThrTyr: 1.527 ± 0.422
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.255 ± 2.613
ValCys: 2.673 ± 0.801
ValAsp: 3.818 ± 2.023
ValGlu: 6.109 ± 2.305
ValPhe: 2.673 ± 0.431
ValGly: 5.727 ± 0.572
ValHis: 2.291 ± 0.598
ValIle: 2.291 ± 0.018
ValLys: 1.909 ± 0.396
ValLeu: 4.964 ± 0.783
ValMet: 2.673 ± 0.431
ValAsn: 1.909 ± 0.836
ValPro: 4.582 ± 2.499
ValGln: 2.291 ± 0.598
ValArg: 3.055 ± 1.003
ValSer: 6.873 ± 0.563
ValThr: 4.964 ± 2.296
ValVal: 8.018 ± 3.017
ValTrp: 1.527 ± 1.038
ValTyr: 1.909 ± 0.22
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.145 ± 0.607
TrpCys: 0.382 ± 0.202
TrpAsp: 0.764 ± 0.405
TrpGlu: 0.382 ± 0.202
TrpPhe: 0.764 ± 0.211
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.527 ± 0.809
TrpLys: 1.527 ± 0.422
TrpLeu: 1.145 ± 0.625
TrpMet: 0.382 ± 0.132
TrpAsn: 1.909 ± 0.22
TrpPro: 0.764 ± 0.211
TrpGln: 0.382 ± 0.202
TrpArg: 1.909 ± 0.836
TrpSer: 0.764 ± 0.827
TrpThr: 0.382 ± 0.202
TrpVal: 1.145 ± 0.009
TrpTrp: 0.382 ± 0.202
TrpTyr: 1.145 ± 0.009
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.673 ± 0.801
TyrCys: 0.764 ± 0.211
TyrAsp: 3.055 ± 0.387
TyrGlu: 3.055 ± 0.387
TyrPhe: 1.145 ± 0.625
TyrGly: 4.2 ± 0.238
TyrHis: 0.0 ± 0.0
TyrIle: 3.055 ± 0.387
TyrLys: 1.909 ± 0.22
TyrLeu: 3.436 ± 0.589
TyrMet: 1.527 ± 0.809
TyrAsn: 2.291 ± 1.214
TyrPro: 1.527 ± 0.422
TyrGln: 1.909 ± 1.452
TyrArg: 0.764 ± 0.211
TyrSer: 4.2 ± 1.469
TyrThr: 2.673 ± 0.431
TyrVal: 3.436 ± 0.589
TyrTrp: 0.382 ± 0.414
TyrTyr: 2.291 ± 0.018
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2620 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
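A protein-level bootstrap of this kind might look like the sketch below. The exact procedure used for this page is not described, so several things here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one two-letter pair over overlapping positions."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(resample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
proteins = ["MAAGLKAA", "MKVLAAGT"]
print(pair_frequency(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only two proteins, each resample is one of three combinations (the first protein twice, the second twice, or one of each), so the error estimate is necessarily coarse.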

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski