Amino acid dipeptide frequency for Beihai picorna-like virus 120

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
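As an illustration, the calculation can be sketched in a few lines of Python. This is a minimal sketch, not the code used to build this page: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter amino acid codes are used instead of the three-letter codes in the table, and normalising by the total number of overlapping dipeptide windows is an assumption (the database may normalise differently).

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumptions: overlapping pairs are counted, pairs never span two
    proteins, and counts are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # zip(seq, seq[1:]) yields every pair of adjacent residues
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome)
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The two toy sequences contain six dipeptide windows in total, two of which are KK, so KK accounts for one third of all pairs.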

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random and with equal probability, each of the 400 would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
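The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the helper names `permille_to_fraction` and `classify` are hypothetical, the threshold of 1000/400 = 2.5‰ assumes 20 equally likely standard amino acids, and the error estimates are ignored. The three example values are copied from the table.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille
expected_permille = 1000.0 / STANDARD_AMINO_ACIDS ** 2  # 2.5


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (errors ignored)."""
    if value_permille > expected:
        return "over-represented"
    if value_permille < expected:
        return "under-represented"
    return "as expected"


# Values taken from the table (in per mille)
table_values = {"LysAsp": 8.0, "AlaAla": 6.667, "CysCys": 0.333}
labels = {pair: classify(value) for pair, value in table_values.items()}
for pair, value in table_values.items():
    print(pair, permille_to_fraction(value), labels[pair])
```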

For more information, see the sequence space article on Wikipedia.

Rows are grouped by the first residue; within each group the second residue follows the column order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa. Each entry is given as frequency ± error (‰).

Ala
AlaAla: 6.667 ± 3.666
AlaCys: 1.333 ± 0.807
AlaAsp: 4.333 ± 1.776
AlaGlu: 4.667 ± 2.747
AlaPhe: 0.667 ± 2.052
AlaGly: 6.333 ± 3.803
AlaHis: 1.0 ± 0.536
AlaIle: 2.667 ± 1.218
AlaLys: 3.333 ± 1.141
AlaLeu: 5.667 ± 0.887
AlaMet: 3.333 ± 0.953
AlaAsn: 2.667 ± 1.431
AlaPro: 2.667 ± 1.218
AlaGln: 2.333 ± 1.272
AlaArg: 5.0 ± 2.192
AlaSer: 6.0 ± 3.43
AlaThr: 1.333 ± 0.923
AlaVal: 4.667 ± 0.697
AlaTrp: 1.667 ± 0.835
AlaTyr: 2.333 ± 0.759
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.333 ± 1.01
CysCys: 0.333 ± 0.179
CysAsp: 1.0 ± 0.536
CysGlu: 1.333 ± 0.714
CysPhe: 0.667 ± 0.851
CysGly: 1.667 ± 0.467
CysHis: 0.333 ± 0.179
CysIle: 0.667 ± 0.357
CysLys: 1.0 ± 0.536
CysLeu: 1.667 ± 0.893
CysMet: 1.0 ± 0.536
CysAsn: 1.0 ± 0.536
CysPro: 0.333 ± 0.179
CysGln: 1.0 ± 0.536
CysArg: 0.0 ± 0.0
CysSer: 0.667 ± 0.851
CysThr: 1.333 ± 0.714
CysVal: 1.333 ± 0.714
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.667 ± 0.697
AspCys: 1.0 ± 0.536
AspAsp: 4.333 ± 1.916
AspGlu: 4.667 ± 1.47
AspPhe: 5.333 ± 1.164
AspGly: 2.333 ± 0.759
AspHis: 1.333 ± 0.714
AspIle: 4.667 ± 1.306
AspLys: 2.333 ± 1.25
AspLeu: 4.667 ± 1.29
AspMet: 1.667 ± 0.893
AspAsn: 1.667 ± 0.835
AspPro: 4.667 ± 0.56
AspGln: 2.333 ± 1.954
AspArg: 2.333 ± 0.759
AspSer: 1.0 ± 0.81
AspThr: 2.667 ± 1.428
AspVal: 5.667 ± 1.773
AspTrp: 0.667 ± 0.851
AspTyr: 3.667 ± 0.534
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.0 ± 0.811
GluCys: 1.333 ± 0.807
GluAsp: 3.333 ± 1.286
GluGlu: 4.333 ± 2.321
GluPhe: 5.0 ± 0.88
GluGly: 3.0 ± 1.489
GluHis: 1.333 ± 0.714
GluIle: 3.667 ± 1.143
GluLys: 4.667 ± 1.917
GluLeu: 5.333 ± 1.786
GluMet: 1.667 ± 0.835
GluAsn: 2.0 ± 0.59
GluPro: 3.667 ± 1.396
GluGln: 2.0 ± 0.531
GluArg: 2.667 ± 0.893
GluSer: 4.333 ± 1.228
GluThr: 2.667 ± 0.525
GluVal: 3.667 ± 1.049
GluTrp: 1.667 ± 0.467
GluTyr: 2.333 ± 1.25
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.333 ± 0.954
PheCys: 1.667 ± 0.617
PheAsp: 2.333 ± 0.735
PheGlu: 4.0 ± 1.569
PhePhe: 3.667 ± 1.049
PheGly: 2.333 ± 0.597
PheHis: 2.0 ± 1.749
PheIle: 1.0 ± 0.877
PheLys: 4.0 ± 0.89
PheLeu: 4.0 ± 1.179
PheMet: 1.333 ± 0.391
PheAsn: 2.333 ± 0.759
PhePro: 2.0 ± 0.59
PheGln: 2.667 ± 1.025
PheArg: 3.0 ± 0.843
PheSer: 2.667 ± 1.208
PheThr: 2.0 ± 2.439
PheVal: 3.667 ± 2.346
PheTrp: 0.667 ± 0.357
PheTyr: 0.333 ± 0.582
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.667 ± 3.976
GlyCys: 0.333 ± 0.179
GlyAsp: 2.667 ± 0.811
GlyGlu: 3.667 ± 1.416
GlyPhe: 3.667 ± 2.212
GlyGly: 4.333 ± 3.587
GlyHis: 1.333 ± 0.807
GlyIle: 4.667 ± 0.697
GlyLys: 5.0 ± 1.155
GlyLeu: 5.667 ± 4.34
GlyMet: 1.333 ± 0.737
GlyAsn: 5.333 ± 1.05
GlyPro: 2.333 ± 1.01
GlyGln: 4.667 ± 1.908
GlyArg: 2.667 ± 0.782
GlySer: 3.667 ± 2.795
GlyThr: 2.0 ± 1.107
GlyVal: 3.667 ± 1.671
GlyTrp: 1.0 ± 0.536
GlyTyr: 1.667 ± 1.053
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.333 ± 0.735
HisCys: 0.667 ± 0.357
HisAsp: 3.333 ± 0.381
HisGlu: 1.0 ± 0.536
HisPhe: 0.667 ± 0.357
HisGly: 1.0 ± 0.389
HisHis: 0.667 ± 0.357
HisIle: 1.667 ± 0.893
HisLys: 1.333 ± 0.714
HisLeu: 1.667 ± 0.843
HisMet: 1.333 ± 0.714
HisAsn: 0.667 ± 0.357
HisPro: 1.0 ± 0.389
HisGln: 0.333 ± 0.179
HisArg: 0.667 ± 0.357
HisSer: 1.333 ± 0.807
HisThr: 1.0 ± 0.536
HisVal: 0.667 ± 0.928
HisTrp: 0.667 ± 0.462
HisTyr: 1.0 ± 0.536
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.333 ± 0.969
IleCys: 1.667 ± 0.893
IleAsp: 2.667 ± 0.525
IleGlu: 3.667 ± 1.143
IlePhe: 2.0 ± 0.778
IleGly: 4.333 ± 1.258
IleHis: 0.333 ± 0.179
IleIle: 1.333 ± 0.391
IleLys: 3.0 ± 1.057
IleLeu: 4.0 ± 0.454
IleMet: 2.0 ± 0.671
IleAsn: 3.333 ± 1.082
IlePro: 4.667 ± 2.584
IleGln: 0.667 ± 0.357
IleArg: 2.333 ± 1.292
IleSer: 2.0 ± 0.671
IleThr: 2.667 ± 2.621
IleVal: 6.0 ± 1.769
IleTrp: 0.333 ± 0.179
IleTyr: 1.667 ± 0.893
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.667 ± 0.571
LysCys: 1.333 ± 0.714
LysAsp: 8.0 ± 3.138
LysGlu: 2.333 ± 1.25
LysPhe: 2.667 ± 0.525
LysGly: 5.667 ± 1.635
LysHis: 1.667 ± 0.893
LysIle: 5.0 ± 1.531
LysLys: 4.0 ± 2.142
LysLeu: 3.333 ± 1.397
LysMet: 1.0 ± 0.389
LysAsn: 3.0 ± 1.166
LysPro: 3.667 ± 0.379
LysGln: 3.667 ± 1.283
LysArg: 4.0 ± 1.569
LysSer: 3.0 ± 2.351
LysThr: 5.0 ± 2.093
LysVal: 3.667 ± 1.396
LysTrp: 0.0 ± 0.0
LysTyr: 2.0 ± 0.531
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.667 ± 2.329
LeuCys: 1.667 ± 0.893
LeuAsp: 4.333 ± 1.526
LeuGlu: 5.333 ± 1.045
LeuPhe: 2.0 ± 0.531
LeuGly: 3.333 ± 0.724
LeuHis: 1.333 ± 0.714
LeuIle: 2.0 ± 1.392
LeuLys: 7.667 ± 1.249
LeuLeu: 6.0 ± 0.792
LeuMet: 2.0 ± 0.856
LeuAsn: 4.667 ± 1.441
LeuPro: 3.667 ± 2.059
LeuGln: 6.0 ± 1.961
LeuArg: 3.333 ± 0.676
LeuSer: 5.333 ± 1.045
LeuThr: 6.333 ± 0.924
LeuVal: 6.667 ± 0.992
LeuTrp: 1.0 ± 1.056
LeuTyr: 2.0 ± 0.531
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.333 ± 0.676
MetCys: 0.0 ± 0.0
MetAsp: 0.333 ± 0.179
MetGlu: 2.0 ± 1.071
MetPhe: 2.667 ± 0.811
MetGly: 2.667 ± 1.474
MetHis: 1.333 ± 0.391
MetIle: 1.333 ± 0.391
MetLys: 2.0 ± 2.352
MetLeu: 4.667 ± 0.56
MetMet: 0.667 ± 0.403
MetAsn: 1.333 ± 0.818
MetPro: 0.333 ± 0.179
MetGln: 1.667 ± 0.467
MetArg: 1.333 ± 0.714
MetSer: 2.333 ± 0.497
MetThr: 1.667 ± 0.835
MetVal: 0.667 ± 0.357
MetTrp: 0.333 ± 0.179
MetTyr: 0.667 ± 0.462
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.667 ± 2.196
AsnCys: 0.333 ± 0.179
AsnAsp: 1.333 ± 0.737
AsnGlu: 2.0 ± 0.531
AsnPhe: 1.667 ± 1.267
AsnGly: 2.333 ± 1.588
AsnHis: 0.333 ± 0.179
AsnIle: 3.0 ± 1.143
AsnLys: 2.0 ± 0.778
AsnLeu: 5.0 ± 3.134
AsnMet: 3.0 ± 1.6
AsnAsn: 1.0 ± 0.389
AsnPro: 4.0 ± 1.179
AsnGln: 3.0 ± 0.6
AsnArg: 1.0 ± 0.389
AsnSer: 4.333 ± 2.08
AsnThr: 3.0 ± 2.127
AsnVal: 3.0 ± 1.166
AsnTrp: 0.667 ± 0.357
AsnTyr: 1.333 ± 0.391
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.333 ± 2.65
ProCys: 0.333 ± 0.179
ProAsp: 2.667 ± 0.893
ProGlu: 4.0 ± 0.664
ProPhe: 2.667 ± 0.893
ProGly: 3.333 ± 1.928
ProHis: 0.667 ± 0.357
ProIle: 3.667 ± 0.94
ProLys: 3.0 ± 0.607
ProLeu: 5.0 ± 1.626
ProMet: 0.667 ± 0.851
ProAsn: 2.0 ± 2.071
ProPro: 3.333 ± 0.724
ProGln: 2.667 ± 1.428
ProArg: 1.0 ± 0.536
ProSer: 2.667 ± 0.582
ProThr: 2.667 ± 1.218
ProVal: 3.0 ± 1.057
ProTrp: 0.333 ± 0.179
ProTyr: 1.667 ± 0.78
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.0 ± 0.531
GlnCys: 0.667 ± 0.357
GlnAsp: 3.0 ± 1.053
GlnGlu: 2.333 ± 1.25
GlnPhe: 2.333 ± 1.292
GlnGly: 2.667 ± 1.354
GlnHis: 0.667 ± 0.462
GlnIle: 4.333 ± 0.596
GlnLys: 3.0 ± 0.607
GlnLeu: 1.333 ± 0.818
GlnMet: 3.0 ± 0.458
GlnAsn: 4.0 ± 1.356
GlnPro: 2.0 ± 0.531
GlnGln: 1.333 ± 0.714
GlnArg: 2.667 ± 2.481
GlnSer: 2.0 ± 1.107
GlnThr: 3.0 ± 1.143
GlnVal: 3.0 ± 0.843
GlnTrp: 0.333 ± 0.179
GlnTyr: 1.667 ± 0.893
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.0 ± 1.057
ArgCys: 1.667 ± 0.893
ArgAsp: 3.667 ± 1.049
ArgGlu: 2.0 ± 0.59
ArgPhe: 1.333 ± 0.714
ArgGly: 1.667 ± 0.467
ArgHis: 1.667 ± 0.843
ArgIle: 3.667 ± 0.534
ArgLys: 5.333 ± 2.359
ArgLeu: 3.667 ± 1.049
ArgMet: 0.333 ± 0.179
ArgAsn: 2.0 ± 1.721
ArgPro: 1.0 ± 0.536
ArgGln: 2.0 ± 0.778
ArgArg: 4.0 ± 2.142
ArgSer: 2.0 ± 0.778
ArgThr: 3.667 ± 1.396
ArgVal: 2.667 ± 1.431
ArgTrp: 0.667 ± 0.357
ArgTyr: 1.333 ± 1.703
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.333 ± 2.555
SerCys: 0.333 ± 0.179
SerAsp: 2.0 ± 0.778
SerGlu: 4.333 ± 0.587
SerPhe: 3.333 ± 1.974
SerGly: 6.0 ± 1.196
SerHis: 2.0 ± 1.071
SerIle: 2.333 ± 0.729
SerLys: 2.667 ± 1.208
SerLeu: 5.333 ± 3.443
SerMet: 2.0 ± 0.531
SerAsn: 2.667 ± 1.892
SerPro: 1.333 ± 0.391
SerGln: 2.667 ± 1.892
SerArg: 2.333 ± 1.607
SerSer: 7.333 ± 3.865
SerThr: 3.333 ± 1.234
SerVal: 5.0 ± 2.085
SerTrp: 1.333 ± 0.923
SerTyr: 2.0 ± 1.071
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.667 ± 1.458
ThrCys: 0.667 ± 0.357
ThrAsp: 4.333 ± 2.673
ThrGlu: 3.333 ± 0.724
ThrPhe: 2.0 ± 0.778
ThrGly: 3.667 ± 1.444
ThrHis: 1.333 ± 0.714
ThrIle: 4.0 ± 1.555
ThrLys: 3.333 ± 1.785
ThrLeu: 3.333 ± 2.039
ThrMet: 1.0 ± 0.536
ThrAsn: 1.667 ± 1.494
ThrPro: 2.333 ± 1.272
ThrGln: 1.333 ± 0.714
ThrArg: 3.333 ± 1.141
ThrSer: 5.0 ± 0.88
ThrThr: 4.667 ± 0.56
ThrVal: 4.333 ± 1.173
ThrTrp: 1.0 ± 0.81
ThrTyr: 3.0 ± 0.843
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.667 ± 2.278
ValCys: 2.0 ± 1.071
ValAsp: 5.0 ± 0.588
ValGlu: 3.667 ± 1.603
ValPhe: 4.0 ± 1.179
ValGly: 4.333 ± 1.258
ValHis: 2.333 ± 0.497
ValIle: 2.0 ± 0.59
ValLys: 4.333 ± 0.587
ValLeu: 6.333 ± 1.096
ValMet: 2.0 ± 0.59
ValAsn: 1.333 ± 0.391
ValPro: 3.333 ± 0.381
ValGln: 2.667 ± 1.127
ValArg: 3.333 ± 1.225
ValSer: 3.0 ± 2.43
ValThr: 6.333 ± 1.92
ValVal: 6.0 ± 0.792
ValTrp: 1.333 ± 0.714
ValTyr: 2.667 ± 0.525
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.0 ± 0.536
TrpCys: 0.0 ± 0.0
TrpAsp: 0.667 ± 0.357
TrpGlu: 1.667 ± 0.893
TrpPhe: 0.333 ± 0.582
TrpGly: 0.0 ± 0.0
TrpHis: 0.333 ± 0.179
TrpIle: 1.0 ± 0.389
TrpLys: 2.0 ± 0.888
TrpLeu: 1.0 ± 0.536
TrpMet: 1.333 ± 0.717
TrpAsn: 1.0 ± 0.536
TrpPro: 0.333 ± 0.179
TrpGln: 0.0 ± 0.0
TrpArg: 0.667 ± 0.851
TrpSer: 1.333 ± 1.222
TrpThr: 0.333 ± 0.179
TrpVal: 1.0 ± 0.389
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.333 ± 0.179
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.0 ± 1.62
TyrCys: 0.333 ± 0.179
TyrAsp: 2.333 ± 0.497
TyrGlu: 2.333 ± 0.735
TyrPhe: 1.333 ± 0.391
TyrGly: 2.333 ± 0.735
TyrHis: 1.0 ± 0.536
TyrIle: 1.333 ± 0.923
TyrLys: 1.667 ± 0.893
TyrLeu: 2.667 ± 0.811
TyrMet: 0.333 ± 0.179
TyrAsn: 0.667 ± 0.462
TyrPro: 1.0 ± 0.389
TyrGln: 2.333 ± 0.497
TyrArg: 1.667 ± 0.893
TyrSer: 1.667 ± 0.467
TyrThr: 2.667 ± 1.354
TyrVal: 3.333 ± 1.141
TyrTrp: 0.667 ± 0.851
TyrTyr: 1.667 ± 1.652
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (3001 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
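A protein-level bootstrap of this kind can be sketched as below. The details are assumptions rather than the documented procedure of the database: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the population standard deviation of the 100 estimates is reported as the error. The function names, the fixed seed, and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this proteome)
proteins = ["MKKLAKK", "AAAA", "MKLV", "KKKK"]
err = bootstrap_error(proteins, "KK")
print(f"KK: {pair_permille(proteins, 'KK'):.3f} ± {err:.3f}")
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is one reason the errors in the table are large relative to the values.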

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski