Amino acid dipeptide frequency for Beihai picorna-like virus 75

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
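As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used by Proteome-pI), the Python snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical. The sketch assumes one-letter codes, maps any non-standard letter to X (Xaa), and assumes that pairs are not counted across protein boundaries.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each protein only, never across
    protein boundaries. Any letter outside the 20 standard residues is
    mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # A protein of length n contributes n - 1 overlapping dipeptides.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the viral proteins from this page).
freqs = dipeptide_frequencies(["AAAC", "ACU"])
print(freqs)
```

Here the letter U is not among the 20 standard residues, so the pair ending in it is reported as CX.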

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with every dipeptide equally likely, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
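The arithmetic behind the expected value can be sketched in a few lines of Python. The function names are hypothetical, and the comparison against the uniform expectation is an assumption: the page does not state the exact cutoff used for the red and blue marking.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all are equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide as 'over', 'under', or 'as expected'."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille(20)           # 1000 / 400
expected_with_xaa = uniform_expectation_permille(21)  # 1000 / 441

# Three values copied from the table on this page (in per mille).
for name, value in [("AlaAla", 8.062), ("AlaCys", 0.645), ("AspGly", 2.58)]:
    print(name, classify(value, expected))
```

A uniform baseline ignores the fact that single amino acids themselves differ in abundance, so it is only the simplest possible reference.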

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
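For readers who prefer fractions or percentages, the conversion is a one-liner. The helper names below are hypothetical, and the sample value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 % equals 10 per mille)."""
    return value_permille / 10.0


ala_ala = 8.062  # AlaAla entry from the table, in per mille
print(f"{permille_to_fraction(ala_ala):.6f}")
print(f"{permille_to_percent(ala_ala):.4f} %")
```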

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.062 ± 0.646
AlaCys: 0.645 ± 0.365
AlaAsp: 3.225 ± 0.146
AlaGlu: 1.612 ± 0.912
AlaPhe: 4.515 ± 0.242
AlaGly: 5.16 ± 1.555
AlaHis: 2.257 ± 0.401
AlaIle: 3.225 ± 0.413
AlaLys: 3.225 ± 1.265
AlaLeu: 5.16 ± 1.241
AlaMet: 1.29 ± 0.73
AlaAsn: 3.87 ± 0.048
AlaPro: 4.192 ± 0.984
AlaGln: 2.257 ± 0.401
AlaArg: 5.482 ± 1.423
AlaSer: 3.225 ± 0.413
AlaThr: 2.902 ± 2.273
AlaVal: 4.192 ± 1.812
AlaTrp: 0.322 ± 0.182
AlaTyr: 2.257 ± 1.277
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.257 ± 1.277
CysCys: 0.0 ± 0.0
CysAsp: 1.612 ± 0.353
CysGlu: 0.645 ± 0.365
CysPhe: 0.0 ± 0.0
CysGly: 1.29 ± 0.73
CysHis: 0.0 ± 0.0
CysIle: 1.29 ± 0.17
CysLys: 0.645 ± 0.365
CysLeu: 1.29 ± 0.17
CysMet: 0.322 ± 0.182
CysAsn: 0.322 ± 0.182
CysPro: 1.29 ± 0.17
CysGln: 0.645 ± 0.194
CysArg: 0.645 ± 0.365
CysSer: 2.257 ± 1.277
CysThr: 1.612 ± 0.206
CysVal: 1.612 ± 0.353
CysTrp: 0.0 ± 0.0
CysTyr: 1.29 ± 0.948
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.225 ± 0.706
AspCys: 1.612 ± 0.206
AspAsp: 3.547 ± 0.23
AspGlu: 4.837 ± 1.618
AspPhe: 2.58 ± 1.337
AspGly: 2.58 ± 0.341
AspHis: 1.612 ± 0.912
AspIle: 2.257 ± 0.718
AspLys: 2.58 ± 0.9
AspLeu: 4.515 ± 0.242
AspMet: 1.935 ± 0.024
AspAsn: 1.612 ± 0.206
AspPro: 1.935 ± 0.583
AspGln: 2.257 ± 0.718
AspArg: 2.58 ± 0.341
AspSer: 2.902 ± 1.714
AspThr: 4.837 ± 0.619
AspVal: 7.739 ± 0.463
AspTrp: 0.322 ± 0.182
AspTyr: 1.612 ± 0.206
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.547 ± 2.007
GluCys: 0.967 ± 0.547
GluAsp: 1.935 ± 0.024
GluGlu: 4.192 ± 1.253
GluPhe: 3.225 ± 0.706
GluGly: 4.515 ± 0.317
GluHis: 0.322 ± 0.377
GluIle: 4.192 ± 0.694
GluLys: 4.515 ± 1.435
GluLeu: 5.16 ± 0.682
GluMet: 2.902 ± 0.036
GluAsn: 2.58 ± 0.9
GluPro: 2.902 ± 1.155
GluGln: 2.902 ± 0.595
GluArg: 3.225 ± 0.146
GluSer: 5.482 ± 2.542
GluThr: 2.58 ± 0.218
GluVal: 5.805 ± 0.072
GluTrp: 1.29 ± 0.73
GluTyr: 1.935 ± 1.095
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.29 ± 0.389
PheCys: 0.645 ± 0.365
PheAsp: 2.58 ± 0.341
PheGlu: 2.902 ± 0.523
PhePhe: 2.257 ± 0.718
PheGly: 1.612 ± 0.353
PheHis: 1.935 ± 0.024
PheIle: 0.967 ± 0.547
PheLys: 3.87 ± 0.511
PheLeu: 1.935 ± 0.024
PheMet: 1.612 ± 0.111
PheAsn: 1.29 ± 0.17
PhePro: 1.935 ± 1.143
PheGln: 3.547 ± 0.79
PheArg: 3.87 ± 0.607
PheSer: 2.58 ± 0.778
PheThr: 4.837 ± 0.619
PheVal: 2.902 ± 0.523
PheTrp: 0.0 ± 0.0
PheTyr: 2.58 ± 1.337
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.87 ± 0.607
GlyCys: 0.645 ± 0.194
GlyAsp: 3.547 ± 0.23
GlyGlu: 4.515 ± 0.317
GlyPhe: 0.967 ± 0.547
GlyGly: 2.902 ± 0.595
GlyHis: 0.967 ± 0.547
GlyIle: 3.225 ± 0.146
GlyLys: 2.902 ± 1.083
GlyLeu: 4.192 ± 0.134
GlyMet: 1.935 ± 0.024
GlyAsn: 2.58 ± 1.896
GlyPro: 3.225 ± 0.413
GlyGln: 2.58 ± 0.341
GlyArg: 2.257 ± 0.718
GlySer: 2.902 ± 0.036
GlyThr: 4.837 ± 0.619
GlyVal: 6.45 ± 0.826
GlyTrp: 0.322 ± 0.377
GlyTyr: 2.902 ± 1.642
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.967 ± 0.571
HisCys: 0.322 ± 0.182
HisAsp: 0.645 ± 0.365
HisGlu: 1.935 ± 0.535
HisPhe: 1.29 ± 0.17
HisGly: 1.29 ± 0.73
HisHis: 0.645 ± 0.194
HisIle: 3.225 ± 0.413
HisLys: 0.322 ± 0.182
HisLeu: 1.935 ± 0.535
HisMet: 1.29 ± 0.389
HisAsn: 2.257 ± 0.401
HisPro: 0.645 ± 0.194
HisGln: 0.322 ± 0.377
HisArg: 0.645 ± 0.365
HisSer: 1.612 ± 0.353
HisThr: 0.967 ± 0.012
HisVal: 1.612 ± 0.206
HisTrp: 0.0 ± 0.0
HisTyr: 0.322 ± 0.182
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.515 ± 1.435
IleCys: 1.612 ± 0.353
IleAsp: 4.192 ± 0.425
IleGlu: 4.837 ± 1.059
IlePhe: 0.967 ± 0.547
IleGly: 2.58 ± 0.778
IleHis: 1.612 ± 0.353
IleIle: 2.902 ± 0.036
IleLys: 1.29 ± 0.389
IleLeu: 3.87 ± 0.511
IleMet: 1.29 ± 0.389
IleAsn: 1.612 ± 0.206
IlePro: 3.87 ± 0.607
IleGln: 1.935 ± 0.024
IleArg: 2.902 ± 0.595
IleSer: 4.192 ± 1.543
IleThr: 2.902 ± 0.595
IleVal: 3.87 ± 1.167
IleTrp: 0.322 ± 0.182
IleTyr: 0.967 ± 0.012
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.257 ± 0.158
LysCys: 2.257 ± 1.277
LysAsp: 2.257 ± 0.718
LysGlu: 1.29 ± 0.17
LysPhe: 2.58 ± 0.218
LysGly: 1.935 ± 1.095
LysHis: 0.645 ± 0.365
LysIle: 2.58 ± 0.218
LysLys: 2.58 ± 0.9
LysLeu: 4.515 ± 0.876
LysMet: 1.935 ± 1.095
LysAsn: 2.902 ± 1.642
LysPro: 1.29 ± 0.17
LysGln: 2.58 ± 0.341
LysArg: 3.87 ± 1.63
LysSer: 2.902 ± 0.036
LysThr: 3.87 ± 0.511
LysVal: 3.87 ± 0.048
LysTrp: 0.322 ± 0.377
LysTyr: 1.935 ± 1.095
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.837 ± 1.059
LeuCys: 0.645 ± 0.365
LeuAsp: 6.772 ± 1.594
LeuGlu: 5.482 ± 1.983
LeuPhe: 3.225 ± 1.824
LeuGly: 4.192 ± 0.134
LeuHis: 2.257 ± 1.277
LeuIle: 4.837 ± 0.619
LeuLys: 3.225 ± 0.413
LeuLeu: 4.837 ± 0.499
LeuMet: 3.547 ± 0.433
LeuAsn: 6.127 ± 0.11
LeuPro: 2.902 ± 0.595
LeuGln: 2.902 ± 0.595
LeuArg: 4.192 ± 0.984
LeuSer: 7.739 ± 0.463
LeuThr: 2.58 ± 0.778
LeuVal: 3.547 ± 0.329
LeuTrp: 0.967 ± 0.547
LeuTyr: 1.29 ± 0.73
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.225 ± 0.706
MetCys: 0.322 ± 0.182
MetAsp: 1.29 ± 0.389
MetGlu: 1.612 ± 0.353
MetPhe: 0.967 ± 0.012
MetGly: 1.612 ± 0.206
MetHis: 0.967 ± 0.547
MetIle: 3.225 ± 0.706
MetLys: 2.902 ± 0.523
MetLeu: 2.58 ± 0.341
MetMet: 1.612 ± 0.353
MetAsn: 1.935 ± 0.024
MetPro: 1.935 ± 0.024
MetGln: 1.29 ± 0.389
MetArg: 1.29 ± 0.73
MetSer: 3.547 ± 0.329
MetThr: 2.257 ± 0.158
MetVal: 2.58 ± 1.337
MetTrp: 0.645 ± 0.194
MetTyr: 2.58 ± 0.341
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.515 ± 0.876
AsnCys: 0.645 ± 0.194
AsnAsp: 3.547 ± 0.888
AsnGlu: 3.547 ± 0.329
AsnPhe: 2.257 ± 1.519
AsnGly: 2.257 ± 0.718
AsnHis: 0.645 ± 0.194
AsnIle: 2.902 ± 0.595
AsnLys: 1.612 ± 0.206
AsnLeu: 2.257 ± 0.96
AsnMet: 2.902 ± 0.036
AsnAsn: 1.612 ± 0.206
AsnPro: 3.225 ± 0.972
AsnGln: 2.902 ± 0.036
AsnArg: 3.87 ± 0.511
AsnSer: 3.547 ± 1.908
AsnThr: 3.225 ± 1.531
AsnVal: 3.87 ± 1.726
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.645 ± 0.194
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.935 ± 0.583
ProCys: 1.612 ± 0.353
ProAsp: 1.612 ± 0.766
ProGlu: 3.547 ± 0.329
ProPhe: 2.58 ± 2.456
ProGly: 2.58 ± 0.218
ProHis: 0.0 ± 0.0
ProIle: 1.612 ± 1.325
ProLys: 3.547 ± 0.888
ProLeu: 5.805 ± 0.631
ProMet: 1.612 ± 0.353
ProAsn: 1.29 ± 0.948
ProPro: 1.612 ± 0.766
ProGln: 1.29 ± 0.948
ProArg: 2.58 ± 0.778
ProSer: 4.192 ± 0.425
ProThr: 3.547 ± 0.329
ProVal: 2.257 ± 0.158
ProTrp: 1.29 ± 0.948
ProTyr: 0.967 ± 0.571
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.612 ± 0.206
GlnCys: 0.967 ± 0.547
GlnAsp: 1.612 ± 0.206
GlnGlu: 3.547 ± 0.888
GlnPhe: 2.58 ± 0.218
GlnGly: 3.225 ± 1.531
GlnHis: 0.322 ± 0.377
GlnIle: 3.225 ± 0.146
GlnLys: 1.935 ± 0.024
GlnLeu: 3.225 ± 0.706
GlnMet: 1.612 ± 0.766
GlnAsn: 1.612 ± 0.206
GlnPro: 1.612 ± 0.766
GlnGln: 2.257 ± 0.158
GlnArg: 0.967 ± 0.547
GlnSer: 2.257 ± 0.401
GlnThr: 0.967 ± 0.571
GlnVal: 3.547 ± 0.329
GlnTrp: 0.645 ± 0.365
GlnTyr: 0.322 ± 0.377
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.935 ± 1.702
ArgCys: 1.935 ± 0.535
ArgAsp: 4.515 ± 0.242
ArgGlu: 5.482 ± 1.423
ArgPhe: 3.225 ± 0.146
ArgGly: 2.902 ± 0.595
ArgHis: 0.967 ± 0.012
ArgIle: 2.257 ± 0.718
ArgLys: 2.257 ± 0.718
ArgLeu: 5.805 ± 1.047
ArgMet: 2.902 ± 0.523
ArgAsn: 2.902 ± 0.523
ArgPro: 3.87 ± 0.607
ArgGln: 1.29 ± 0.17
ArgArg: 4.515 ± 0.242
ArgSer: 2.257 ± 0.401
ArgThr: 5.16 ± 1.555
ArgVal: 4.515 ± 1.435
ArgTrp: 0.967 ± 0.547
ArgTyr: 1.935 ± 0.024
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.805 ± 0.072
SerCys: 0.322 ± 0.377
SerAsp: 2.257 ± 0.718
SerGlu: 3.547 ± 0.888
SerPhe: 4.192 ± 1.543
SerGly: 7.417 ± 0.281
SerHis: 1.612 ± 0.206
SerIle: 2.58 ± 1.337
SerLys: 1.935 ± 1.095
SerLeu: 6.45 ± 0.852
SerMet: 2.58 ± 0.341
SerAsn: 3.225 ± 2.091
SerPro: 1.935 ± 0.024
SerGln: 2.257 ± 0.401
SerArg: 4.515 ± 0.802
SerSer: 5.482 ± 1.373
SerThr: 4.837 ± 2.297
SerVal: 6.45 ± 0.826
SerTrp: 0.0 ± 0.0
SerTyr: 2.902 ± 0.595
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.547 ± 0.329
ThrCys: 0.967 ± 0.012
ThrAsp: 4.192 ± 0.984
ThrGlu: 2.257 ± 0.96
ThrPhe: 1.935 ± 0.024
ThrGly: 2.902 ± 1.155
ThrHis: 2.257 ± 0.96
ThrIle: 3.547 ± 0.329
ThrLys: 2.902 ± 0.036
ThrLeu: 3.547 ± 0.329
ThrMet: 2.902 ± 0.523
ThrAsn: 4.515 ± 2.48
ThrPro: 2.58 ± 0.341
ThrGln: 1.612 ± 0.766
ThrArg: 4.837 ± 0.619
ThrSer: 4.515 ± 2.48
ThrThr: 4.515 ± 2.48
ThrVal: 5.805 ± 2.309
ThrTrp: 1.29 ± 0.389
ThrTyr: 3.225 ± 0.146
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.45 ± 0.826
ValCys: 1.612 ± 0.912
ValAsp: 3.225 ± 1.531
ValGlu: 6.45 ± 1.944
ValPhe: 3.87 ± 0.511
ValGly: 2.902 ± 0.036
ValHis: 2.58 ± 0.341
ValIle: 3.225 ± 0.972
ValLys: 4.192 ± 1.812
ValLeu: 6.127 ± 1.229
ValMet: 1.935 ± 0.024
ValAsn: 5.805 ± 1.191
ValPro: 3.225 ± 1.531
ValGln: 1.612 ± 0.912
ValArg: 5.805 ± 0.487
ValSer: 6.45 ± 0.266
ValThr: 4.192 ± 0.984
ValVal: 4.515 ± 1.92
ValTrp: 1.29 ± 0.17
ValTyr: 2.58 ± 0.341
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.322 ± 0.377
TrpCys: 0.0 ± 0.0
TrpAsp: 0.645 ± 0.365
TrpGlu: 0.322 ± 0.182
TrpPhe: 0.645 ± 0.365
TrpGly: 0.645 ± 0.365
TrpHis: 0.0 ± 0.0
TrpIle: 0.322 ± 0.377
TrpLys: 0.645 ± 0.365
TrpLeu: 1.612 ± 0.353
TrpMet: 0.645 ± 0.194
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.322 ± 0.182
TrpArg: 1.29 ± 0.17
TrpSer: 0.967 ± 0.571
TrpThr: 1.935 ± 0.535
TrpVal: 0.645 ± 0.365
TrpTrp: 0.322 ± 0.377
TrpTyr: 0.322 ± 0.377
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.902 ± 0.595
TyrCys: 1.29 ± 0.389
TyrAsp: 3.87 ± 1.071
TyrGlu: 1.612 ± 0.353
TyrPhe: 1.29 ± 0.17
TyrGly: 3.225 ± 1.265
TyrHis: 0.645 ± 0.754
TyrIle: 0.645 ± 0.194
TyrLys: 1.29 ± 0.17
TyrLeu: 1.612 ± 0.206
TyrMet: 1.29 ± 0.17
TyrAsn: 1.935 ± 0.583
TyrPro: 1.29 ± 0.73
TyrGln: 1.29 ± 0.73
TyrArg: 2.257 ± 0.401
TyrSer: 1.29 ± 0.389
TyrThr: 1.29 ± 0.17
TyrVal: 2.257 ± 0.158
TyrTrp: 1.29 ± 0.73
TyrTyr: 0.967 ± 0.547
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (3102 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
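A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, and the use of the sample standard deviation as the error measure are all hypothetical, since the page does not specify the exact estimator.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs in `proteins`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the proteins from this page).
toy = ["AAAA", "ACAC", "CCAA"]
print(pair_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only a handful of proteins, as here, each replicate can take few distinct values, so the resulting error is a rough indication rather than a precise estimate.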

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski