Amino acid dipepetide frequency for Beihai picorna-like virus 80

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.733AlaAla: 3.733 ± 0.058
1.659AlaCys: 1.659 ± 0.495
3.318AlaAsp: 3.318 ± 0.287
2.489AlaGlu: 2.489 ± 0.039
4.148AlaPhe: 4.148 ± 1.239
6.221AlaGly: 6.221 ± 2.016
0.415AlaHis: 0.415 ± 0.476
4.562AlaIle: 4.562 ± 0.306
3.733AlaLys: 3.733 ± 0.763
4.562AlaLeu: 4.562 ± 0.398
1.659AlaMet: 1.659 ± 0.495
2.074AlaAsn: 2.074 ± 0.971
3.733AlaPro: 3.733 ± 2.171
2.489AlaGln: 2.489 ± 0.039
2.903AlaArg: 2.903 ± 0.189
5.807AlaSer: 5.807 ± 1.734
4.562AlaThr: 4.562 ± 2.419
4.562AlaVal: 4.562 ± 0.398
0.83AlaTrp: 0.83 ± 0.457
2.074AlaTyr: 2.074 ± 0.971
0.0AlaXaa: 0.0 ± 0.0
Cys
2.074CysAla: 2.074 ± 0.267
0.415CysCys: 0.415 ± 0.228
3.318CysAsp: 3.318 ± 0.418
0.415CysGlu: 0.415 ± 0.228
1.659CysPhe: 1.659 ± 0.913
0.83CysGly: 0.83 ± 0.457
0.415CysHis: 0.415 ± 0.228
0.83CysIle: 0.83 ± 0.457
0.83CysLys: 0.83 ± 0.457
0.83CysLeu: 0.83 ± 0.457
0.0CysMet: 0.0 ± 0.0
2.074CysAsn: 2.074 ± 0.267
0.83CysPro: 0.83 ± 0.457
0.83CysGln: 0.83 ± 0.248
0.83CysArg: 0.83 ± 0.457
2.489CysSer: 2.489 ± 0.665
0.415CysThr: 0.415 ± 0.476
2.074CysVal: 2.074 ± 0.437
0.415CysTrp: 0.415 ± 0.228
1.244CysTyr: 1.244 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
5.392AspAla: 5.392 ± 0.151
3.733AspCys: 3.733 ± 2.055
6.221AspAsp: 6.221 ± 2.016
2.903AspGlu: 2.903 ± 0.515
6.221AspPhe: 6.221 ± 0.607
3.733AspGly: 3.733 ± 0.646
0.415AspHis: 0.415 ± 0.228
5.807AspIle: 5.807 ± 1.03
2.903AspLys: 2.903 ± 1.598
6.636AspLeu: 6.636 ± 0.835
1.244AspMet: 1.244 ± 0.019
3.318AspAsn: 3.318 ± 0.287
2.074AspPro: 2.074 ± 0.971
1.244AspGln: 1.244 ± 0.019
2.074AspArg: 2.074 ± 1.141
4.562AspSer: 4.562 ± 1.807
1.659AspThr: 1.659 ± 0.913
6.221AspVal: 6.221 ± 0.801
0.415AspTrp: 0.415 ± 0.228
2.489AspTyr: 2.489 ± 1.37
0.0AspXaa: 0.0 ± 0.0
Glu
4.148GluAla: 4.148 ± 0.874
1.244GluCys: 1.244 ± 0.685
2.489GluAsp: 2.489 ± 0.665
2.074GluGlu: 2.074 ± 1.141
2.074GluPhe: 2.074 ± 1.141
2.489GluGly: 2.489 ± 1.37
0.83GluHis: 0.83 ± 0.457
3.733GluIle: 3.733 ± 0.646
0.83GluLys: 0.83 ± 0.248
5.807GluLeu: 5.807 ± 1.03
0.83GluMet: 0.83 ± 0.248
3.318GluAsn: 3.318 ± 1.122
2.074GluPro: 2.074 ± 0.437
1.244GluGln: 1.244 ± 0.724
2.903GluArg: 2.903 ± 0.894
2.903GluSer: 2.903 ± 0.894
1.659GluThr: 1.659 ± 0.913
3.318GluVal: 3.318 ± 1.122
0.415GluTrp: 0.415 ± 0.228
2.489GluTyr: 2.489 ± 0.665
0.0GluXaa: 0.0 ± 0.0
Phe
3.318PheAla: 3.318 ± 1.122
0.83PheCys: 0.83 ± 0.457
4.148PheAsp: 4.148 ± 0.874
3.733PheGlu: 3.733 ± 0.646
2.074PhePhe: 2.074 ± 0.971
3.318PheGly: 3.318 ± 0.418
0.83PheHis: 0.83 ± 0.457
1.659PheIle: 1.659 ± 0.495
2.489PheLys: 2.489 ± 0.039
5.807PheLeu: 5.807 ± 1.788
0.83PheMet: 0.83 ± 0.248
3.733PheAsn: 3.733 ± 0.763
3.318PhePro: 3.318 ± 0.418
2.074PheGln: 2.074 ± 2.38
2.903PheArg: 2.903 ± 0.515
4.562PheSer: 4.562 ± 0.398
2.074PheThr: 2.074 ± 0.267
1.659PheVal: 1.659 ± 0.209
0.415PheTrp: 0.415 ± 0.228
1.659PheTyr: 1.659 ± 0.495
0.0PheXaa: 0.0 ± 0.0
Gly
3.318GlyAla: 3.318 ± 0.418
1.244GlyCys: 1.244 ± 0.685
7.881GlyAsp: 7.881 ± 1.297
3.733GlyGlu: 3.733 ± 0.646
3.318GlyPhe: 3.318 ± 0.991
5.392GlyGly: 5.392 ± 1.962
0.0GlyHis: 0.0 ± 0.0
4.148GlyIle: 4.148 ± 2.283
4.562GlyLys: 4.562 ± 1.807
4.562GlyLeu: 4.562 ± 0.398
0.83GlyMet: 0.83 ± 0.457
3.318GlyAsn: 3.318 ± 0.418
2.903GlyPro: 2.903 ± 0.515
0.83GlyGln: 0.83 ± 0.248
3.318GlyArg: 3.318 ± 1.122
6.636GlySer: 6.636 ± 1.277
5.807GlyThr: 5.807 ± 3.847
3.733GlyVal: 3.733 ± 0.763
0.83GlyTrp: 0.83 ± 0.248
2.489GlyTyr: 2.489 ± 0.743
0.0GlyXaa: 0.0 ± 0.0
His
1.244HisAla: 1.244 ± 0.019
0.415HisCys: 0.415 ± 0.228
0.0HisAsp: 0.0 ± 0.0
0.415HisGlu: 0.415 ± 0.228
0.415HisPhe: 0.415 ± 0.228
0.415HisGly: 0.415 ± 0.228
0.83HisHis: 0.83 ± 0.248
0.415HisIle: 0.415 ± 0.228
0.0HisLys: 0.0 ± 0.0
0.415HisLeu: 0.415 ± 0.228
0.415HisMet: 0.415 ± 0.476
0.415HisAsn: 0.415 ± 0.228
0.83HisPro: 0.83 ± 0.248
0.0HisGln: 0.0 ± 0.0
1.659HisArg: 1.659 ± 1.2
2.903HisSer: 2.903 ± 0.189
0.83HisThr: 0.83 ± 0.248
0.83HisVal: 0.83 ± 0.457
0.0HisTrp: 0.0 ± 0.0
0.83HisTyr: 0.83 ± 0.457
0.0HisXaa: 0.0 ± 0.0
Ile
4.148IleAla: 4.148 ± 0.874
0.83IleCys: 0.83 ± 0.248
4.977IleAsp: 4.977 ± 0.627
5.392IleGlu: 5.392 ± 0.855
2.489IlePhe: 2.489 ± 0.665
5.807IleGly: 5.807 ± 0.379
1.244IleHis: 1.244 ± 0.685
3.733IleIle: 3.733 ± 1.35
2.489IleLys: 2.489 ± 0.665
5.807IleLeu: 5.807 ± 0.379
0.83IleMet: 0.83 ± 0.248
3.318IleAsn: 3.318 ± 1.695
3.733IlePro: 3.733 ± 0.763
2.903IleGln: 2.903 ± 0.515
4.148IleArg: 4.148 ± 0.874
4.977IleSer: 4.977 ± 0.627
1.659IleThr: 1.659 ± 0.495
5.807IleVal: 5.807 ± 1.083
0.415IleTrp: 0.415 ± 0.476
1.659IleTyr: 1.659 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
1.244LysAla: 1.244 ± 0.685
0.83LysCys: 0.83 ± 0.457
1.659LysAsp: 1.659 ± 0.209
1.659LysGlu: 1.659 ± 0.209
3.318LysPhe: 3.318 ± 1.826
3.318LysGly: 3.318 ± 1.122
0.83LysHis: 0.83 ± 0.457
3.733LysIle: 3.733 ± 0.058
3.318LysLys: 3.318 ± 1.826
5.392LysLeu: 5.392 ± 2.968
0.83LysMet: 0.83 ± 0.457
2.489LysAsn: 2.489 ± 1.37
2.489LysPro: 2.489 ± 0.039
0.415LysGln: 0.415 ± 0.228
2.903LysArg: 2.903 ± 0.894
4.977LysSer: 4.977 ± 1.331
2.489LysThr: 2.489 ± 0.039
2.489LysVal: 2.489 ± 0.039
0.415LysTrp: 0.415 ± 0.228
2.489LysTyr: 2.489 ± 0.039
0.0LysXaa: 0.0 ± 0.0
Leu
7.881LeuAla: 7.881 ± 2.706
1.659LeuCys: 1.659 ± 0.209
6.636LeuAsp: 6.636 ± 2.948
3.318LeuGlu: 3.318 ± 1.122
1.659LeuPhe: 1.659 ± 0.209
4.977LeuGly: 4.977 ± 0.782
0.415LeuHis: 0.415 ± 0.228
7.051LeuIle: 7.051 ± 0.359
3.318LeuLys: 3.318 ± 1.826
5.392LeuLeu: 5.392 ± 1.559
0.415LeuMet: 0.415 ± 0.228
4.562LeuAsn: 4.562 ± 1.103
4.562LeuPro: 4.562 ± 0.398
2.489LeuGln: 2.489 ± 1.447
4.148LeuArg: 4.148 ± 0.17
8.295LeuSer: 8.295 ± 1.069
5.392LeuThr: 5.392 ± 0.855
6.221LeuVal: 6.221 ± 1.311
0.415LeuTrp: 0.415 ± 0.476
2.489LeuTyr: 2.489 ± 0.743
0.0LeuXaa: 0.0 ± 0.0
Met
1.659MetAla: 1.659 ± 0.913
0.415MetCys: 0.415 ± 0.228
1.659MetAsp: 1.659 ± 0.495
1.659MetGlu: 1.659 ± 0.209
0.415MetPhe: 0.415 ± 0.476
1.244MetGly: 1.244 ± 0.019
0.0MetHis: 0.0 ± 0.0
1.244MetIle: 1.244 ± 0.019
0.415MetLys: 0.415 ± 0.228
2.489MetLeu: 2.489 ± 1.447
0.83MetMet: 0.83 ± 0.952
1.659MetAsn: 1.659 ± 0.209
0.415MetPro: 0.415 ± 0.228
0.415MetGln: 0.415 ± 0.228
1.244MetArg: 1.244 ± 0.724
2.489MetSer: 2.489 ± 1.447
0.415MetThr: 0.415 ± 0.476
1.244MetVal: 1.244 ± 0.019
0.415MetTrp: 0.415 ± 0.228
1.244MetTyr: 1.244 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.733AsnAla: 3.733 ± 1.467
1.659AsnCys: 1.659 ± 0.495
2.903AsnAsp: 2.903 ± 1.598
2.074AsnGlu: 2.074 ± 0.437
1.659AsnPhe: 1.659 ± 0.913
3.733AsnGly: 3.733 ± 1.467
0.415AsnHis: 0.415 ± 0.228
3.318AsnIle: 3.318 ± 0.287
3.733AsnLys: 3.733 ± 0.646
3.318AsnLeu: 3.318 ± 0.418
2.074AsnMet: 2.074 ± 1.141
3.733AsnAsn: 3.733 ± 0.763
2.074AsnPro: 2.074 ± 0.267
2.074AsnGln: 2.074 ± 0.437
4.562AsnArg: 4.562 ± 0.398
4.562AsnSer: 4.562 ± 0.398
0.83AsnThr: 0.83 ± 0.457
3.318AsnVal: 3.318 ± 2.4
0.0AsnTrp: 0.0 ± 0.0
2.489AsnTyr: 2.489 ± 0.743
0.0AsnXaa: 0.0 ± 0.0
Pro
3.733ProAla: 3.733 ± 2.171
0.415ProCys: 0.415 ± 0.476
3.733ProAsp: 3.733 ± 0.646
1.659ProGlu: 1.659 ± 0.913
3.733ProPhe: 3.733 ± 1.467
3.318ProGly: 3.318 ± 0.418
0.0ProHis: 0.0 ± 0.0
3.318ProIle: 3.318 ± 1.122
1.659ProLys: 1.659 ± 0.913
2.903ProLeu: 2.903 ± 1.219
1.659ProMet: 1.659 ± 1.2
1.244ProAsn: 1.244 ± 0.685
2.903ProPro: 2.903 ± 1.923
2.903ProGln: 2.903 ± 1.219
2.074ProArg: 2.074 ± 1.141
3.318ProSer: 3.318 ± 0.991
2.489ProThr: 2.489 ± 2.856
5.392ProVal: 5.392 ± 1.258
0.83ProTrp: 0.83 ± 0.457
2.074ProTyr: 2.074 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
2.903GlnAla: 2.903 ± 1.923
1.244GlnCys: 1.244 ± 0.019
1.659GlnAsp: 1.659 ± 0.913
0.83GlnGlu: 0.83 ± 0.457
0.415GlnPhe: 0.415 ± 0.476
1.659GlnGly: 1.659 ± 0.495
0.415GlnHis: 0.415 ± 0.228
1.659GlnIle: 1.659 ± 0.209
0.415GlnLys: 0.415 ± 0.228
2.074GlnLeu: 2.074 ± 0.971
0.83GlnMet: 0.83 ± 0.248
1.659GlnAsn: 1.659 ± 1.904
2.074GlnPro: 2.074 ± 0.267
1.244GlnGln: 1.244 ± 0.724
0.0GlnArg: 0.0 ± 0.0
2.074GlnSer: 2.074 ± 0.437
1.244GlnThr: 1.244 ± 1.428
1.244GlnVal: 1.244 ± 1.428
0.83GlnTrp: 0.83 ± 0.248
1.659GlnTyr: 1.659 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
2.074ArgAla: 2.074 ± 0.267
0.83ArgCys: 0.83 ± 0.457
3.733ArgAsp: 3.733 ± 0.646
0.83ArgGlu: 0.83 ± 0.457
2.903ArgPhe: 2.903 ± 0.515
3.318ArgGly: 3.318 ± 1.695
1.244ArgHis: 1.244 ± 0.685
3.733ArgIle: 3.733 ± 2.055
5.392ArgLys: 5.392 ± 1.559
3.733ArgLeu: 3.733 ± 1.467
2.074ArgMet: 2.074 ± 0.207
2.489ArgAsn: 2.489 ± 1.37
1.244ArgPro: 1.244 ± 0.019
0.0ArgGln: 0.0 ± 0.0
4.562ArgArg: 4.562 ± 1.103
5.807ArgSer: 5.807 ± 2.492
3.318ArgThr: 3.318 ± 0.991
3.733ArgVal: 3.733 ± 0.763
0.83ArgTrp: 0.83 ± 0.457
2.489ArgTyr: 2.489 ± 0.743
0.0ArgXaa: 0.0 ± 0.0
Ser
5.392SerAla: 5.392 ± 1.258
0.83SerCys: 0.83 ± 0.248
5.807SerAsp: 5.807 ± 1.788
6.221SerGlu: 6.221 ± 2.72
5.807SerPhe: 5.807 ± 1.03
6.221SerGly: 6.221 ± 2.21
0.83SerHis: 0.83 ± 0.248
7.881SerIle: 7.881 ± 0.112
5.392SerLys: 5.392 ± 2.264
10.784SerLeu: 10.784 ± 2.414
0.83SerMet: 0.83 ± 0.952
3.733SerAsn: 3.733 ± 0.058
2.489SerPro: 2.489 ± 0.743
0.83SerGln: 0.83 ± 0.248
4.148SerArg: 4.148 ± 0.874
7.051SerSer: 7.051 ± 1.768
3.318SerThr: 3.318 ± 1.695
5.807SerVal: 5.807 ± 0.325
1.244SerTrp: 1.244 ± 0.019
4.148SerTyr: 4.148 ± 1.239
0.0SerXaa: 0.0 ± 0.0
Thr
2.903ThrAla: 2.903 ± 0.515
0.83ThrCys: 0.83 ± 0.457
3.318ThrAsp: 3.318 ± 0.991
2.074ThrGlu: 2.074 ± 0.267
1.659ThrPhe: 1.659 ± 0.495
4.148ThrGly: 4.148 ± 3.352
0.83ThrHis: 0.83 ± 0.952
4.148ThrIle: 4.148 ± 0.534
2.074ThrLys: 2.074 ± 1.141
2.903ThrLeu: 2.903 ± 1.219
2.074ThrMet: 2.074 ± 0.747
1.659ThrAsn: 1.659 ± 0.209
3.318ThrPro: 3.318 ± 2.4
0.83ThrGln: 0.83 ± 0.457
2.903ThrArg: 2.903 ± 0.189
5.807ThrSer: 5.807 ± 3.143
6.221ThrThr: 6.221 ± 2.914
2.903ThrVal: 2.903 ± 1.923
0.0ThrTrp: 0.0 ± 0.0
2.074ThrTyr: 2.074 ± 0.437
0.0ThrXaa: 0.0 ± 0.0
Val
4.148ValAla: 4.148 ± 1.239
1.659ValCys: 1.659 ± 0.495
4.562ValAsp: 4.562 ± 0.398
2.489ValGlu: 2.489 ± 0.039
4.148ValPhe: 4.148 ± 1.579
4.977ValGly: 4.977 ± 1.331
1.244ValHis: 1.244 ± 0.724
3.318ValIle: 3.318 ± 0.287
2.903ValLys: 2.903 ± 0.515
4.977ValLeu: 4.977 ± 0.627
0.415ValMet: 0.415 ± 0.228
4.977ValAsn: 4.977 ± 0.782
6.636ValPro: 6.636 ± 1.277
2.489ValGln: 2.489 ± 0.743
3.733ValArg: 3.733 ± 0.058
4.562ValSer: 4.562 ± 0.306
5.807ValThr: 5.807 ± 1.734
4.562ValVal: 4.562 ± 0.398
1.659ValTrp: 1.659 ± 1.2
1.659ValTyr: 1.659 ± 0.495
0.0ValXaa: 0.0 ± 0.0
Trp
0.415TrpAla: 0.415 ± 0.476
1.244TrpCys: 1.244 ± 0.685
0.0TrpAsp: 0.0 ± 0.0
0.415TrpGlu: 0.415 ± 0.228
0.415TrpPhe: 0.415 ± 0.228
2.074TrpGly: 2.074 ± 0.267
0.83TrpHis: 0.83 ± 0.457
0.0TrpIle: 0.0 ± 0.0
0.83TrpLys: 0.83 ± 0.457
0.0TrpLeu: 0.0 ± 0.0
1.244TrpMet: 1.244 ± 0.019
0.415TrpAsn: 0.415 ± 0.228
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.415TrpArg: 0.415 ± 0.228
1.244TrpSer: 1.244 ± 1.428
0.415TrpThr: 0.415 ± 0.228
1.659TrpVal: 1.659 ± 0.495
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.074TyrAla: 2.074 ± 1.676
0.83TyrCys: 0.83 ± 0.457
1.244TyrAsp: 1.244 ± 0.724
2.489TyrGlu: 2.489 ± 1.37
2.903TyrPhe: 2.903 ± 0.189
1.659TyrGly: 1.659 ± 0.209
1.244TyrHis: 1.244 ± 1.428
2.074TyrIle: 2.074 ± 0.971
0.0TyrLys: 0.0 ± 0.0
2.903TyrLeu: 2.903 ± 0.189
1.244TyrMet: 1.244 ± 0.019
2.074TyrAsn: 2.074 ± 0.267
1.659TyrPro: 1.659 ± 0.913
0.83TyrGln: 0.83 ± 0.248
3.318TyrArg: 3.318 ± 2.4
3.733TyrSer: 3.733 ± 2.055
2.074TyrThr: 2.074 ± 0.267
4.148TyrVal: 4.148 ± 0.534
1.244TyrTrp: 1.244 ± 0.019
1.659TyrTyr: 1.659 ± 0.913
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2412 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski