Amino acid dipepetide frequency for Beihai picorna-like virus 100

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.527AlaAla: 3.527 ± 0.22
0.0AlaCys: 0.0 ± 0.0
3.175AlaAsp: 3.175 ± 0.252
1.764AlaGlu: 1.764 ± 0.431
3.527AlaPhe: 3.527 ± 3.428
4.233AlaGly: 4.233 ± 0.122
1.764AlaHis: 1.764 ± 0.853
2.822AlaIle: 2.822 ± 0.081
2.469AlaLys: 2.469 ± 1.194
4.938AlaLeu: 4.938 ± 0.821
1.058AlaMet: 1.058 ± 0.13
2.822AlaAsn: 2.822 ± 0.561
2.469AlaPro: 2.469 ± 0.731
3.527AlaGln: 3.527 ± 0.22
1.058AlaArg: 1.058 ± 0.512
5.996AlaSer: 5.996 ± 1.593
2.469AlaThr: 2.469 ± 0.09
3.527AlaVal: 3.527 ± 0.422
0.705AlaTrp: 0.705 ± 0.341
2.469AlaTyr: 2.469 ± 0.09
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.705CysAsp: 0.705 ± 0.301
0.353CysGlu: 0.353 ± 0.171
0.353CysPhe: 0.353 ± 0.171
2.116CysGly: 2.116 ± 1.023
0.353CysHis: 0.353 ± 0.471
0.353CysIle: 0.353 ± 0.171
0.705CysLys: 0.705 ± 0.341
1.411CysLeu: 1.411 ± 0.682
0.353CysMet: 0.353 ± 0.171
0.705CysAsn: 0.705 ± 0.301
0.0CysPro: 0.0 ± 0.0
0.705CysGln: 0.705 ± 0.341
0.0CysArg: 0.0 ± 0.0
1.411CysSer: 1.411 ± 0.041
0.705CysThr: 0.705 ± 0.341
1.411CysVal: 1.411 ± 0.041
0.705CysTrp: 0.705 ± 0.341
0.705CysTyr: 0.705 ± 0.341
0.0CysXaa: 0.0 ± 0.0
Asp
3.175AspAla: 3.175 ± 0.39
0.705AspCys: 0.705 ± 0.341
4.938AspAsp: 4.938 ± 1.104
3.88AspGlu: 3.88 ± 1.234
1.411AspPhe: 1.411 ± 0.682
2.469AspGly: 2.469 ± 0.552
1.764AspHis: 1.764 ± 0.853
8.818AspIle: 8.818 ± 0.414
3.527AspLys: 3.527 ± 1.064
3.527AspLeu: 3.527 ± 0.861
1.411AspMet: 1.411 ± 0.041
4.233AspAsn: 4.233 ± 0.52
2.822AspPro: 2.822 ± 1.202
2.469AspGln: 2.469 ± 0.09
1.058AspArg: 1.058 ± 0.772
2.822AspSer: 2.822 ± 0.561
3.527AspThr: 3.527 ± 0.861
3.527AspVal: 3.527 ± 0.422
0.705AspTrp: 0.705 ± 0.341
5.291AspTyr: 5.291 ± 0.633
0.0AspXaa: 0.0 ± 0.0
Glu
4.233GluAla: 4.233 ± 0.52
1.411GluCys: 1.411 ± 0.041
2.469GluAsp: 2.469 ± 1.194
1.411GluGlu: 1.411 ± 0.682
3.175GluPhe: 3.175 ± 0.39
3.175GluGly: 3.175 ± 0.893
0.353GluHis: 0.353 ± 0.171
2.469GluIle: 2.469 ± 0.552
3.175GluLys: 3.175 ± 0.893
5.644GluLeu: 5.644 ± 0.162
1.764GluMet: 1.764 ± 0.323
1.764GluAsn: 1.764 ± 0.211
2.116GluPro: 2.116 ± 1.544
0.705GluGln: 0.705 ± 0.942
2.469GluArg: 2.469 ± 0.731
7.407GluSer: 7.407 ± 2.298
2.469GluThr: 2.469 ± 0.09
4.938GluVal: 4.938 ± 1.104
0.353GluTrp: 0.353 ± 0.171
2.822GluTyr: 2.822 ± 1.202
0.0GluXaa: 0.0 ± 0.0
Phe
2.116PheAla: 2.116 ± 0.902
0.705PheCys: 0.705 ± 0.341
3.527PheAsp: 3.527 ± 0.422
4.233PheGlu: 4.233 ± 1.405
2.822PhePhe: 2.822 ± 0.723
3.175PheGly: 3.175 ± 1.535
1.411PheHis: 1.411 ± 1.885
3.175PheIle: 3.175 ± 1.535
1.764PheLys: 1.764 ± 0.431
2.469PheLeu: 2.469 ± 1.373
0.353PheMet: 0.353 ± 0.171
4.233PheAsn: 4.233 ± 1.162
2.116PhePro: 2.116 ± 2.185
1.411PheGln: 1.411 ± 0.041
1.764PheArg: 1.764 ± 0.431
2.822PheSer: 2.822 ± 1.844
3.175PheThr: 3.175 ± 0.893
4.586PheVal: 4.586 ± 0.35
1.058PheTrp: 1.058 ± 0.772
1.411PheTyr: 1.411 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
1.411GlyAla: 1.411 ± 0.601
1.764GlyCys: 1.764 ± 0.853
3.527GlyAsp: 3.527 ± 1.064
2.822GlyGlu: 2.822 ± 0.723
1.764GlyPhe: 1.764 ± 0.211
2.822GlyGly: 2.822 ± 0.561
0.353GlyHis: 0.353 ± 0.171
4.586GlyIle: 4.586 ± 0.934
4.586GlyLys: 4.586 ± 0.35
2.469GlyLeu: 2.469 ± 0.552
1.058GlyMet: 1.058 ± 0.512
2.116GlyAsn: 2.116 ± 0.26
2.469GlyPro: 2.469 ± 0.09
0.705GlyGln: 0.705 ± 0.341
1.764GlyArg: 1.764 ± 0.211
4.233GlySer: 4.233 ± 0.52
3.88GlyThr: 3.88 ± 1.974
3.527GlyVal: 3.527 ± 1.706
0.0GlyTrp: 0.0 ± 0.0
2.822GlyTyr: 2.822 ± 0.081
0.0GlyXaa: 0.0 ± 0.0
His
0.353HisAla: 0.353 ± 0.171
0.0HisCys: 0.0 ± 0.0
1.058HisAsp: 1.058 ± 0.13
1.058HisGlu: 1.058 ± 0.13
1.411HisPhe: 1.411 ± 0.601
0.705HisGly: 0.705 ± 0.341
0.705HisHis: 0.705 ± 0.341
1.764HisIle: 1.764 ± 0.431
1.411HisLys: 1.411 ± 0.682
1.764HisLeu: 1.764 ± 0.853
0.353HisMet: 0.353 ± 0.171
0.353HisAsn: 0.353 ± 0.171
0.705HisPro: 0.705 ± 0.341
0.353HisGln: 0.353 ± 0.171
0.705HisArg: 0.705 ± 0.341
2.116HisSer: 2.116 ± 1.544
0.705HisThr: 0.705 ± 0.341
1.058HisVal: 1.058 ± 0.13
0.0HisTrp: 0.0 ± 0.0
1.058HisTyr: 1.058 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
3.175IleAla: 3.175 ± 1.535
1.058IleCys: 1.058 ± 0.13
5.291IleAsp: 5.291 ± 1.917
2.469IleGlu: 2.469 ± 0.552
2.822IlePhe: 2.822 ± 0.081
2.822IleGly: 2.822 ± 0.081
1.764IleHis: 1.764 ± 0.211
6.702IleIle: 6.702 ± 1.315
8.113IleLys: 8.113 ± 3.923
4.938IleLeu: 4.938 ± 0.463
1.058IleMet: 1.058 ± 0.512
4.233IleAsn: 4.233 ± 1.405
4.233IlePro: 4.233 ± 1.162
2.116IleGln: 2.116 ± 0.26
4.233IleArg: 4.233 ± 1.405
8.113IleSer: 8.113 ± 1.853
4.586IleThr: 4.586 ± 0.934
4.938IleVal: 4.938 ± 1.746
1.411IleTrp: 1.411 ± 0.601
3.88IleTyr: 3.88 ± 2.616
0.0IleXaa: 0.0 ± 0.0
Lys
2.822LysAla: 2.822 ± 0.561
0.705LysCys: 0.705 ± 0.301
4.938LysAsp: 4.938 ± 1.104
3.88LysGlu: 3.88 ± 0.049
3.88LysPhe: 3.88 ± 0.593
1.058LysGly: 1.058 ± 0.512
1.058LysHis: 1.058 ± 0.13
4.938LysIle: 4.938 ± 1.746
4.938LysLys: 4.938 ± 1.746
5.996LysLeu: 5.996 ± 1.593
1.411LysMet: 1.411 ± 0.682
3.175LysAsn: 3.175 ± 0.893
2.822LysPro: 2.822 ± 0.723
1.764LysGln: 1.764 ± 0.211
3.175LysArg: 3.175 ± 1.535
3.175LysSer: 3.175 ± 1.535
3.88LysThr: 3.88 ± 1.876
3.527LysVal: 3.527 ± 0.22
0.0LysTrp: 0.0 ± 0.0
3.527LysTyr: 3.527 ± 1.064
0.0LysXaa: 0.0 ± 0.0
Leu
7.055LeuAla: 7.055 ± 3.006
1.058LeuCys: 1.058 ± 0.512
4.938LeuAsp: 4.938 ± 0.463
3.527LeuGlu: 3.527 ± 1.064
2.116LeuPhe: 2.116 ± 0.26
3.527LeuGly: 3.527 ± 1.064
1.764LeuHis: 1.764 ± 0.431
5.996LeuIle: 5.996 ± 0.309
5.644LeuLys: 5.644 ± 0.162
7.76LeuLeu: 7.76 ± 1.185
1.411LeuMet: 1.411 ± 0.041
7.055LeuAsn: 7.055 ± 0.439
3.88LeuPro: 3.88 ± 0.593
1.764LeuGln: 1.764 ± 1.714
3.527LeuArg: 3.527 ± 0.422
6.702LeuSer: 6.702 ± 0.61
3.88LeuThr: 3.88 ± 0.049
5.291LeuVal: 5.291 ± 0.008
0.0LeuTrp: 0.0 ± 0.0
2.469LeuTyr: 2.469 ± 0.09
0.0LeuXaa: 0.0 ± 0.0
Met
1.411MetAla: 1.411 ± 0.682
0.353MetCys: 0.353 ± 0.171
0.705MetAsp: 0.705 ± 0.341
0.705MetGlu: 0.705 ± 0.341
0.353MetPhe: 0.353 ± 0.171
1.411MetGly: 1.411 ± 0.041
0.0MetHis: 0.0 ± 0.0
1.411MetIle: 1.411 ± 0.041
3.175MetLys: 3.175 ± 1.535
0.705MetLeu: 0.705 ± 0.301
0.705MetMet: 0.705 ± 0.341
1.764MetAsn: 1.764 ± 0.431
0.353MetPro: 0.353 ± 0.171
1.058MetGln: 1.058 ± 0.13
2.469MetArg: 2.469 ± 0.552
3.175MetSer: 3.175 ± 0.252
0.705MetThr: 0.705 ± 0.301
1.058MetVal: 1.058 ± 0.512
0.353MetTrp: 0.353 ± 0.171
0.705MetTyr: 0.705 ± 0.341
0.0MetXaa: 0.0 ± 0.0
Asn
2.469AsnAla: 2.469 ± 0.552
0.353AsnCys: 0.353 ± 0.171
3.88AsnAsp: 3.88 ± 0.049
3.175AsnGlu: 3.175 ± 0.252
2.116AsnPhe: 2.116 ± 0.382
3.527AsnGly: 3.527 ± 1.503
0.353AsnHis: 0.353 ± 0.171
5.291AsnIle: 5.291 ± 0.633
2.116AsnLys: 2.116 ± 0.902
4.938AsnLeu: 4.938 ± 1.104
3.88AsnMet: 3.88 ± 0.691
5.996AsnAsn: 5.996 ± 0.951
5.996AsnPro: 5.996 ± 0.309
2.116AsnGln: 2.116 ± 0.382
2.822AsnArg: 2.822 ± 0.723
4.938AsnSer: 4.938 ± 1.462
2.469AsnThr: 2.469 ± 2.015
3.88AsnVal: 3.88 ± 1.974
1.058AsnTrp: 1.058 ± 0.512
1.411AsnTyr: 1.411 ± 0.041
0.0AsnXaa: 0.0 ± 0.0
Pro
1.058ProAla: 1.058 ± 0.512
0.353ProCys: 0.353 ± 0.171
3.88ProAsp: 3.88 ± 2.616
2.822ProGlu: 2.822 ± 0.081
3.527ProPhe: 3.527 ± 0.861
2.469ProGly: 2.469 ± 0.552
1.764ProHis: 1.764 ± 0.431
3.527ProIle: 3.527 ± 0.22
3.175ProLys: 3.175 ± 0.252
5.291ProLeu: 5.291 ± 3.217
1.411ProMet: 1.411 ± 0.307
2.822ProAsn: 2.822 ± 0.081
1.411ProPro: 1.411 ± 0.041
1.058ProGln: 1.058 ± 0.772
2.469ProArg: 2.469 ± 0.552
2.116ProSer: 2.116 ± 0.382
3.88ProThr: 3.88 ± 1.332
2.469ProVal: 2.469 ± 0.731
1.058ProTrp: 1.058 ± 0.13
4.938ProTyr: 4.938 ± 0.821
0.0ProXaa: 0.0 ± 0.0
Gln
2.116GlnAla: 2.116 ± 0.26
0.0GlnCys: 0.0 ± 0.0
1.411GlnAsp: 1.411 ± 0.682
2.469GlnGlu: 2.469 ± 0.09
2.116GlnPhe: 2.116 ± 0.902
0.705GlnGly: 0.705 ± 0.301
0.0GlnHis: 0.0 ± 0.0
3.175GlnIle: 3.175 ± 0.252
1.411GlnLys: 1.411 ± 0.041
2.469GlnLeu: 2.469 ± 0.731
1.411GlnMet: 1.411 ± 0.682
2.469GlnAsn: 2.469 ± 0.09
2.822GlnPro: 2.822 ± 1.202
1.411GlnGln: 1.411 ± 0.041
2.116GlnArg: 2.116 ± 0.26
3.175GlnSer: 3.175 ± 0.252
1.411GlnThr: 1.411 ± 0.601
1.411GlnVal: 1.411 ± 0.041
0.0GlnTrp: 0.0 ± 0.0
1.764GlnTyr: 1.764 ± 0.853
0.0GlnXaa: 0.0 ± 0.0
Arg
1.764ArgAla: 1.764 ± 0.211
0.705ArgCys: 0.705 ± 0.341
4.233ArgAsp: 4.233 ± 0.52
1.764ArgGlu: 1.764 ± 0.853
2.116ArgPhe: 2.116 ± 0.26
1.764ArgGly: 1.764 ± 0.211
0.705ArgHis: 0.705 ± 0.341
5.291ArgIle: 5.291 ± 0.633
2.469ArgLys: 2.469 ± 1.194
4.233ArgLeu: 4.233 ± 0.122
1.764ArgMet: 1.764 ± 0.211
3.175ArgAsn: 3.175 ± 0.893
2.116ArgPro: 2.116 ± 0.382
1.764ArgGln: 1.764 ± 0.211
0.705ArgArg: 0.705 ± 0.341
2.822ArgSer: 2.822 ± 0.723
2.469ArgThr: 2.469 ± 0.731
3.527ArgVal: 3.527 ± 0.422
0.0ArgTrp: 0.0 ± 0.0
1.058ArgTyr: 1.058 ± 0.512
0.0ArgXaa: 0.0 ± 0.0
Ser
7.055SerAla: 7.055 ± 1.081
0.705SerCys: 0.705 ± 0.341
3.88SerAsp: 3.88 ± 1.332
5.291SerGlu: 5.291 ± 0.008
2.469SerPhe: 2.469 ± 0.731
3.175SerGly: 3.175 ± 0.252
2.116SerHis: 2.116 ± 0.382
5.644SerIle: 5.644 ± 0.162
2.469SerLys: 2.469 ± 0.09
7.76SerLeu: 7.76 ± 1.185
0.705SerMet: 0.705 ± 0.341
5.644SerAsn: 5.644 ± 1.763
3.175SerPro: 3.175 ± 0.893
3.88SerGln: 3.88 ± 0.593
3.175SerArg: 3.175 ± 1.032
7.055SerSer: 7.055 ± 4.289
4.586SerThr: 4.586 ± 0.991
3.88SerVal: 3.88 ± 1.974
1.411SerTrp: 1.411 ± 0.041
8.818SerTyr: 8.818 ± 0.87
0.0SerXaa: 0.0 ± 0.0
Thr
2.822ThrAla: 2.822 ± 0.081
0.705ThrCys: 0.705 ± 0.301
2.116ThrAsp: 2.116 ± 0.902
4.938ThrGlu: 4.938 ± 0.179
4.233ThrPhe: 4.233 ± 0.52
3.175ThrGly: 3.175 ± 1.032
1.058ThrHis: 1.058 ± 0.512
3.527ThrIle: 3.527 ± 0.861
4.233ThrLys: 4.233 ± 0.122
3.88ThrLeu: 3.88 ± 1.234
0.353ThrMet: 0.353 ± 0.171
3.88ThrAsn: 3.88 ± 1.974
4.586ThrPro: 4.586 ± 0.991
1.764ThrGln: 1.764 ± 0.853
1.058ThrArg: 1.058 ± 0.512
4.233ThrSer: 4.233 ± 1.804
4.233ThrThr: 4.233 ± 0.122
4.233ThrVal: 4.233 ± 0.52
0.353ThrTrp: 0.353 ± 0.171
2.469ThrTyr: 2.469 ± 0.09
0.0ThrXaa: 0.0 ± 0.0
Val
3.527ValAla: 3.527 ± 0.861
0.705ValCys: 0.705 ± 0.341
3.175ValAsp: 3.175 ± 0.252
3.88ValGlu: 3.88 ± 0.691
2.469ValPhe: 2.469 ± 0.552
2.469ValGly: 2.469 ± 0.552
0.0ValHis: 0.0 ± 0.0
4.233ValIle: 4.233 ± 1.405
2.469ValLys: 2.469 ± 0.09
3.175ValLeu: 3.175 ± 0.893
0.705ValMet: 0.705 ± 0.341
4.233ValAsn: 4.233 ± 0.52
4.586ValPro: 4.586 ± 0.991
3.88ValGln: 3.88 ± 1.332
6.349ValArg: 6.349 ± 2.428
7.407ValSer: 7.407 ± 0.91
4.233ValThr: 4.233 ± 0.763
3.175ValVal: 3.175 ± 0.39
0.0ValTrp: 0.0 ± 0.0
3.175ValTyr: 3.175 ± 1.674
0.0ValXaa: 0.0 ± 0.0
Trp
1.058TrpAla: 1.058 ± 0.13
0.353TrpCys: 0.353 ± 0.171
0.705TrpAsp: 0.705 ± 0.341
0.0TrpGlu: 0.0 ± 0.0
1.058TrpPhe: 1.058 ± 0.13
0.353TrpGly: 0.353 ± 0.171
0.353TrpHis: 0.353 ± 0.171
1.764TrpIle: 1.764 ± 0.211
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.353TrpAsn: 0.353 ± 0.471
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.705TrpArg: 0.705 ± 0.301
0.705TrpSer: 0.705 ± 0.341
0.705TrpThr: 0.705 ± 0.301
1.058TrpVal: 1.058 ± 0.512
0.0TrpTrp: 0.0 ± 0.0
0.353TrpTyr: 0.353 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.175TyrAla: 3.175 ± 0.252
1.411TyrCys: 1.411 ± 0.041
3.527TyrAsp: 3.527 ± 0.861
4.233TyrGlu: 4.233 ± 2.445
4.586TyrPhe: 4.586 ± 0.292
3.527TyrGly: 3.527 ± 1.503
0.0TyrHis: 0.0 ± 0.0
2.469TyrIle: 2.469 ± 1.194
2.822TyrLys: 2.822 ± 0.081
5.996TyrLeu: 5.996 ± 0.951
1.058TyrMet: 1.058 ± 0.512
1.764TyrAsn: 1.764 ± 0.853
2.822TyrPro: 2.822 ± 1.202
1.411TyrGln: 1.411 ± 0.682
2.822TyrArg: 2.822 ± 0.081
2.469TyrSer: 2.469 ± 1.194
3.88TyrThr: 3.88 ± 1.332
2.822TyrVal: 2.822 ± 0.561
0.353TyrTrp: 0.353 ± 0.171
3.175TyrTyr: 3.175 ± 1.032
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2836 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski