Amino acid dipeptide frequency for Sanxia picorna-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
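The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the use of overlapping windows within each protein, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: dipeptides are counted as overlapping windows of length 2
    inside each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse anything that is not a standard residue into X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))

    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome on this page.
    freqs = dipeptide_frequencies(["MKVLAAGS", "MAAGT"])
    print(len(freqs), round(freqs["AA"], 3))
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where absent dipeptides appear as 0.0.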

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
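As a small worked example of the unit conversion and of the comparison with the uniform expectation, the sketch below converts a table value to a fraction and labels it relative to 2.5‰ (1/400). The helper names and the simple greater-than or less-than rule are assumptions for illustration; the exact threshold the site uses for red and blue marking is not stated here.

```python
# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (hypothetical rule)."""
    if value_per_mille > expected:
        return "over"    # would be marked red in the table
    if value_per_mille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    # Values copied from the table: AlaSer = 8.466, CysCys = 0.353.
    for name, value in [("AlaSer", 8.466), ("CysCys", 0.353)]:
        print(name, per_mille_to_fraction(value), classify(value))
```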

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.175AlaAla: 3.175 ± 1.677
2.116AlaCys: 2.116 ± 1.084
2.116AlaAsp: 2.116 ± 0.237
4.233AlaGlu: 4.233 ± 1.135
3.88AlaPhe: 3.88 ± 0.006
2.116AlaGly: 2.116 ± 0.237
2.116AlaHis: 2.116 ± 1.084
2.116AlaIle: 2.116 ± 0.898
3.175AlaLys: 3.175 ± 0.966
4.233AlaLeu: 4.233 ± 0.474
1.058AlaMet: 1.058 ± 0.119
2.469AlaAsn: 2.469 ± 0.717
3.175AlaPro: 3.175 ± 0.356
2.116AlaGln: 2.116 ± 0.237
4.938AlaArg: 4.938 ± 2.095
8.466AlaSer: 8.466 ± 1.609
5.291AlaThr: 5.291 ± 0.593
4.586AlaVal: 4.586 ± 0.293
1.058AlaTrp: 1.058 ± 0.119
2.469AlaTyr: 2.469 ± 0.717
0.0AlaXaa: 0.0 ± 0.0
Cys
1.764CysAla: 1.764 ± 0.243
0.353CysCys: 0.353 ± 0.181
1.058CysAsp: 1.058 ± 0.542
2.116CysGlu: 2.116 ± 1.084
0.705CysPhe: 0.705 ± 0.361
1.411CysGly: 1.411 ± 0.723
0.353CysHis: 0.353 ± 0.181
2.469CysIle: 2.469 ± 0.056
0.705CysLys: 0.705 ± 0.299
1.411CysLeu: 1.411 ± 0.723
1.411CysMet: 1.411 ± 0.062
1.058CysAsn: 1.058 ± 0.542
0.705CysPro: 0.705 ± 0.361
0.0CysGln: 0.0 ± 0.0
0.705CysArg: 0.705 ± 0.361
0.353CysSer: 0.353 ± 0.181
0.353CysThr: 0.353 ± 0.181
1.764CysVal: 1.764 ± 0.243
0.705CysTrp: 0.705 ± 0.361
1.058CysTyr: 1.058 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
2.822AspAla: 2.822 ± 0.124
1.411AspCys: 1.411 ± 0.723
2.822AspAsp: 2.822 ± 0.124
3.175AspGlu: 3.175 ± 1.016
1.764AspPhe: 1.764 ± 1.739
2.469AspGly: 2.469 ± 0.056
0.353AspHis: 0.353 ± 0.181
5.996AspIle: 5.996 ± 0.892
1.411AspLys: 1.411 ± 0.723
4.586AspLeu: 4.586 ± 0.293
2.469AspMet: 2.469 ± 1.265
0.705AspAsn: 0.705 ± 0.299
2.469AspPro: 2.469 ± 0.056
1.764AspGln: 1.764 ± 0.243
2.822AspArg: 2.822 ± 0.785
3.88AspSer: 3.88 ± 0.006
4.586AspThr: 4.586 ± 0.367
4.938AspVal: 4.938 ± 1.434
0.0AspTrp: 0.0 ± 0.0
1.764AspTyr: 1.764 ± 0.904
0.0AspXaa: 0.0 ± 0.0
Glu
2.116GluAla: 2.116 ± 0.424
2.116GluCys: 2.116 ± 1.084
1.764GluAsp: 1.764 ± 0.904
3.527GluGlu: 3.527 ± 1.147
1.411GluPhe: 1.411 ± 1.259
1.058GluGly: 1.058 ± 0.119
0.705GluHis: 0.705 ± 0.361
4.233GluIle: 4.233 ± 1.508
4.938GluLys: 4.938 ± 1.87
4.586GluLeu: 4.586 ± 1.028
2.469GluMet: 2.469 ± 0.056
3.175GluAsn: 3.175 ± 0.305
3.527GluPro: 3.527 ± 0.836
4.586GluGln: 4.586 ± 0.367
2.469GluArg: 2.469 ± 0.604
3.88GluSer: 3.88 ± 0.006
4.233GluThr: 4.233 ± 0.847
4.586GluVal: 4.586 ± 0.954
1.411GluTrp: 1.411 ± 0.062
3.527GluTyr: 3.527 ± 1.147
0.0GluXaa: 0.0 ± 0.0
Phe
3.88PheAla: 3.88 ± 0.006
1.058PheCys: 1.058 ± 0.119
4.233PheAsp: 4.233 ± 0.187
2.469PheGlu: 2.469 ± 0.056
1.058PhePhe: 1.058 ± 0.119
3.175PheGly: 3.175 ± 1.016
1.764PheHis: 1.764 ± 1.078
1.411PheIle: 1.411 ± 0.062
1.764PheLys: 1.764 ± 0.243
5.291PheLeu: 5.291 ± 0.068
1.058PheMet: 1.058 ± 0.779
2.822PheAsn: 2.822 ± 1.858
1.058PhePro: 1.058 ± 1.44
2.469PheGln: 2.469 ± 0.604
1.764PheArg: 1.764 ± 0.243
1.058PheSer: 1.058 ± 0.542
3.527PheThr: 3.527 ± 0.175
0.705PheVal: 0.705 ± 0.361
1.058PheTrp: 1.058 ± 0.119
1.058PheTyr: 1.058 ± 0.542
0.0PheXaa: 0.0 ± 0.0
Gly
3.88GlyAla: 3.88 ± 1.316
0.0GlyCys: 0.0 ± 0.0
2.116GlyAsp: 2.116 ± 0.237
4.233GlyGlu: 4.233 ± 0.187
2.116GlyPhe: 2.116 ± 0.424
3.527GlyGly: 3.527 ± 0.486
0.353GlyHis: 0.353 ± 0.181
4.233GlyIle: 4.233 ± 1.508
4.233GlyLys: 4.233 ± 0.187
5.644GlyLeu: 5.644 ± 0.412
1.411GlyMet: 1.411 ± 0.201
4.233GlyAsn: 4.233 ± 0.474
1.764GlyPro: 1.764 ± 0.904
1.411GlyGln: 1.411 ± 0.723
1.058GlyArg: 1.058 ± 0.542
5.644GlySer: 5.644 ± 1.073
2.116GlyThr: 2.116 ± 1.558
4.938GlyVal: 4.938 ± 0.773
1.764GlyTrp: 1.764 ± 1.078
2.116GlyTyr: 2.116 ± 1.558
0.0GlyXaa: 0.0 ± 0.0
His
2.116HisAla: 2.116 ± 0.424
1.058HisCys: 1.058 ± 0.542
1.058HisAsp: 1.058 ± 0.542
1.411HisGlu: 1.411 ± 0.723
1.411HisPhe: 1.411 ± 0.598
0.353HisGly: 0.353 ± 0.181
0.0HisHis: 0.0 ± 0.0
2.116HisIle: 2.116 ± 0.237
0.353HisLys: 0.353 ± 0.181
0.705HisLeu: 0.705 ± 0.299
0.353HisMet: 0.353 ± 0.181
0.353HisAsn: 0.353 ± 0.181
0.705HisPro: 0.705 ± 0.299
1.058HisGln: 1.058 ± 0.542
2.822HisArg: 2.822 ± 0.536
0.705HisSer: 0.705 ± 0.299
1.058HisThr: 1.058 ± 0.119
2.116HisVal: 2.116 ± 0.424
0.0HisTrp: 0.0 ± 0.0
0.705HisTyr: 0.705 ± 0.299
0.0HisXaa: 0.0 ± 0.0
Ile
5.996IleAla: 5.996 ± 0.43
1.058IleCys: 1.058 ± 0.542
2.822IleAsp: 2.822 ± 0.124
3.175IleGlu: 3.175 ± 0.966
0.705IlePhe: 0.705 ± 0.361
3.527IleGly: 3.527 ± 0.175
0.705IleHis: 0.705 ± 0.96
3.527IleIle: 3.527 ± 0.175
2.469IleLys: 2.469 ± 0.056
4.586IleLeu: 4.586 ± 0.367
1.764IleMet: 1.764 ± 0.243
3.175IleAsn: 3.175 ± 0.356
5.644IlePro: 5.644 ± 1.57
1.058IleGln: 1.058 ± 0.542
2.469IleArg: 2.469 ± 0.056
6.349IleSer: 6.349 ± 2.033
6.349IleThr: 6.349 ± 2.033
2.116IleVal: 2.116 ± 0.424
1.058IleTrp: 1.058 ± 0.542
2.469IleTyr: 2.469 ± 0.604
0.0IleXaa: 0.0 ± 0.0
Lys
2.822LysAla: 2.822 ± 0.124
0.353LysCys: 0.353 ± 0.181
4.938LysAsp: 4.938 ± 2.53
3.88LysGlu: 3.88 ± 1.988
2.822LysPhe: 2.822 ± 0.785
2.116LysGly: 2.116 ± 1.084
1.764LysHis: 1.764 ± 0.904
4.233LysIle: 4.233 ± 0.847
1.764LysLys: 1.764 ± 0.904
5.644LysLeu: 5.644 ± 0.412
1.058LysMet: 1.058 ± 0.542
2.822LysAsn: 2.822 ± 0.124
1.764LysPro: 1.764 ± 0.904
1.764LysGln: 1.764 ± 0.904
1.411LysArg: 1.411 ± 0.598
3.175LysSer: 3.175 ± 1.627
4.233LysThr: 4.233 ± 1.508
2.116LysVal: 2.116 ± 0.237
0.705LysTrp: 0.705 ± 0.361
2.469LysTyr: 2.469 ± 0.604
0.0LysXaa: 0.0 ± 0.0
Leu
8.466LeuAla: 8.466 ± 3.591
0.705LeuCys: 0.705 ± 0.361
3.88LeuAsp: 3.88 ± 0.655
7.055LeuGlu: 7.055 ± 2.293
3.88LeuPhe: 3.88 ± 0.667
2.469LeuGly: 2.469 ± 0.604
1.764LeuHis: 1.764 ± 1.078
2.822LeuIle: 2.822 ± 0.785
4.938LeuLys: 4.938 ± 1.209
6.702LeuLeu: 6.702 ± 1.452
2.822LeuMet: 2.822 ± 1.197
2.116LeuAsn: 2.116 ± 0.237
4.938LeuPro: 4.938 ± 0.548
3.527LeuGln: 3.527 ± 2.157
5.291LeuArg: 5.291 ± 0.593
5.644LeuSer: 5.644 ± 0.91
8.113LeuThr: 8.113 ± 0.468
4.233LeuVal: 4.233 ± 0.847
1.411LeuTrp: 1.411 ± 0.723
3.527LeuTyr: 3.527 ± 0.175
0.0LeuXaa: 0.0 ± 0.0
Met
2.469MetAla: 2.469 ± 0.717
0.705MetCys: 0.705 ± 0.361
1.411MetAsp: 1.411 ± 0.723
1.764MetGlu: 1.764 ± 0.904
0.705MetPhe: 0.705 ± 0.299
1.411MetGly: 1.411 ± 0.062
0.705MetHis: 0.705 ± 0.361
3.175MetIle: 3.175 ± 0.305
2.822MetLys: 2.822 ± 0.785
1.411MetLeu: 1.411 ± 0.598
1.058MetMet: 1.058 ± 0.542
0.705MetAsn: 0.705 ± 0.361
0.705MetPro: 0.705 ± 0.361
0.705MetGln: 0.705 ± 0.299
1.764MetArg: 1.764 ± 0.904
1.764MetSer: 1.764 ± 0.904
2.822MetThr: 2.822 ± 1.197
3.175MetVal: 3.175 ± 0.356
0.0MetTrp: 0.0 ± 0.0
1.764MetTyr: 1.764 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
2.469AsnAla: 2.469 ± 0.717
0.705AsnCys: 0.705 ± 0.361
2.469AsnAsp: 2.469 ± 0.056
1.411AsnGlu: 1.411 ± 0.598
3.175AsnPhe: 3.175 ± 0.356
3.527AsnGly: 3.527 ± 0.175
0.353AsnHis: 0.353 ± 0.181
3.175AsnIle: 3.175 ± 0.305
2.469AsnLys: 2.469 ± 0.604
4.233AsnLeu: 4.233 ± 0.474
3.527AsnMet: 3.527 ± 0.139
3.175AsnAsn: 3.175 ± 0.305
2.116AsnPro: 2.116 ± 0.237
1.058AsnGln: 1.058 ± 0.119
1.411AsnArg: 1.411 ± 1.259
2.822AsnSer: 2.822 ± 1.446
1.764AsnThr: 1.764 ± 0.418
3.175AsnVal: 3.175 ± 1.016
0.705AsnTrp: 0.705 ± 0.96
2.116AsnTyr: 2.116 ± 0.898
0.0AsnXaa: 0.0 ± 0.0
Pro
1.764ProAla: 1.764 ± 0.243
0.705ProCys: 0.705 ± 0.361
2.116ProAsp: 2.116 ± 1.558
2.822ProGlu: 2.822 ± 0.785
3.88ProPhe: 3.88 ± 1.976
1.764ProGly: 1.764 ± 1.078
0.705ProHis: 0.705 ± 0.299
2.116ProIle: 2.116 ± 1.558
1.764ProLys: 1.764 ± 0.243
4.233ProLeu: 4.233 ± 1.135
1.058ProMet: 1.058 ± 0.542
1.411ProAsn: 1.411 ± 0.062
1.411ProPro: 1.411 ± 0.062
0.353ProGln: 0.353 ± 0.48
2.116ProArg: 2.116 ± 0.898
2.822ProSer: 2.822 ± 0.536
6.349ProThr: 6.349 ± 1.271
3.527ProVal: 3.527 ± 0.486
2.822ProTrp: 2.822 ± 0.785
2.469ProTyr: 2.469 ± 0.717
0.0ProXaa: 0.0 ± 0.0
Gln
2.116GlnAla: 2.116 ± 0.424
0.705GlnCys: 0.705 ± 0.299
1.058GlnAsp: 1.058 ± 0.119
2.469GlnGlu: 2.469 ± 0.604
2.116GlnPhe: 2.116 ± 0.424
1.411GlnGly: 1.411 ± 0.062
0.353GlnHis: 0.353 ± 0.181
1.764GlnIle: 1.764 ± 1.078
1.411GlnLys: 1.411 ± 0.723
1.764GlnLeu: 1.764 ± 0.418
1.058GlnMet: 1.058 ± 0.779
1.411GlnAsn: 1.411 ± 0.062
1.058GlnPro: 1.058 ± 0.779
1.058GlnGln: 1.058 ± 0.119
2.116GlnArg: 2.116 ± 0.237
4.586GlnSer: 4.586 ± 0.367
3.88GlnThr: 3.88 ± 0.655
1.411GlnVal: 1.411 ± 0.598
0.705GlnTrp: 0.705 ± 0.361
2.116GlnTyr: 2.116 ± 0.424
0.0GlnXaa: 0.0 ± 0.0
Arg
2.116ArgAla: 2.116 ± 0.424
1.058ArgCys: 1.058 ± 0.542
1.058ArgAsp: 1.058 ± 0.119
1.764ArgGlu: 1.764 ± 0.243
3.175ArgPhe: 3.175 ± 0.356
3.175ArgGly: 3.175 ± 1.677
1.764ArgHis: 1.764 ± 0.243
3.88ArgIle: 3.88 ± 1.316
3.175ArgLys: 3.175 ± 1.627
4.586ArgLeu: 4.586 ± 0.367
2.116ArgMet: 2.116 ± 1.084
3.527ArgAsn: 3.527 ± 0.836
3.88ArgPro: 3.88 ± 1.976
2.116ArgGln: 2.116 ± 0.898
2.822ArgArg: 2.822 ± 0.785
2.469ArgSer: 2.469 ± 0.604
2.116ArgThr: 2.116 ± 0.237
3.88ArgVal: 3.88 ± 0.667
1.411ArgTrp: 1.411 ± 0.062
1.764ArgTyr: 1.764 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
6.349SerAla: 6.349 ± 0.711
1.058SerCys: 1.058 ± 0.542
5.291SerAsp: 5.291 ± 0.729
2.469SerGlu: 2.469 ± 0.604
3.175SerPhe: 3.175 ± 0.966
5.644SerGly: 5.644 ± 0.412
1.764SerHis: 1.764 ± 0.418
5.291SerIle: 5.291 ± 0.729
2.822SerLys: 2.822 ± 0.124
5.644SerLeu: 5.644 ± 0.412
2.116SerMet: 2.116 ± 1.084
4.586SerAsn: 4.586 ± 1.028
2.116SerPro: 2.116 ± 0.237
3.88SerGln: 3.88 ± 0.655
4.233SerArg: 4.233 ± 0.474
4.938SerSer: 4.938 ± 0.548
3.88SerThr: 3.88 ± 1.316
6.702SerVal: 6.702 ± 0.53
0.705SerTrp: 0.705 ± 0.361
1.764SerTyr: 1.764 ± 0.904
0.0SerXaa: 0.0 ± 0.0
Thr
4.233ThrAla: 4.233 ± 0.187
1.411ThrCys: 1.411 ± 0.062
4.233ThrAsp: 4.233 ± 1.795
5.996ThrGlu: 5.996 ± 0.892
2.116ThrPhe: 2.116 ± 0.237
7.055ThrGly: 7.055 ± 0.311
2.116ThrHis: 2.116 ± 0.424
3.527ThrIle: 3.527 ± 0.175
2.822ThrLys: 2.822 ± 1.446
5.996ThrLeu: 5.996 ± 0.231
1.411ThrMet: 1.411 ± 0.062
5.291ThrAsn: 5.291 ± 1.914
4.938ThrPro: 4.938 ± 2.755
2.116ThrGln: 2.116 ± 0.237
4.586ThrArg: 4.586 ± 1.028
5.291ThrSer: 5.291 ± 0.593
5.644ThrThr: 5.644 ± 1.57
4.586ThrVal: 4.586 ± 0.293
0.705ThrTrp: 0.705 ± 0.96
3.88ThrTyr: 3.88 ± 0.655
0.0ThrXaa: 0.0 ± 0.0
Val
3.175ValAla: 3.175 ± 1.677
2.469ValCys: 2.469 ± 0.056
4.938ValAsp: 4.938 ± 1.434
2.469ValGlu: 2.469 ± 0.056
2.116ValPhe: 2.116 ± 0.237
4.586ValGly: 4.586 ± 2.275
1.411ValHis: 1.411 ± 0.723
2.822ValIle: 2.822 ± 0.785
5.291ValLys: 5.291 ± 2.05
6.349ValLeu: 6.349 ± 1.271
1.058ValMet: 1.058 ± 0.542
1.411ValAsn: 1.411 ± 0.598
2.469ValPro: 2.469 ± 0.717
1.411ValGln: 1.411 ± 0.598
4.233ValArg: 4.233 ± 0.474
5.291ValSer: 5.291 ± 0.729
6.702ValThr: 6.702 ± 1.852
2.822ValVal: 2.822 ± 0.536
0.353ValTrp: 0.353 ± 0.181
2.822ValTyr: 2.822 ± 0.785
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.361
0.0TrpCys: 0.0 ± 0.0
1.411TrpAsp: 1.411 ± 0.062
1.411TrpGlu: 1.411 ± 0.723
1.058TrpPhe: 1.058 ± 0.119
1.411TrpGly: 1.411 ± 0.062
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
2.116TrpLys: 2.116 ± 0.424
1.411TrpLeu: 1.411 ± 0.062
0.353TrpMet: 0.353 ± 0.48
0.705TrpAsn: 0.705 ± 0.299
0.705TrpPro: 0.705 ± 0.299
0.705TrpGln: 0.705 ± 0.299
1.411TrpArg: 1.411 ± 0.062
1.764TrpSer: 1.764 ± 0.904
1.058TrpThr: 1.058 ± 0.779
0.353TrpVal: 0.353 ± 0.181
0.705TrpTrp: 0.705 ± 0.361
0.353TrpTyr: 0.353 ± 0.48
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.469TyrAla: 2.469 ± 1.378
1.411TyrCys: 1.411 ± 0.598
1.411TyrAsp: 1.411 ± 0.598
2.469TyrGlu: 2.469 ± 0.056
1.411TyrPhe: 1.411 ± 0.598
4.938TyrGly: 4.938 ± 0.548
1.411TyrHis: 1.411 ± 0.062
2.116TyrIle: 2.116 ± 0.424
1.411TyrLys: 1.411 ± 0.723
5.291TyrLeu: 5.291 ± 0.068
1.058TyrMet: 1.058 ± 0.542
0.705TyrAsn: 0.705 ± 0.361
1.058TyrPro: 1.058 ± 0.119
1.411TyrGln: 1.411 ± 0.723
1.411TyrArg: 1.411 ± 0.723
3.527TyrSer: 3.527 ± 0.175
3.88TyrThr: 3.88 ± 0.006
2.469TyrVal: 2.469 ± 0.056
0.353TyrTrp: 0.353 ± 0.48
1.058TyrTyr: 1.058 ± 0.779
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2836 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
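A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the error is taken as the standard deviation of those estimates. The function names, the fixed seed, and the choice of standard deviation as the error measure are hypothetical and not confirmed by the source.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the spread of the resampled estimates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy proteome of two sequences, not the real data behind this page.
    toy = ["MKVLAAGSAA", "MTTLSAAGK"]
    print(bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each resample can contain just three distinct combinations, so the resulting error estimate is coarse.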

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski