Amino acid dipeptide frequency for Rodent arterivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
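As a rough illustration of how such frequencies can be counted, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping pairs counted within each protein (never across protein boundaries), and normalization by the total number of dipeptides; the page does not state the exact procedure used, so treat these choices as assumptions. The function name and the toy sequences are made up for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and the counts are divided by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, NOT taken from the Rodent arterivirus proteome.
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input contains five dipeptides (MA, AA, AL, AA, AG), so AA accounts for two of the five.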

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
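The expected value and the over/under comparison can be sketched in a few lines of Python. The threshold used here is the uniform expectation described above (1/400, or 1/441 with Xaa); the function names are hypothetical, and whether the page's colouring applies any additional tolerance is not stated, so this is only an approximation of the idea.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille()         # 20 amino acids -> 400 pairs
expected_with_xaa = uniform_expectation_permille(21)  # 21 symbols -> 441 pairs

# Two values copied from the table (in per mille).
print(classify(11.548, expected))  # AlaAla
print(classify(0.196, expected))   # HisTyr
```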

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
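For readers who want the values as plain fractions or percentages, the conversion is a simple rescaling. The helper names below are made up for this small sketch; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 11.548  # AlaAla from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```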

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first amino acid; second amino acids in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.548 ± 1.606
AlaCys: 3.132 ± 1.044
AlaAsp: 3.915 ± 1.086
AlaGlu: 4.306 ± 0.645
AlaPhe: 4.893 ± 1.177
AlaGly: 6.068 ± 1.373
AlaHis: 1.957 ± 0.517
AlaIle: 5.089 ± 1.662
AlaLys: 3.327 ± 1.646
AlaLeu: 7.242 ± 0.984
AlaMet: 1.957 ± 0.76
AlaAsn: 2.349 ± 0.865
AlaPro: 7.829 ± 1.108
AlaGln: 2.936 ± 1.452
AlaArg: 3.132 ± 0.809
AlaSer: 8.612 ± 0.847
AlaThr: 3.523 ± 0.66
AlaVal: 7.046 ± 0.815
AlaTrp: 0.587 ± 0.412
AlaTyr: 2.545 ± 0.993
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.349 ± 1.41
CysCys: 1.174 ± 0.482
CysAsp: 2.349 ± 0.617
CysGlu: 0.783 ± 0.322
CysPhe: 1.174 ± 0.668
CysGly: 1.762 ± 0.871
CysHis: 0.783 ± 0.387
CysIle: 0.979 ± 0.519
CysLys: 1.37 ± 0.8
CysLeu: 3.915 ± 0.866
CysMet: 0.783 ± 0.952
CysAsn: 0.783 ± 0.387
CysPro: 1.174 ± 0.581
CysGln: 0.979 ± 0.484
CysArg: 2.153 ± 0.678
CysSer: 2.349 ± 0.826
CysThr: 1.957 ± 0.634
CysVal: 1.957 ± 0.449
CysTrp: 1.566 ± 0.576
CysTyr: 1.174 ± 0.379
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.936 ± 1.118
AspCys: 0.587 ± 0.29
AspAsp: 2.545 ± 0.692
AspGlu: 1.762 ± 0.527
AspPhe: 1.174 ± 0.581
AspGly: 3.915 ± 1.59
AspHis: 0.391 ± 0.194
AspIle: 2.545 ± 0.993
AspLys: 2.153 ± 0.88
AspLeu: 5.285 ± 1.145
AspMet: 1.174 ± 0.584
AspAsn: 0.391 ± 0.903
AspPro: 3.327 ± 1.646
AspGln: 1.566 ± 0.657
AspArg: 3.132 ± 0.747
AspSer: 2.936 ± 0.585
AspThr: 1.762 ± 0.588
AspVal: 3.719 ± 0.864
AspTrp: 1.566 ± 0.775
AspTyr: 1.174 ± 0.379
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.327 ± 0.95
GluCys: 1.957 ± 0.725
GluAsp: 1.957 ± 0.57
GluGlu: 2.153 ± 0.622
GluPhe: 1.566 ± 0.525
GluGly: 2.545 ± 0.436
GluHis: 0.783 ± 0.387
GluIle: 1.37 ± 0.497
GluLys: 2.545 ± 1.179
GluLeu: 3.915 ± 0.805
GluMet: 0.587 ± 0.421
GluAsn: 0.979 ± 0.428
GluPro: 2.545 ± 0.936
GluGln: 1.566 ± 0.54
GluArg: 1.762 ± 0.588
GluSer: 3.132 ± 0.79
GluThr: 2.545 ± 0.985
GluVal: 3.132 ± 1.152
GluTrp: 0.979 ± 0.623
GluTyr: 1.37 ± 0.678
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.285 ± 1.69
PheCys: 1.174 ± 0.636
PheAsp: 1.37 ± 0.527
PheGlu: 2.545 ± 0.744
PhePhe: 3.132 ± 1.494
PheGly: 1.957 ± 0.47
PheHis: 0.391 ± 0.476
PheIle: 1.566 ± 0.558
PheLys: 1.957 ± 0.449
PheLeu: 5.285 ± 2.363
PheMet: 0.587 ± 0.421
PheAsn: 1.174 ± 1.196
PhePro: 2.74 ± 0.884
PheGln: 0.587 ± 0.412
PheArg: 0.979 ± 0.428
PheSer: 4.893 ± 2.502
PheThr: 2.349 ± 0.757
PheVal: 2.74 ± 1.372
PheTrp: 0.587 ± 0.29
PheTyr: 1.762 ± 0.611
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.306 ± 1.081
GlyCys: 1.957 ± 0.968
GlyAsp: 4.893 ± 1.282
GlyGlu: 1.762 ± 0.466
GlyPhe: 3.132 ± 0.926
GlyGly: 5.089 ± 0.823
GlyHis: 1.566 ± 0.444
GlyIle: 2.545 ± 0.704
GlyLys: 4.11 ± 1.238
GlyLeu: 5.285 ± 2.27
GlyMet: 0.783 ± 0.387
GlyAsn: 2.74 ± 0.941
GlyPro: 4.306 ± 0.786
GlyGln: 2.545 ± 1.011
GlyArg: 4.893 ± 1.683
GlySer: 6.263 ± 0.96
GlyThr: 3.915 ± 1.301
GlyVal: 6.655 ± 1.347
GlyTrp: 0.783 ± 0.421
GlyTyr: 2.349 ± 1.164
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.957 ± 0.552
HisCys: 0.979 ± 0.403
HisAsp: 0.587 ± 0.421
HisGlu: 1.174 ± 0.581
HisPhe: 1.957 ± 2.285
HisGly: 1.566 ± 0.463
HisHis: 0.783 ± 0.726
HisIle: 0.783 ± 0.413
HisLys: 0.391 ± 0.194
HisLeu: 2.545 ± 0.936
HisMet: 0.979 ± 0.428
HisAsn: 1.174 ± 0.842
HisPro: 1.762 ± 0.557
HisGln: 0.587 ± 0.29
HisArg: 0.783 ± 0.898
HisSer: 0.391 ± 0.194
HisThr: 1.566 ± 0.612
HisVal: 2.349 ± 1.106
HisTrp: 0.979 ± 0.484
HisTyr: 0.196 ± 0.097
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.11 ± 1.568
IleCys: 0.979 ± 0.428
IleAsp: 2.545 ± 0.673
IleGlu: 1.762 ± 0.62
IlePhe: 2.153 ± 1.631
IleGly: 1.957 ± 0.677
IleHis: 1.37 ± 0.558
IleIle: 1.957 ± 1.187
IleLys: 2.349 ± 0.681
IleLeu: 5.089 ± 2.015
IleMet: 0.391 ± 0.433
IleAsn: 1.762 ± 0.527
IlePro: 1.762 ± 1.097
IleGln: 1.174 ± 0.475
IleArg: 1.762 ± 0.926
IleSer: 1.957 ± 1.472
IleThr: 3.915 ± 1.261
IleVal: 3.523 ± 0.716
IleTrp: 0.391 ± 0.194
IleTyr: 1.762 ± 1.48
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.327 ± 1.091
LysCys: 1.174 ± 0.581
LysAsp: 1.174 ± 0.541
LysGlu: 1.566 ± 0.468
LysPhe: 1.762 ± 0.663
LysGly: 3.523 ± 1.472
LysHis: 0.783 ± 0.396
LysIle: 1.762 ± 0.527
LysLys: 3.915 ± 0.923
LysLeu: 3.132 ± 1.549
LysMet: 1.37 ± 1.15
LysAsn: 2.936 ± 1.519
LysPro: 2.936 ± 0.628
LysGln: 1.566 ± 1.069
LysArg: 1.37 ± 0.436
LysSer: 2.349 ± 0.481
LysThr: 2.74 ± 0.466
LysVal: 4.502 ± 1.192
LysTrp: 1.37 ± 0.436
LysTyr: 2.153 ± 1.119
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.765 ± 1.556
LeuCys: 3.523 ± 1.034
LeuAsp: 5.089 ± 1.045
LeuGlu: 4.502 ± 1.096
LeuPhe: 4.502 ± 2.311
LeuGly: 6.459 ± 0.916
LeuHis: 2.153 ± 0.492
LeuIle: 4.306 ± 1.412
LeuLys: 3.523 ± 1.307
LeuLeu: 9.395 ± 2.932
LeuMet: 1.762 ± 0.624
LeuAsn: 2.936 ± 0.789
LeuPro: 7.046 ± 2.12
LeuGln: 2.936 ± 0.475
LeuArg: 5.481 ± 1.778
LeuSer: 8.025 ± 1.786
LeuThr: 6.068 ± 1.184
LeuVal: 7.829 ± 2.068
LeuTrp: 1.37 ± 0.558
LeuTyr: 1.37 ± 0.522
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.153 ± 0.97
MetCys: 0.587 ± 0.957
MetAsp: 0.979 ± 0.484
MetGlu: 0.391 ± 0.194
MetPhe: 0.783 ± 0.387
MetGly: 1.957 ± 1.504
MetHis: 0.391 ± 0.449
MetIle: 1.37 ± 0.804
MetLys: 0.979 ± 0.403
MetLeu: 2.545 ± 0.83
MetMet: 0.979 ± 0.484
MetAsn: 0.587 ± 0.421
MetPro: 0.783 ± 0.535
MetGln: 0.0 ± 0.0
MetArg: 0.783 ± 0.413
MetSer: 1.566 ± 1.406
MetThr: 1.174 ± 0.47
MetVal: 2.153 ± 0.604
MetTrp: 0.783 ± 0.76
MetTyr: 0.196 ± 0.497
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.174 ± 0.546
AsnCys: 1.37 ± 0.7
AsnAsp: 1.566 ± 0.45
AsnGlu: 1.37 ± 0.48
AsnPhe: 0.783 ± 0.387
AsnGly: 2.349 ± 1.119
AsnHis: 0.979 ± 0.693
AsnIle: 1.37 ± 0.829
AsnLys: 1.762 ± 0.958
AsnLeu: 2.349 ± 0.891
AsnMet: 1.174 ± 1.09
AsnAsn: 0.587 ± 0.29
AsnPro: 0.783 ± 0.387
AsnGln: 1.566 ± 1.318
AsnArg: 2.153 ± 0.859
AsnSer: 2.349 ± 0.592
AsnThr: 2.936 ± 1.087
AsnVal: 3.523 ± 1.039
AsnTrp: 0.196 ± 0.497
AsnTyr: 0.979 ± 0.519
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.459 ± 1.661
ProCys: 1.37 ± 0.8
ProAsp: 3.327 ± 1.646
ProGlu: 3.132 ± 0.85
ProPhe: 2.349 ± 0.796
ProGly: 5.481 ± 1.091
ProHis: 2.349 ± 0.926
ProIle: 3.132 ± 1.839
ProLys: 3.523 ± 1.208
ProLeu: 6.655 ± 1.16
ProMet: 0.391 ± 0.601
ProAsn: 2.153 ± 1.065
ProPro: 3.719 ± 1.186
ProGln: 2.153 ± 0.619
ProArg: 1.762 ± 0.647
ProSer: 4.698 ± 0.732
ProThr: 3.523 ± 0.906
ProVal: 6.459 ± 2.532
ProTrp: 1.174 ± 0.432
ProTyr: 2.545 ± 0.673
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.327 ± 0.631
GlnCys: 1.174 ± 0.581
GlnAsp: 0.391 ± 0.194
GlnGlu: 0.391 ± 0.45
GlnPhe: 1.37 ± 0.5
GlnGly: 3.132 ± 1.14
GlnHis: 0.783 ± 0.387
GlnIle: 0.783 ± 0.387
GlnLys: 0.783 ± 0.535
GlnLeu: 4.893 ± 1.142
GlnMet: 0.783 ± 0.646
GlnAsn: 0.783 ± 0.387
GlnPro: 2.153 ± 0.895
GlnGln: 1.957 ± 1.219
GlnArg: 1.566 ± 0.691
GlnSer: 2.545 ± 1.259
GlnThr: 2.349 ± 0.661
GlnVal: 3.915 ± 0.939
GlnTrp: 0.391 ± 0.45
GlnTyr: 0.587 ± 0.412
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.523 ± 0.989
ArgCys: 1.566 ± 0.535
ArgAsp: 2.349 ± 0.557
ArgGlu: 1.37 ± 0.412
ArgPhe: 2.545 ± 0.936
ArgGly: 3.523 ± 1.398
ArgHis: 1.762 ± 0.623
ArgIle: 2.349 ± 0.534
ArgLys: 1.762 ± 1.927
ArgLeu: 4.11 ± 1.208
ArgMet: 1.762 ± 0.576
ArgAsn: 2.349 ± 0.496
ArgPro: 4.306 ± 0.949
ArgGln: 1.957 ± 0.599
ArgArg: 3.132 ± 1.081
ArgSer: 1.957 ± 0.648
ArgThr: 1.37 ± 0.678
ArgVal: 4.698 ± 0.909
ArgTrp: 1.174 ± 1.212
ArgTyr: 2.153 ± 0.713
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.438 ± 1.412
SerCys: 2.349 ± 0.656
SerAsp: 2.74 ± 0.785
SerGlu: 3.719 ± 0.864
SerPhe: 2.349 ± 0.878
SerGly: 6.068 ± 1.173
SerHis: 1.762 ± 1.265
SerIle: 3.523 ± 1.978
SerLys: 2.936 ± 0.902
SerLeu: 7.242 ± 1.526
SerMet: 1.37 ± 0.48
SerAsn: 2.153 ± 0.768
SerPro: 4.11 ± 1.1
SerGln: 2.936 ± 0.789
SerArg: 2.153 ± 0.771
SerSer: 7.634 ± 1.961
SerThr: 4.698 ± 1.239
SerVal: 3.523 ± 0.896
SerTrp: 1.762 ± 1.468
SerTyr: 3.327 ± 1.177
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.089 ± 0.967
ThrCys: 1.37 ± 0.481
ThrAsp: 1.762 ± 0.588
ThrGlu: 1.566 ± 0.576
ThrPhe: 1.762 ± 0.588
ThrGly: 4.306 ± 0.963
ThrHis: 1.762 ± 0.459
ThrIle: 2.153 ± 0.872
ThrLys: 2.74 ± 0.667
ThrLeu: 5.089 ± 1.179
ThrMet: 1.174 ± 0.626
ThrAsn: 2.153 ± 1.071
ThrPro: 6.655 ± 1.499
ThrGln: 2.74 ± 0.884
ThrArg: 3.915 ± 0.804
ThrSer: 3.719 ± 0.947
ThrThr: 2.936 ± 0.435
ThrVal: 5.089 ± 0.949
ThrTrp: 0.587 ± 0.412
ThrTyr: 1.566 ± 0.898
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.221 ± 2.267
ValCys: 3.327 ± 0.742
ValAsp: 2.153 ± 0.519
ValGlu: 4.893 ± 1.477
ValPhe: 3.327 ± 1.054
ValGly: 4.698 ± 1.525
ValHis: 0.783 ± 0.387
ValIle: 2.936 ± 1.753
ValLys: 2.936 ± 0.435
ValLeu: 8.808 ± 1.889
ValMet: 2.349 ± 1.277
ValAsn: 2.545 ± 1.356
ValPro: 6.655 ± 1.952
ValGln: 2.153 ± 1.146
ValArg: 6.068 ± 1.231
ValSer: 5.481 ± 0.874
ValThr: 6.068 ± 0.567
ValVal: 7.046 ± 1.258
ValTrp: 1.174 ± 1.495
ValTyr: 2.936 ± 1.144
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.37 ± 0.762
TrpCys: 0.587 ± 0.29
TrpAsp: 0.391 ± 0.194
TrpGlu: 0.587 ± 0.29
TrpPhe: 1.566 ± 1.138
TrpGly: 0.783 ± 0.504
TrpHis: 0.783 ± 0.387
TrpIle: 0.391 ± 0.194
TrpLys: 0.587 ± 0.334
TrpLeu: 2.74 ± 1.326
TrpMet: 0.196 ± 0.097
TrpAsn: 0.391 ± 0.194
TrpPro: 0.587 ± 0.412
TrpGln: 0.587 ± 0.421
TrpArg: 0.783 ± 0.743
TrpSer: 0.979 ± 0.339
TrpThr: 1.566 ± 0.576
TrpVal: 2.153 ± 1.63
TrpTrp: 0.391 ± 0.372
TrpTyr: 0.587 ± 0.412
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.327 ± 1.292
TyrCys: 1.174 ± 0.581
TyrAsp: 1.957 ± 0.671
TyrGlu: 0.979 ± 0.428
TyrPhe: 0.783 ± 0.387
TyrGly: 2.74 ± 1.688
TyrHis: 1.174 ± 0.581
TyrIle: 1.762 ± 0.901
TyrLys: 1.762 ± 0.522
TyrLeu: 3.523 ± 1.251
TyrMet: 0.391 ± 0.194
TyrAsn: 0.391 ± 0.372
TyrPro: 1.37 ± 0.449
TyrGln: 1.566 ± 0.439
TyrArg: 1.957 ± 0.517
TyrSer: 1.957 ± 1.504
TyrThr: 1.174 ± 0.636
TyrVal: 2.545 ± 0.695
TyrTrp: 0.196 ± 0.097
TyrTyr: 0.391 ± 0.45
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5110 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level
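To give an idea of what a protein-level bootstrap can look like, here is a minimal Python sketch. It assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for every resample, and that the reported error is the standard deviation across resamples; none of these details are spelled out on this page, so they are assumptions, as are the function names, the fixed seed and the toy sequences.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(proteins) whole proteins
    with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)


# Toy sequences, NOT taken from the Rodent arterivirus proteome.
toy = ["MAAL", "AAG", "GLLA"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a handful of proteins the resamples differ a lot from one another, which would be consistent with the fairly large errors relative to the values in the table.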

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski