Amino acid dipeptide frequency for Sanxia atyid shrimp virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
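The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the toy sequences, and the choice to count overlapping pairs within each protein are hypothetical, and the text does not state how the database itself counts pairs or defines the expected value beyond the uniform 0.25% figure.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the residue order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted within each sequence, and any
    symbol outside the 20 standard residues is mapped to Xaa.
    """
    counts = Counter()
    for seq in sequences:
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def representation(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency against the uniform expectation (red/blue marking)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MALLSV", "LLAX"]
freqs = dipeptide_permille(toy)
print(freqs["LeuLeu"], representation(freqs["LeuLeu"]))
print(freqs["LeuLeu"] * 1e-3)  # per mille converted back to a fraction
```

Multiplying a per mille value by 10^-3 gives the fraction of all dipeptides, which is the conversion mentioned above.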

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.031 ± 1.082
AlaCys: 0.27 ± 0.882
AlaAsp: 1.893 ± 0.729
AlaGlu: 1.082 ± 0.595
AlaPhe: 3.245 ± 1.242
AlaGly: 3.515 ± 1.417
AlaHis: 2.434 ± 0.736
AlaIle: 4.056 ± 1.52
AlaLys: 3.786 ± 1.294
AlaLeu: 8.924 ± 2.317
AlaMet: 1.082 ± 0.595
AlaAsn: 2.975 ± 0.578
AlaPro: 4.597 ± 1.451
AlaGln: 1.893 ± 1.919
AlaArg: 4.056 ± 3.041
AlaSer: 6.76 ± 1.68
AlaThr: 2.163 ± 0.479
AlaVal: 6.76 ± 1.68
AlaTrp: 0.541 ± 0.355
AlaTyr: 2.163 ± 0.854
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 3.245 ± 1.349
CysCys: 0.811 ± 1.687
CysAsp: 1.082 ± 0.595
CysGlu: 1.082 ± 0.759
CysPhe: 1.622 ± 1.529
CysGly: 1.082 ± 0.334
CysHis: 0.541 ± 0.355
CysIle: 0.541 ± 0.297
CysLys: 0.811 ± 0.446
CysLeu: 1.622 ± 0.614
CysMet: 0.27 ± 0.882
CysAsn: 0.541 ± 0.297
CysPro: 2.434 ± 0.65
CysGln: 0.27 ± 0.149
CysArg: 1.352 ± 1.564
CysSer: 1.352 ± 0.669
CysThr: 1.622 ± 0.866
CysVal: 1.893 ± 0.754
CysTrp: 0.0 ± 0.0
CysTyr: 1.622 ± 1.521
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.867 ± 1.163
AspCys: 0.541 ± 0.297
AspAsp: 4.597 ± 2.032
AspGlu: 3.245 ± 1.784
AspPhe: 3.515 ± 0.788
AspGly: 2.704 ± 1.487
AspHis: 1.893 ± 0.842
AspIle: 4.597 ± 1.132
AspLys: 3.515 ± 1.933
AspLeu: 6.49 ± 1.36
AspMet: 1.893 ± 0.648
AspAsn: 0.541 ± 0.355
AspPro: 3.245 ± 1.001
AspGln: 0.541 ± 0.297
AspArg: 3.515 ± 1.784
AspSer: 4.867 ± 1.841
AspThr: 2.704 ± 1.122
AspVal: 5.138 ± 2.415
AspTrp: 0.0 ± 0.0
AspTyr: 3.245 ± 0.817
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.622 ± 0.523
GluCys: 2.163 ± 0.957
GluAsp: 2.163 ± 1.189
GluGlu: 1.893 ± 1.041
GluPhe: 1.622 ± 1.332
GluGly: 0.811 ± 0.616
GluHis: 1.352 ± 0.669
GluIle: 3.515 ± 1.494
GluLys: 2.163 ± 0.562
GluLeu: 4.056 ± 0.862
GluMet: 1.082 ± 0.595
GluAsn: 1.352 ± 0.743
GluPro: 1.893 ± 1.096
GluGln: 0.541 ± 0.297
GluArg: 2.704 ± 0.825
GluSer: 3.245 ± 1.349
GluThr: 1.622 ± 0.927
GluVal: 3.245 ± 1.173
GluTrp: 0.0 ± 0.0
GluTyr: 1.352 ± 0.669
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.434 ± 0.86
PheCys: 2.163 ± 1.189
PheAsp: 3.515 ± 1.494
PheGlu: 2.163 ± 1.067
PhePhe: 3.245 ± 1.951
PheGly: 2.704 ± 0.97
PheHis: 0.811 ± 0.794
PheIle: 3.245 ± 0.652
PheLys: 2.975 ± 1.205
PheLeu: 4.867 ± 2.901
PheMet: 1.082 ± 0.711
PheAsn: 3.515 ± 1.657
PhePro: 2.975 ± 1.88
PheGln: 1.082 ± 1.597
PheArg: 1.622 ± 0.622
PheSer: 7.301 ± 1.819
PheThr: 3.515 ± 1.284
PheVal: 6.49 ± 1.597
PheTrp: 0.541 ± 0.736
PheTyr: 1.352 ± 0.688
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.245 ± 0.652
GlyCys: 1.352 ± 0.745
GlyAsp: 3.515 ± 1.494
GlyGlu: 0.541 ± 0.297
GlyPhe: 1.893 ± 0.648
GlyGly: 2.434 ± 1.335
GlyHis: 1.352 ± 1.307
GlyIle: 3.515 ± 2.131
GlyLys: 2.163 ± 1.189
GlyLeu: 3.245 ± 0.862
GlyMet: 0.27 ± 0.518
GlyAsn: 2.434 ± 0.921
GlyPro: 0.811 ± 0.423
GlyGln: 2.434 ± 0.611
GlyArg: 2.163 ± 0.814
GlySer: 4.327 ± 1.006
GlyThr: 2.704 ± 1.673
GlyVal: 2.434 ± 0.86
GlyTrp: 0.541 ± 0.355
GlyTyr: 2.434 ± 1.02
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.352 ± 0.413
HisCys: 1.622 ± 0.781
HisAsp: 2.434 ± 0.736
HisGlu: 1.082 ± 0.738
HisPhe: 1.622 ± 0.927
HisGly: 0.811 ± 0.76
HisHis: 0.811 ± 0.446
HisIle: 2.163 ± 0.854
HisLys: 0.811 ± 0.446
HisLeu: 2.704 ± 0.538
HisMet: 0.541 ± 0.355
HisAsn: 1.082 ± 0.595
HisPro: 1.893 ± 0.566
HisGln: 0.0 ± 0.0
HisArg: 0.541 ± 0.297
HisSer: 2.704 ± 2.148
HisThr: 0.27 ± 0.447
HisVal: 2.163 ± 0.667
HisTrp: 0.541 ± 0.297
HisTyr: 1.893 ± 1.286
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.138 ± 2.457
IleCys: 0.27 ± 0.149
IleAsp: 5.408 ± 1.074
IleGlu: 3.786 ± 0.733
IlePhe: 4.056 ± 1.406
IleGly: 1.893 ± 1.491
IleHis: 1.893 ± 0.567
IleIle: 2.975 ± 0.93
IleLys: 2.434 ± 0.921
IleLeu: 6.22 ± 1.717
IleMet: 0.811 ± 0.446
IleAsn: 2.163 ± 1.189
IlePro: 2.975 ± 1.205
IleGln: 0.541 ± 0.297
IleArg: 3.245 ± 0.613
IleSer: 6.22 ± 2.663
IleThr: 4.327 ± 0.66
IleVal: 3.515 ± 0.832
IleTrp: 1.622 ± 1.588
IleTyr: 1.622 ± 0.892
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.622 ± 0.892
LysCys: 0.811 ± 0.446
LysAsp: 2.975 ± 1.189
LysGlu: 1.352 ± 0.413
LysPhe: 4.056 ± 1.383
LysGly: 2.975 ± 1.205
LysHis: 1.352 ± 0.743
LysIle: 4.056 ± 1.786
LysLys: 2.434 ± 0.921
LysLeu: 2.704 ± 1.062
LysMet: 1.622 ± 0.892
LysAsn: 2.163 ± 0.932
LysPro: 2.163 ± 0.863
LysGln: 2.163 ± 0.997
LysArg: 2.704 ± 0.627
LysSer: 4.597 ± 1.466
LysThr: 2.434 ± 0.736
LysVal: 2.704 ± 1.062
LysTrp: 0.811 ± 0.446
LysTyr: 2.975 ± 1.205
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.572 ± 3.547
LeuCys: 2.704 ± 0.579
LeuAsp: 5.679 ± 0.908
LeuGlu: 4.327 ± 1.167
LeuPhe: 5.949 ± 2.155
LeuGly: 3.786 ± 1.151
LeuHis: 2.704 ± 2.293
LeuIle: 4.597 ± 1.559
LeuLys: 4.597 ± 1.414
LeuLeu: 14.602 ± 6.633
LeuMet: 1.352 ± 0.413
LeuAsn: 5.679 ± 2.308
LeuPro: 5.138 ± 1.265
LeuGln: 2.704 ± 1.487
LeuArg: 4.056 ± 1.016
LeuSer: 7.842 ± 3.955
LeuThr: 6.76 ± 1.974
LeuVal: 7.031 ± 2.935
LeuTrp: 0.27 ± 0.149
LeuTyr: 4.597 ± 2.97
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.515 ± 0.564
MetCys: 0.27 ± 0.447
MetAsp: 0.27 ± 0.149
MetGlu: 0.27 ± 0.447
MetPhe: 1.352 ± 0.743
MetGly: 1.082 ± 0.595
MetHis: 0.27 ± 0.149
MetIle: 0.27 ± 0.149
MetLys: 1.082 ± 0.595
MetLeu: 2.434 ± 0.933
MetMet: 0.811 ± 0.449
MetAsn: 1.352 ± 0.413
MetPro: 0.541 ± 1.134
MetGln: 0.27 ± 0.149
MetArg: 0.541 ± 0.297
MetSer: 2.163 ± 0.923
MetThr: 1.622 ± 0.614
MetVal: 0.541 ± 0.355
MetTrp: 0.811 ± 0.446
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.434 ± 1.02
AsnCys: 1.352 ± 1.564
AsnAsp: 2.434 ± 0.921
AsnGlu: 1.893 ± 1.041
AsnPhe: 1.622 ± 0.622
AsnGly: 2.163 ± 1.022
AsnHis: 0.811 ± 0.446
AsnIle: 2.704 ± 0.868
AsnLys: 1.622 ± 0.622
AsnLeu: 3.245 ± 0.813
AsnMet: 1.082 ± 0.625
AsnAsn: 1.893 ± 0.567
AsnPro: 2.975 ± 0.93
AsnGln: 0.541 ± 0.355
AsnArg: 1.082 ± 0.334
AsnSer: 3.515 ± 0.427
AsnThr: 2.704 ± 2.827
AsnVal: 3.786 ± 1.575
AsnTrp: 0.541 ± 0.297
AsnTyr: 1.893 ± 0.566
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.245 ± 0.617
ProCys: 1.352 ± 2.073
ProAsp: 5.408 ± 1.875
ProGlu: 2.434 ± 0.736
ProPhe: 3.245 ± 0.57
ProGly: 1.622 ± 0.847
ProHis: 1.622 ± 0.892
ProIle: 2.163 ± 0.782
ProLys: 2.975 ± 1.205
ProLeu: 5.138 ± 2.512
ProMet: 1.352 ± 0.432
ProAsn: 2.434 ± 1.423
ProPro: 4.056 ± 3.15
ProGln: 2.163 ± 0.827
ProArg: 3.245 ± 1.121
ProSer: 6.76 ± 1.767
ProThr: 0.811 ± 0.423
ProVal: 2.975 ± 0.984
ProTrp: 0.27 ± 0.149
ProTyr: 2.434 ± 0.921
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.163 ± 0.827
GlnCys: 0.541 ± 0.81
GlnAsp: 1.082 ± 0.759
GlnGlu: 0.27 ± 0.149
GlnPhe: 1.352 ± 0.669
GlnGly: 1.352 ± 0.669
GlnHis: 0.27 ± 0.882
GlnIle: 2.434 ± 0.609
GlnLys: 1.082 ± 0.595
GlnLeu: 3.515 ± 0.427
GlnMet: 0.541 ± 0.3
GlnAsn: 1.082 ± 0.595
GlnPro: 2.163 ± 0.896
GlnGln: 1.082 ± 0.759
GlnArg: 0.811 ± 0.446
GlnSer: 3.515 ± 1.407
GlnThr: 0.541 ± 0.775
GlnVal: 1.352 ± 0.86
GlnTrp: 0.27 ± 0.447
GlnTyr: 0.541 ± 0.297
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.893 ± 0.781
ArgCys: 1.622 ± 0.601
ArgAsp: 1.352 ± 0.743
ArgGlu: 1.893 ± 0.567
ArgPhe: 2.975 ± 1.166
ArgGly: 2.704 ± 0.692
ArgHis: 1.082 ± 0.595
ArgIle: 2.975 ± 0.594
ArgLys: 2.163 ± 0.896
ArgLeu: 6.49 ± 0.693
ArgMet: 1.622 ± 0.734
ArgAsn: 2.434 ± 1.02
ArgPro: 2.434 ± 1.27
ArgGln: 2.163 ± 0.677
ArgArg: 2.704 ± 0.627
ArgSer: 2.975 ± 1.865
ArgThr: 1.893 ± 0.859
ArgVal: 2.975 ± 1.244
ArgTrp: 0.811 ± 1.624
ArgTyr: 1.622 ± 0.892
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.49 ± 1.971
SerCys: 2.704 ± 1.337
SerAsp: 5.679 ± 0.88
SerGlu: 3.786 ± 1.453
SerPhe: 6.49 ± 1.212
SerGly: 4.327 ± 1.167
SerHis: 2.704 ± 0.68
SerIle: 5.949 ± 1.373
SerLys: 6.22 ± 1.919
SerLeu: 8.924 ± 1.744
SerMet: 1.352 ± 0.413
SerAsn: 1.352 ± 1.201
SerPro: 4.867 ± 2.043
SerGln: 2.975 ± 1.457
SerArg: 3.515 ± 0.947
SerSer: 10.546 ± 3.47
SerThr: 6.22 ± 2.075
SerVal: 7.842 ± 2.38
SerTrp: 1.082 ± 0.511
SerTyr: 3.515 ± 2.324
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.056 ± 1.267
ThrCys: 0.811 ± 0.794
ThrAsp: 3.245 ± 0.882
ThrGlu: 1.082 ± 0.334
ThrPhe: 2.704 ± 0.789
ThrGly: 2.163 ± 1.353
ThrHis: 2.163 ± 1.316
ThrIle: 2.163 ± 0.782
ThrLys: 1.893 ± 0.818
ThrLeu: 4.056 ± 3.282
ThrMet: 1.352 ± 0.413
ThrAsn: 2.434 ± 0.736
ThrPro: 3.245 ± 1.055
ThrGln: 2.163 ± 0.605
ThrArg: 2.434 ± 0.956
ThrSer: 7.842 ± 3.27
ThrThr: 5.679 ± 4.577
ThrVal: 3.245 ± 0.934
ThrTrp: 0.541 ± 0.297
ThrTyr: 1.082 ± 1.239
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.867 ± 0.496
ValCys: 2.163 ± 0.782
ValAsp: 4.327 ± 1.933
ValGlu: 3.786 ± 1.297
ValPhe: 4.056 ± 1.44
ValGly: 2.975 ± 1.244
ValHis: 1.893 ± 0.567
ValIle: 4.867 ± 0.865
ValLys: 4.327 ± 1.561
ValLeu: 7.301 ± 2.28
ValMet: 0.811 ± 0.446
ValAsn: 2.975 ± 0.93
ValPro: 5.679 ± 1.635
ValGln: 0.541 ± 0.297
ValArg: 2.704 ± 0.579
ValSer: 6.49 ± 0.487
ValThr: 4.867 ± 1.189
ValVal: 7.301 ± 2.496
ValTrp: 0.541 ± 0.775
ValTyr: 4.056 ± 0.466
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.541 ± 0.355
TrpGlu: 0.541 ± 0.775
TrpPhe: 0.811 ± 0.311
TrpGly: 0.27 ± 0.149
TrpHis: 0.0 ± 0.0
TrpIle: 1.622 ± 0.757
TrpLys: 0.27 ± 0.149
TrpLeu: 2.163 ± 0.957
TrpMet: 0.27 ± 0.149
TrpAsn: 0.0 ± 0.0
TrpPro: 0.811 ± 0.886
TrpGln: 0.27 ± 0.518
TrpArg: 1.082 ± 0.595
TrpSer: 0.0 ± 0.0
TrpThr: 0.27 ± 0.149
TrpVal: 0.27 ± 0.855
TrpTrp: 0.27 ± 0.149
TrpTyr: 0.811 ± 0.311
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.434 ± 0.582
TyrCys: 0.27 ± 0.149
TyrAsp: 4.056 ± 1.37
TyrGlu: 1.893 ± 1.041
TyrPhe: 2.163 ± 0.932
TyrGly: 2.434 ± 0.609
TyrHis: 1.082 ± 0.334
TyrIle: 2.704 ± 1.362
TyrLys: 1.352 ± 0.432
TyrLeu: 3.245 ± 0.481
TyrMet: 0.27 ± 0.149
TyrAsn: 1.893 ± 0.566
TyrPro: 0.811 ± 0.311
TyrGln: 1.622 ± 0.601
TyrArg: 2.704 ± 0.662
TyrSer: 3.515 ± 4.553
TyrThr: 1.622 ± 1.066
TyrVal: 4.867 ± 1.153
TyrTrp: 0.27 ± 0.149
TyrTyr: 1.622 ± 0.523
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (3699 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
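A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration: whole proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the resampled frequencies. The text does not say which spread measure the database uses, and the function names and one-letter pair notation here are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one two-letter pair among all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over n_rounds
    resampled proteomes, each drawn with replacement at the protein level.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of five short sequences.
toy_proteome = ["MALLSV", "LLAK", "MSSTV", "MKLLL", "MAVVS"]
print(pair_permille(toy_proteome, "LL"), bootstrap_error(toy_proteome, "LL"))
```

With only five proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with errors that are large relative to the frequencies themselves.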

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski