Amino acid dipepetide frequency for Norwalk virus (strain GI/Human/United States/Norwalk/1968) (Hu/NV/NV/1968/US)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.3AlaAla: 8.3 ± 2.369
0.395AlaCys: 0.395 ± 0.202
3.557AlaAsp: 3.557 ± 0.612
3.162AlaGlu: 3.162 ± 1.166
1.976AlaPhe: 1.976 ± 0.428
7.115AlaGly: 7.115 ± 3.448
0.395AlaHis: 0.395 ± 0.202
5.138AlaIle: 5.138 ± 1.889
3.162AlaLys: 3.162 ± 0.488
6.719AlaLeu: 6.719 ± 1.968
1.186AlaMet: 1.186 ± 0.296
2.767AlaAsn: 2.767 ± 0.428
6.324AlaPro: 6.324 ± 1.591
1.581AlaGln: 1.581 ± 0.713
4.743AlaArg: 4.743 ± 1.127
8.3AlaSer: 8.3 ± 2.884
4.743AlaThr: 4.743 ± 1.734
5.138AlaVal: 5.138 ± 1.955
0.791AlaTrp: 0.791 ± 0.403
1.976AlaTyr: 1.976 ± 0.677
0.0AlaXaa: 0.0 ± 0.0
Cys
0.791CysAla: 0.791 ± 0.403
0.395CysCys: 0.395 ± 0.202
1.581CysAsp: 1.581 ± 0.308
0.395CysGlu: 0.395 ± 0.202
1.186CysPhe: 1.186 ± 0.605
1.186CysGly: 1.186 ± 0.605
0.0CysHis: 0.0 ± 0.0
0.395CysIle: 0.395 ± 0.563
0.0CysLys: 0.0 ± 0.0
0.791CysLeu: 0.791 ± 0.402
0.395CysMet: 0.395 ± 0.563
0.395CysAsn: 0.395 ± 0.202
1.186CysPro: 1.186 ± 0.296
0.791CysGln: 0.791 ± 0.403
0.791CysArg: 0.791 ± 0.403
0.791CysSer: 0.791 ± 0.403
1.186CysThr: 1.186 ± 0.296
0.395CysVal: 0.395 ± 0.563
0.0CysTrp: 0.0 ± 0.0
0.395CysTyr: 0.395 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
3.953AspAla: 3.953 ± 0.769
1.186AspCys: 1.186 ± 0.605
2.767AspAsp: 2.767 ± 1.412
2.767AspGlu: 2.767 ± 1.412
2.767AspPhe: 2.767 ± 0.778
4.348AspGly: 4.348 ± 1.256
0.791AspHis: 0.791 ± 0.403
1.186AspIle: 1.186 ± 0.605
2.372AspLys: 2.372 ± 1.21
5.138AspLeu: 5.138 ± 1.019
1.581AspMet: 1.581 ± 0.807
1.186AspAsn: 1.186 ± 0.296
3.953AspPro: 3.953 ± 2.011
2.372AspGln: 2.372 ± 1.21
3.953AspArg: 3.953 ± 1.139
1.976AspSer: 1.976 ± 0.428
3.162AspThr: 3.162 ± 0.964
3.557AspVal: 3.557 ± 1.473
2.767AspTrp: 2.767 ± 0.428
1.581AspTyr: 1.581 ± 0.807
0.0AspXaa: 0.0 ± 0.0
Glu
4.348GluAla: 4.348 ± 1.988
0.0GluCys: 0.0 ± 0.0
3.162GluAsp: 3.162 ± 0.968
4.743GluGlu: 4.743 ± 2.421
3.557GluPhe: 3.557 ± 0.718
5.138GluGly: 5.138 ± 1.955
1.976GluHis: 1.976 ± 1.009
2.372GluIle: 2.372 ± 1.21
2.372GluLys: 2.372 ± 1.21
3.953GluLeu: 3.953 ± 1.359
2.767GluMet: 2.767 ± 1.014
2.372GluAsn: 2.372 ± 0.882
2.372GluPro: 2.372 ± 1.21
1.976GluGln: 1.976 ± 0.677
1.976GluArg: 1.976 ± 1.009
2.372GluSer: 2.372 ± 1.432
1.976GluThr: 1.976 ± 1.009
3.953GluVal: 3.953 ± 0.854
1.186GluTrp: 1.186 ± 0.605
1.976GluTyr: 1.976 ± 0.428
0.0GluXaa: 0.0 ± 0.0
Phe
1.186PheAla: 1.186 ± 1.619
0.791PheCys: 0.791 ± 0.403
2.372PheAsp: 2.372 ± 0.595
1.186PheGlu: 1.186 ± 0.296
1.186PhePhe: 1.186 ± 0.296
4.743PheGly: 4.743 ± 0.362
1.581PheHis: 1.581 ± 0.804
0.791PheIle: 0.791 ± 0.403
1.581PheLys: 1.581 ± 0.713
2.372PheLeu: 2.372 ± 1.916
1.186PheMet: 1.186 ± 0.882
0.791PheAsn: 0.791 ± 1.063
1.186PhePro: 1.186 ± 0.958
1.976PheGln: 1.976 ± 0.428
0.791PheArg: 0.791 ± 0.403
2.767PheSer: 2.767 ± 1.412
2.372PheThr: 2.372 ± 1.207
3.557PheVal: 3.557 ± 2.159
1.581PheTrp: 1.581 ± 0.807
2.372PheTyr: 2.372 ± 0.595
0.0PheXaa: 0.0 ± 0.0
Gly
5.138GlyAla: 5.138 ± 2.413
0.791GlyCys: 0.791 ± 0.402
4.348GlyAsp: 4.348 ± 1.017
6.324GlyGlu: 6.324 ± 0.476
3.953GlyPhe: 3.953 ± 1.487
6.719GlyGly: 6.719 ± 1.461
2.372GlyHis: 2.372 ± 0.595
3.557GlyIle: 3.557 ± 1.269
4.743GlyLys: 4.743 ± 1.756
9.091GlyLeu: 9.091 ± 3.29
2.372GlyMet: 2.372 ± 1.21
3.162GlyAsn: 3.162 ± 1.148
7.51GlyPro: 7.51 ± 1.568
2.372GlyGln: 2.372 ± 0.592
4.348GlyArg: 4.348 ± 1.874
5.138GlySer: 5.138 ± 2.898
4.348GlyThr: 4.348 ± 2.56
4.348GlyVal: 4.348 ± 0.21
1.186GlyTrp: 1.186 ± 0.296
1.976GlyTyr: 1.976 ± 1.47
0.0GlyXaa: 0.0 ± 0.0
His
1.976HisAla: 1.976 ± 0.78
0.0HisCys: 0.0 ± 0.0
1.186HisAsp: 1.186 ± 0.716
1.581HisGlu: 1.581 ± 0.807
0.395HisPhe: 0.395 ± 0.202
1.186HisGly: 1.186 ± 0.605
0.395HisHis: 0.395 ± 0.202
0.791HisIle: 0.791 ± 0.402
1.186HisLys: 1.186 ± 0.605
3.162HisLeu: 3.162 ± 2.314
0.395HisMet: 0.395 ± 0.202
1.186HisAsn: 1.186 ± 0.958
1.186HisPro: 1.186 ± 0.958
1.186HisGln: 1.186 ± 0.605
0.395HisArg: 0.395 ± 0.202
1.186HisSer: 1.186 ± 0.296
1.186HisThr: 1.186 ± 0.605
1.976HisVal: 1.976 ± 0.677
0.791HisTrp: 0.791 ± 0.403
1.186HisTyr: 1.186 ± 0.296
0.0HisXaa: 0.0 ± 0.0
Ile
2.767IleAla: 2.767 ± 0.669
0.0IleCys: 0.0 ± 0.0
1.581IleAsp: 1.581 ± 0.308
3.162IleGlu: 3.162 ± 0.968
1.186IlePhe: 1.186 ± 0.605
4.743IleGly: 4.743 ± 0.917
0.791IleHis: 0.791 ± 0.403
1.976IleIle: 1.976 ± 1.066
3.162IleLys: 3.162 ± 1.614
1.581IleLeu: 1.581 ± 0.807
1.581IleMet: 1.581 ± 0.308
3.953IleAsn: 3.953 ± 1.354
1.581IlePro: 1.581 ± 1.267
4.348IleGln: 4.348 ± 0.944
2.372IleArg: 2.372 ± 0.595
6.719IleSer: 6.719 ± 1.39
3.557IleThr: 3.557 ± 1.163
2.372IleVal: 2.372 ± 0.459
0.0IleTrp: 0.0 ± 0.0
1.976IleTyr: 1.976 ± 1.009
0.0IleXaa: 0.0 ± 0.0
Lys
2.767LysAla: 2.767 ± 0.778
0.395LysCys: 0.395 ± 0.202
2.767LysAsp: 2.767 ± 0.778
2.767LysGlu: 2.767 ± 1.412
1.186LysPhe: 1.186 ± 0.605
3.162LysGly: 3.162 ± 1.614
1.186LysHis: 1.186 ± 0.716
4.348LysIle: 4.348 ± 0.867
2.767LysLys: 2.767 ± 1.412
2.767LysLeu: 2.767 ± 1.412
1.581LysMet: 1.581 ± 0.308
3.953LysAsn: 3.953 ± 1.506
3.557LysPro: 3.557 ± 1.163
2.767LysGln: 2.767 ± 1.423
2.767LysArg: 2.767 ± 1.412
1.976LysSer: 1.976 ± 1.009
3.953LysThr: 3.953 ± 1.359
2.372LysVal: 2.372 ± 1.21
0.395LysTrp: 0.395 ± 0.202
0.791LysTyr: 0.791 ± 0.403
0.0LysXaa: 0.0 ± 0.0
Leu
7.51LeuAla: 7.51 ± 0.413
0.791LeuCys: 0.791 ± 0.403
3.162LeuAsp: 3.162 ± 0.964
5.138LeuGlu: 5.138 ± 1.955
2.372LeuPhe: 2.372 ± 2.271
8.696LeuGly: 8.696 ± 0.774
1.186LeuHis: 1.186 ± 0.958
3.162LeuIle: 3.162 ± 1.614
5.534LeuLys: 5.534 ± 2.155
7.905LeuLeu: 7.905 ± 0.821
0.0LeuMet: 0.0 ± 0.0
1.976LeuAsn: 1.976 ± 0.564
5.929LeuPro: 5.929 ± 2.049
3.953LeuGln: 3.953 ± 2.134
3.557LeuArg: 3.557 ± 1.269
6.719LeuSer: 6.719 ± 2.097
5.534LeuThr: 5.534 ± 0.841
6.324LeuVal: 6.324 ± 1.437
1.976LeuTrp: 1.976 ± 0.428
0.395LeuTyr: 0.395 ± 0.563
0.0LeuXaa: 0.0 ± 0.0
Met
3.953MetAla: 3.953 ± 0.769
1.186MetCys: 1.186 ± 0.605
1.581MetAsp: 1.581 ± 0.308
1.581MetGlu: 1.581 ± 0.807
0.791MetPhe: 0.791 ± 0.403
1.186MetGly: 1.186 ± 0.296
0.0MetHis: 0.0 ± 0.0
0.791MetIle: 0.791 ± 0.767
1.186MetLys: 1.186 ± 0.605
1.976MetLeu: 1.976 ± 0.677
2.372MetMet: 2.372 ± 0.592
0.395MetAsn: 0.395 ± 0.202
0.791MetPro: 0.791 ± 0.402
0.395MetGln: 0.395 ± 0.202
2.372MetArg: 2.372 ± 0.592
1.581MetSer: 1.581 ± 0.713
1.976MetThr: 1.976 ± 0.677
1.976MetVal: 1.976 ± 1.009
0.0MetTrp: 0.0 ± 0.0
1.186MetTyr: 1.186 ± 0.296
0.0MetXaa: 0.0 ± 0.0
Asn
4.348AsnAla: 4.348 ± 1.017
1.581AsnCys: 1.581 ± 0.807
1.976AsnAsp: 1.976 ± 0.428
1.976AsnGlu: 1.976 ± 1.009
1.581AsnPhe: 1.581 ± 0.804
4.743AsnGly: 4.743 ± 2.595
1.186AsnHis: 1.186 ± 0.605
0.0AsnIle: 0.0 ± 0.0
1.186AsnLys: 1.186 ± 0.605
4.348AsnLeu: 4.348 ± 3.366
1.581AsnMet: 1.581 ± 0.804
3.557AsnAsn: 3.557 ± 0.949
1.581AsnPro: 1.581 ± 1.267
1.581AsnGln: 1.581 ± 0.713
1.581AsnArg: 1.581 ± 0.721
2.372AsnSer: 2.372 ± 1.424
2.767AsnThr: 2.767 ± 1.313
1.976AsnVal: 1.976 ± 0.677
0.0AsnTrp: 0.0 ± 0.0
1.976AsnTyr: 1.976 ± 1.066
0.0AsnXaa: 0.0 ± 0.0
Pro
2.767ProAla: 2.767 ± 0.778
0.791ProCys: 0.791 ± 0.402
4.743ProAsp: 4.743 ± 2.413
5.138ProGlu: 5.138 ± 1.316
3.557ProPhe: 3.557 ± 1.681
5.929ProGly: 5.929 ± 1.312
2.372ProHis: 2.372 ± 1.207
5.138ProIle: 5.138 ± 1.666
1.581ProLys: 1.581 ± 0.807
5.534ProLeu: 5.534 ± 1.486
0.0ProMet: 0.0 ± 0.0
2.767ProAsn: 2.767 ± 2.074
5.138ProPro: 5.138 ± 0.923
2.767ProGln: 2.767 ± 1.073
1.976ProArg: 1.976 ± 0.78
4.743ProSer: 4.743 ± 1.106
4.743ProThr: 4.743 ± 1.189
3.162ProVal: 3.162 ± 1.148
1.186ProTrp: 1.186 ± 0.296
1.186ProTyr: 1.186 ± 0.716
0.0ProXaa: 0.0 ± 0.0
Gln
5.929GlnAla: 5.929 ± 0.595
0.0GlnCys: 0.0 ± 0.0
2.372GlnAsp: 2.372 ± 1.21
1.976GlnGlu: 1.976 ± 0.564
1.581GlnPhe: 1.581 ± 0.804
3.162GlnGly: 3.162 ± 0.968
0.0GlnHis: 0.0 ± 0.0
2.372GlnIle: 2.372 ± 1.21
1.186GlnLys: 1.186 ± 0.882
4.743GlnLeu: 4.743 ± 1.17
1.186GlnMet: 1.186 ± 0.296
2.767GlnAsn: 2.767 ± 1.313
0.791GlnPro: 0.791 ± 0.403
3.953GlnGln: 3.953 ± 0.151
3.162GlnArg: 3.162 ± 1.166
3.557GlnSer: 3.557 ± 1.166
1.976GlnThr: 1.976 ± 0.677
4.348GlnVal: 4.348 ± 1.988
0.395GlnTrp: 0.395 ± 0.202
0.791GlnTyr: 0.791 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
4.348ArgAla: 4.348 ± 0.944
0.395ArgCys: 0.395 ± 0.563
3.953ArgAsp: 3.953 ± 2.017
1.976ArgGlu: 1.976 ± 0.78
2.372ArgPhe: 2.372 ± 0.459
3.162ArgGly: 3.162 ± 0.474
0.395ArgHis: 0.395 ± 0.202
2.372ArgIle: 2.372 ± 0.459
3.953ArgLys: 3.953 ± 2.017
5.929ArgLeu: 5.929 ± 0.307
2.372ArgMet: 2.372 ± 0.49
1.581ArgAsn: 1.581 ± 1.519
3.162ArgPro: 3.162 ± 0.968
1.976ArgGln: 1.976 ± 1.009
2.372ArgArg: 2.372 ± 0.459
1.976ArgSer: 1.976 ± 1.47
2.372ArgThr: 2.372 ± 0.867
4.743ArgVal: 4.743 ± 0.383
0.791ArgTrp: 0.791 ± 0.403
1.976ArgTyr: 1.976 ± 1.47
0.0ArgXaa: 0.0 ± 0.0
Ser
4.743SerAla: 4.743 ± 2.758
0.791SerCys: 0.791 ± 0.402
2.767SerAsp: 2.767 ± 0.778
2.372SerGlu: 2.372 ± 0.595
1.581SerPhe: 1.581 ± 0.713
7.115SerGly: 7.115 ± 0.576
2.767SerHis: 2.767 ± 1.757
4.743SerIle: 4.743 ± 0.383
4.348SerLys: 4.348 ± 0.21
4.743SerLeu: 4.743 ± 0.362
2.372SerMet: 2.372 ± 0.595
2.372SerAsn: 2.372 ± 1.765
5.534SerPro: 5.534 ± 1.338
2.372SerGln: 2.372 ± 2.304
5.534SerArg: 5.534 ± 0.857
9.091SerSer: 9.091 ± 5.383
4.348SerThr: 4.348 ± 1.926
4.743SerVal: 4.743 ± 2.138
1.186SerTrp: 1.186 ± 1.467
0.791SerTyr: 0.791 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
5.929ThrAla: 5.929 ± 1.141
0.791ThrCys: 0.791 ± 1.127
1.186ThrAsp: 1.186 ± 0.716
3.557ThrGlu: 3.557 ± 0.718
1.186ThrPhe: 1.186 ± 0.296
3.953ThrGly: 3.953 ± 0.854
1.186ThrHis: 1.186 ± 0.296
4.348ThrIle: 4.348 ± 1.017
3.162ThrLys: 3.162 ± 1.614
4.743ThrLeu: 4.743 ± 1.734
1.186ThrMet: 1.186 ± 0.296
1.581ThrAsn: 1.581 ± 0.807
4.743ThrPro: 4.743 ± 1.851
3.557ThrGln: 3.557 ± 0.718
4.348ThrArg: 4.348 ± 1.148
4.743ThrSer: 4.743 ± 0.917
6.324ThrThr: 6.324 ± 0.976
4.743ThrVal: 4.743 ± 0.362
0.791ThrTrp: 0.791 ± 0.403
0.791ThrTyr: 0.791 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
3.953ValAla: 3.953 ± 1.487
2.372ValCys: 2.372 ± 0.595
5.534ValAsp: 5.534 ± 0.75
2.767ValGlu: 2.767 ± 1.313
2.372ValPhe: 2.372 ± 1.207
4.348ValGly: 4.348 ± 2.038
1.976ValHis: 1.976 ± 1.009
3.557ValIle: 3.557 ± 0.718
2.767ValLys: 2.767 ± 1.412
4.743ValLeu: 4.743 ± 1.184
1.976ValMet: 1.976 ± 0.412
1.976ValAsn: 1.976 ± 0.677
7.115ValPro: 7.115 ± 1.188
3.953ValGln: 3.953 ± 0.75
4.348ValArg: 4.348 ± 0.553
5.534ValSer: 5.534 ± 0.75
2.767ValThr: 2.767 ± 1.014
4.743ValVal: 4.743 ± 1.756
0.395ValTrp: 0.395 ± 0.862
1.581ValTyr: 1.581 ± 0.308
0.0ValXaa: 0.0 ± 0.0
Trp
0.395TrpAla: 0.395 ± 0.202
0.0TrpCys: 0.0 ± 0.0
1.581TrpAsp: 1.581 ± 0.807
0.0TrpGlu: 0.0 ± 0.0
0.395TrpPhe: 0.395 ± 0.202
0.791TrpGly: 0.791 ± 0.403
0.395TrpHis: 0.395 ± 0.563
1.186TrpIle: 1.186 ± 0.958
0.395TrpLys: 0.395 ± 0.563
0.791TrpLeu: 0.791 ± 0.767
0.395TrpMet: 0.395 ± 0.202
1.581TrpAsn: 1.581 ± 0.721
0.395TrpPro: 0.395 ± 0.202
0.791TrpGln: 0.791 ± 0.403
0.791TrpArg: 0.791 ± 0.403
1.581TrpSer: 1.581 ± 0.807
1.976TrpThr: 1.976 ± 1.009
1.976TrpVal: 1.976 ± 1.066
0.395TrpTrp: 0.395 ± 0.202
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.372TyrAla: 2.372 ± 1.21
0.395TyrCys: 0.395 ± 0.202
1.186TyrAsp: 1.186 ± 0.296
1.976TyrGlu: 1.976 ± 1.009
1.186TyrPhe: 1.186 ± 0.716
2.372TyrGly: 2.372 ± 0.595
1.581TyrHis: 1.581 ± 0.721
1.186TyrIle: 1.186 ± 0.296
1.581TyrLys: 1.581 ± 0.807
0.791TyrLeu: 0.791 ± 0.403
0.395TyrMet: 0.395 ± 0.202
1.581TyrAsn: 1.581 ± 0.804
1.186TyrPro: 1.186 ± 0.958
1.581TyrGln: 1.581 ± 1.698
0.395TyrArg: 0.395 ± 0.202
1.186TyrSer: 1.186 ± 1.619
1.581TyrThr: 1.581 ± 0.308
2.372TyrVal: 2.372 ± 0.592
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2531 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski